An Outlier is an observation in the dataset that is far away from other observations in the dataset. Tools used to discover outliers are
- Box plot
- Z-score
- Scatter plot, etc.
Typically, we need to follow three simple strategies to handle outliers:
- We can drop them.
- We can mark them as outliers and include them as a feature.
- Likewise, we can transform the feature to reduce the effect of the outlier.