Hi @PhamBao ,
Outlier detection and classification can be a deep, deep pool to dip your toes in to with a number of different approaches. One thing you should consider is how you are defining an outlier (that isn't too clear in your post): are you looking for an outlier that doesn't fit your model, or are the points in red areas that you have defined as outliers?
As a quick thought, here's some areas to try out:
- Cooks D Outliers - this defines how much your model coefficients would change if you remove a point - generally a threshold of any value over 4/n (n= no of rows) is a good place to start). This link gives a lot of guides as well as on Cooks D.
-Studentised Residual Plots - these give a good visualisation of the residuals from your rows and helps to define points that have entered a certain region as probable outliers.
Check both of these links here and here for more info.
Thanks,
Ben
“All models are wrong, but some are useful”