Subscribe Bookmark RSS Feed

Is there a simple way to clean big data with many different variables from outliers (gt or lt 4 stdev)?

konradkk

Community Trekker

Joined:

Jun 20, 2014

Dear All,

I am wondering if there is a simple way to clean many variables at the same time from the outliers (> or < 4 std.dev.) - each variable has different std.dev. For example, I have 77,000 records, 13 variables and I can do cleaning the data one variable at a time but maybe there is a way to clean all 13 variables from the outliers at the same time (but keeping in mind that each variable has a different std.dev.)?

Thank you in advance for any reply,

Konrad