Is there a simple way to remove outliers (more than 4 std. dev. above or below the mean) from a large data set with many variables?
Oct 10, 2014 1:08 PM
I am wondering whether there is a simple way to remove outliers from many variables at once, where an outlier is a value more than 4 standard deviations above or below the mean of its variable. Each variable has its own standard deviation. For example, I have 77,000 records and 13 variables. I can clean the data one variable at a time, but is there a way to clean all 13 variables in one pass while still using each variable's own standard deviation?
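To make the question concrete, here is a rough sketch of the kind of one-pass cleaning I have in mind, written in plain Python with only the standard library. Everything in it is an assumption for illustration: the function names `outlier_bounds` and `remove_outliers`, the column names `a` and `b`, the choice to drop the whole record when any one variable is out of range (rather than setting just that value to missing), and the use of the sample standard deviation. It also assumes every value is numeric with no missing data.

```python
from statistics import mean, stdev


def outlier_bounds(rows, columns, k=4.0):
    """Return {column: (low, high)} from each column's own mean and sample std. dev."""
    bounds = {}
    for col in columns:
        values = [row[col] for row in rows]
        m, s = mean(values), stdev(values)
        bounds[col] = (m - k * s, m + k * s)
    return bounds


def remove_outliers(rows, columns, k=4.0):
    """Keep only rows whose value lies within k std. dev. of the mean in every column.

    Assumption: one out-of-range value is enough to drop the whole record.
    The bounds are computed once, on the full data, before any row is dropped.
    """
    bounds = outlier_bounds(rows, columns, k)
    return [
        row
        for row in rows
        if all(bounds[c][0] <= row[c] <= bounds[c][1] for c in columns)
    ]


# Hypothetical data: 50 ordinary records plus one with an extreme value in "b".
rows = [{"a": float(i % 5), "b": 100.0 + (i % 3)} for i in range(50)]
rows.append({"a": 2.0, "b": 500.0})

clean = remove_outliers(rows, ["a", "b"])
print(len(rows), len(clean))
```

Because the limits are computed per column before filtering, each variable is judged against its own mean and standard deviation, and all variables are handled in the same pass. Note that a single extreme value inflates the standard deviation it is measured against, so in small samples a 4 std. dev. rule may never flag anything.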