Hi,
I have a simple question that I was wondering if people with more experience could shed some light on.
I know that OLS does not necessarily require normality assumption, but normality on the residuals leads to unbiased estimates with minimum variance.
I tried to fit a model on a continuous response with three independent variables using Least Squares in the Fit Model platform. This model had a Radj of about 0.45 and the residuals plot exhibited a clear pattern.
Next, I check the distribution of the continuous response and saw that it was far from normal.
Using Generalized Regression (Lasso) and Normal response distribution, the model still led to a Generalized Rsquare of about 0.45. However, when I switched to an exponential response distribution, the Rsquare increased to 0.84.
If normality on the response is not needed on Least Squares, what could explain such a difference?
Thank you!