Should the independent (or dependent) variables in a linear regression model be normal or just...
ajakanvao
Answered
2022-11-23
Should the independent (or dependent) variables in a linear regression model be normal or just the residual?
Answer & Explanation
barene55d
Expert
2022-11-24Added 23 answers
Linear regression expresses a relationship between a response and covariates that is linear in terms of coefficients. In the simple case it associates one-dimensional response with one-dimensional as follows.
where and ϵ are considered as random variables and are coefficients (model parameters) to be estimated. Being a regression to the mean, the model specifies: with an implied assumption that and also constant. Thus, model restrictions are placed only on the conditional distribution of given , or equivalently on given . A convenient distribution used for residuals () is Normal/Gaussian, but the regression model, in general, works with other distributions as well. Not to confuse things further here, but it should still be noted that the regression analysis doesn't have to make any distributional assumptions. In estimation of the coefficients, for example, we use least squares method with no mention of any distributions. However, for more complex analysis, statisticians use various probability distributions to specify models, make assumptions explicit and use probability theory to justify results.