I would like to know something easy but very important.
Imagine I have a database with 0 NA, a perfect database who has been clean. And I have to do a PCA on this database. This datebase got a lot of individuals and variables ( 95 individuals and 10 variables)
I have to do a multiple regression and a PCA.
I must start per my multiple regression and eventually delete somme individuals who has been a Cook's distance > at the limit. And after I do my PCA on " new data base"
OR I must start per my PCA on my complete database, and after I do my multiple regression.
In conclusion, I must do :
- PCA
- multiple Regression
or
-multiple Regression
-PCA
Ty for helping me !