A Note on Collinearity Diagnostics and Centering Articles
Overview
published in
- AMERICAN STATISTICIAN Journal
publication date
- January 2018
start page
- 140
end page
- 146
volume
- 72
Digital Object Identifier (DOI)
International Standard Serial Number (ISSN)
- 0003-1305
Electronic International Standard Serial Number (EISSN)
- 1537-2731
abstract
- The usual approach for diagnosing collinearity proceeds by centering and standardizing the regressors. The sample correlation matrix of the predictors is then the basic tool for describing approximate linear combinations that may distort the conclusions of a standard least-square analysis. However, as indicated by several authors, centering may eventually fail to detect the sources of ill-conditioning. In spite of this earlier claim, there does not seem to be in the literature a fully clear explanation of the reasons for this bad potential behavior of the traditional strategy for analyzing collinearity. This note studies this issue in some detail. Results derived are motivated by the analysis of a well-known real dataset. Practical conclusions are illustrated with several examples.
Classification
keywords
- condition numbers; multiple linear regression; normalized linear combinations; variance inflation factors