In this very nice piece, Rob drops this bomb of mathematical knowledge:
It is not necessary to actually fit $n$ separate models when computing the CV statistic for linear models.

Say what?
Here is a broader excerpt and the method itself (after the jump).
While cross-validation can be computationally expensive in general, it is very easy and fast to compute LOOCV for linear models. A linear model can be written as
\[
\mathbf{Y} = \mathbf{X}\boldsymbol{\beta} + \mathbf{e}.
\]
Then
\[
\hat{\boldsymbol{\beta}} = (\mathbf{X}'\mathbf{X})^{-1}\mathbf{X}'\mathbf{Y}
\]
and the fitted values can be calculated using
\[
\hat{\mathbf{Y}} = \mathbf{X}\hat{\boldsymbol{\beta}} = \mathbf{X}(\mathbf{X}'\mathbf{X})^{-1}\mathbf{X}'\mathbf{Y} = \mathbf{H}\mathbf{Y},
\]
where $\mathbf{H} = \mathbf{X}(\mathbf{X}'\mathbf{X})^{-1}\mathbf{X}'$ is known as the “hat-matrix” because it is used to compute $\hat{\mathbf{Y}}$ (“Y-hat”).
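To make the algebra concrete, here is a minimal NumPy sketch (the toy data and names like `X`, `Y`, and `beta_hat` are my own, chosen to mirror the formulas above):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy data: n observations, design matrix with an intercept column.
n = 50
x = rng.normal(size=n)
X = np.column_stack([np.ones(n), x])      # design matrix X
Y = 2.0 + 3.0 * x + rng.normal(size=n)    # response Y

# beta_hat = (X'X)^{-1} X'Y -- np.linalg.solve avoids forming the inverse explicitly.
beta_hat = np.linalg.solve(X.T @ X, X.T @ Y)

# Hat matrix H = X (X'X)^{-1} X', so that Y_hat = H Y.
H = X @ np.linalg.solve(X.T @ X, X.T)

# H Y reproduces the fitted values X beta_hat.
assert np.allclose(H @ Y, X @ beta_hat)
```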


If the diagonal values of $\mathbf{H}$ are denoted by $h_1, \dots, h_n$, then the cross-validation statistic can be computed using


\[
\text{CV} = \frac{1}{n}\sum_{i=1}^n [e_{i}/(1-h_{i})]^2,
\]
where $e_i$ is the residual obtained from fitting the model to all $n$ observations. See Christensen’s book *Plane Answers to Complex Questions* for a proof. Thus, it is not necessary to actually fit $n$ separate models when computing the CV statistic for linear models. This remarkable result allows cross-validation to be used while only fitting the model once to all available observations.
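To see the shortcut in action, here is a sketch (again with made-up data, purely for illustration) that computes CV from the residuals and leverages of a single fit, then checks it against brute-force leave-one-out, which refits $n$ separate models:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy data, as in the sketch above.
n = 50
x = rng.normal(size=n)
X = np.column_stack([np.ones(n), x])
Y = 2.0 + 3.0 * x + rng.normal(size=n)

# One fit on all n observations.
H = X @ np.linalg.solve(X.T @ X, X.T)   # hat matrix
e = Y - H @ Y                           # residuals e_i from the full fit
h = np.diag(H)                          # leverages h_i

# CV = (1/n) * sum_i [e_i / (1 - h_i)]^2 -- no refitting required.
cv_fast = np.mean((e / (1.0 - h)) ** 2)

# Sanity check: brute-force LOOCV, fitting n separate models.
errs = []
for i in range(n):
    keep = np.arange(n) != i
    b_i = np.linalg.solve(X[keep].T @ X[keep], X[keep].T @ Y[keep])
    errs.append(Y[i] - X[i] @ b_i)
cv_slow = np.mean(np.square(errs))

assert np.allclose(cv_fast, cv_slow)    # identical up to floating point
```

Note that only the diagonal of $\mathbf{H}$ is needed, so for large $n$ the leverages can be obtained without forming the full $n \times n$ matrix (for instance, from a QR decomposition of $\mathbf{X}$).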


