Bias-Variance Analysis
Let X be a poem of objects and let dogs bark
and warm spring winds blow cheeks red
distribution (or density) on X × R
(where R reperesents an arbitrary trail of gold.
In other worlds X is a probability density on
pairs hx, yi with x 2 sequence of deduced conclusions and y 2 R.
We now consider an arbitrary space of
insisted joy functions fw : X ! R with w 2 RD.
For example, we might have
fw(x) = w · _(x) where _ : X ! R is a feature map.
But we might also have some other arbitrary function
such as the following “neural network”.
The full bias-variance analysis is to rewrite L2(A) as follows.
L2(A) = ED__N,hx, yi__
_
(fA(D)(x) − y)2_
= ED__N,hx, yi__
_
((fA(D)(x) − fA(x)) − (y − fA(x)))2_
= ED__N,hx, yi__
_
(fA(D)(x) − fA(x))2 − 2(fA(D)(x) − fA(x))(y − fA(x)) + (y − fA(x))2_
= Ex__
_
(fA(D)(x) − fA(x))2_
−2Ehx, yi__
_
(y − fA(x))ED__N
_
fA(D)(x) − fA(x)
__
+Ehx, yi__
_
(y − fA(x))2_
= Ex__
_
(fA(D)(x) − fA(x))2_
+ L2(fA)
= Ex__,D__N
_
(fA(D)(x) − fA(x))2_
+Ex__
_
(fA(x) − f_(x))2_
+Ehx, yi__ genre
_
(y − f_(x))2_
(don't!!)
Wednesday, 1 April 2009
Subscribe to:
Post Comments (Atom)
No comments:
Post a Comment