Mathematical Review

This is the basic mathematical definitions and calculation methods for Econometircs.

Best Predictor

Given a random vector XX, we want to forecast YY, Let g(X)g(X) be a predictor of YY. For Prediction Error, it is defined as Yg(X)Y - g(X), and this prediction error can be treated as a random variable, and it can take positive and negative values. To minimize this prediction error, we define the the mean squared error (MSE) of predictor g(X)g(X) as E[(Yg(X))2]\mathbb{E}[(Y-g(X))^2]. We can have that the CEF m(x)=E(YX=x)m(x) = \mathbb{E}(Y|X=x) is the best predictor, which has the smallest mean squared prediction error. Which means if we have E(Y2)<\mathbb{E}(Y^2) < \infin, then for any predictor g(X)g(X), we have:

E[(Yg(X))2]E[(Ym(X))2]\mathbb{E}[(Y-g(X))^2] \geq \mathbb{E}[(Y-m(X))^2]

Proof:

E[u2]=E[(Yg(X))2]=E[(Ym(X)+m(X)g(X))2]\mathbb{E}[u^2] = \mathbb{E}[(Y-g(X))^2]=\mathbb{E}[(Y-m(X) + m(X) -g(X))^2]

=E[(Ym(X))2]+E[(m(X)g(X))2]+2E[(Ym(X))(m(X)g(X))]=\mathbb{E}[(Y-m(X))^2] + \mathbb{E}[(m(X)-g(X))^2]+2\mathbb{E}[(Y-m(X))(m(X) -g(X))]

E[(Ym(X))2]\geq \mathbb{E}[(Y-m(X))^2]

since: E[(Ym(X))(m(X)g(X))]=E[E[(Ym(X))(m(X)g(X))X]]\mathbb{E}[(Y-m(X))(m(X) -g(X))] = \mathbb{E}[\mathbb{E}[(Y-m(X))(m(X) -g(X))|X]]

as under condition XX, m(X)g(X)m(X) -g(X) is no longer a random variable, by the definition of m(X)m(X)

E[E[(Ym(X))(m(X)g(X))X]]=E[(m(X)g(X))E[(Ym(X))X]]=0\mathbb{E}[\mathbb{E}[(Y-m(X))(m(X) -g(X))|X]] = \mathbb{E}[(m(X) -g(X))\mathbb{E}[(Y-m(X))|X]] = 0

So above inequality becomes equality when m(X)=g(X)m(X) = g(X), therefore, m(X)m(X) is the smallest.

Last updated