Under the classical assumptions, ordinary least squares is the
best linear unbiased estimator (BLUE), i.e., it is unbiased and efficient. It remains unbiased under heteroskedasticity, but efficiency is lost. Before deciding upon an estimation method, one may conduct the Breusch–Pagan test to examine the presence of heteroskedasticity. The Breusch–Pagan test is based on models of the type \sigma_i^2 = h(z_i'\gamma) for the variances of the observations where z_i = (1, z_{2i}, \ldots, z_{pi}) explain the difference in the variances. The null hypothesis is equivalent to the (p - 1)\, parameter restrictions: : \gamma_2 = \cdots = \gamma_p = 0. The following
Lagrange multiplier (LM) yields the
test statistic for the Breusch–Pagan test: : \text{LM}=\left (\frac{\partial\ell}{\partial\theta} \right )^{\mathsf{T}} \left (-E\left [\frac{\partial^2\ell}{\partial\theta\, \partial\theta'} \right ] \right )^{-1} \left(\frac{\partial\ell}{\partial\theta} \right ). This test can be implemented via the following three-step procedure: •
Step 1: Apply OLS in the model :: y_i = X_i\beta+\varepsilon_i, \quad i=1,\dots{},n •
Step 2: Compute the regression residuals, \hat{\varepsilon}_i, square them, and divide by the Maximum Likelihood estimate of the error variance from the Step 1 regression, to obtain what Breusch and Pagan call g_i: :: g_i = \hat{\varepsilon}_i^2 / \hat{\sigma}^2, \quad \hat{\sigma}^2 = \sum{\hat{\varepsilon}_i^2}/n •
Step 2: Estimate the auxiliary regression :: g_i=\gamma_1+\gamma_2z_{2i}+\cdots+\gamma_pz_{pi}+\eta_i. where the
z terms will typically but not necessarily be the same as the original covariates
x. •
Step 3: The LM test statistic is then half of the explained sum of squares from the auxiliary regression in Step 2: :: \text{LM}=\frac{1}{2}\left(\text{TSS} - \text{RSS}\right). where TSS is the sum of squared deviations of the g_i from their mean of 1, and RSS is the sum of squared residuals from the auxiliary regression. The test statistic is
asymptotically distributed as \chi^2_{p - 1} under the
null hypothesis of homoskedasticity and normally distributed \varepsilon_i, as proved by Breusch and Pagan in their 1979 paper. == Robust variant ==