The following three equations illustrate this decomposition. Estimate separate linear wage
regressions for individuals
i in groups
A and
B: : \begin{align} (1) \qquad \ln(\text{wages}_{A_i}) & = X_{A_i} \beta_A + \mu_{A_i} \\ (2) \qquad \ln(\text{wages}_{B_i}) & = X_{B_i} \beta_B + \mu_{B_i} \end{align} where
Χ is a vector of explanatory variables such as education, experience, industry, and occupation,
βA and
βB are vectors of coefficients and
μ is an
error term. Suppose we have the regression estimates \hat{\beta}_A, \hat{\beta}_B. Then, since the average value of
residuals in a linear regression is zero, we have the KOB decomposition as: : \begin{align} (3) \qquad & E(\ln(\text{wages}_A)) - E(\ln(\text{wages}_B)) \\[4pt] = {} & \hat{\beta}_A E(X_A) - \hat{\beta}_B E(X_B) \\[4pt] = {} & \hat{\beta}_A (E(X_A) - E(X_B)) + E(X_B) (\hat{\beta}_A - \hat{\beta}_B). \end{align} In the last line of (3), the first part is the impact of between-group differences in the explanatory variables
X, evaluated using the coefficients for group
A. The second part is the differential not explained by these differences in observed characteristics
X. ==Interpretation==