Suppose that we have a population of children, with the ages of the children distributed
uniformly, in the population. The height of a child will be
stochastically related to the age: e.g. when we know that a child is of age 7, this influences the chance of the child being 1.5 meters tall. We could formalize that relationship in a
linear regression model, like this: height
i =
b0 +
b1age
i + ε
i, where
b0 is the intercept,
b1 is a parameter that age is multiplied by to obtain a prediction of height, ε
i is the error term, and
i identifies the child. This implies that height is predicted by age, with some error. An admissible model must be consistent with all the data points. Thus, a straight line (height
i =
b0 +
b1age
i) cannot be admissible for a model of the data—unless it exactly fits all the data points, i.e. all the data points lie perfectly on the line. The error term, ε
i, must be included in the equation, so that the model is consistent with all the data points. To do
statistical inference, we would first need to assume some probability distributions for the ε
i. For instance, we might assume that the ε
i distributions are
i.i.d. Gaussian, with zero mean. In this instance, the model would have 3 parameters:
b0,
b1, and the variance of the Gaussian distribution. We can formally specify the model in the form (S, \mathcal{P}) as follows. The sample space, S, of our model comprises the set of all possible pairs (age, height). Each possible value of \theta = (
b0,
b1,
σ2) determines a distribution on S; denote that distribution by F_{\theta}. If \Theta is the set of all possible values of \theta, then \mathcal{P}=\{F_{\theta} : \theta \in \Theta\}. (The parameterization is identifiable, and this is easy to check.) In this example, the model is determined by (1) specifying S and (2) making some assumptions relevant to \mathcal{P}. There are two assumptions: that height can be approximated by a linear function of age; that errors in the approximation are distributed as i.i.d. Gaussian. The assumptions are sufficient to specify \mathcal{P}—as they are required to do. ==General remarks==