In
item response theory, person "locations" (measures of some quality being assessed by a test) are estimated on an
interval scale; i.e., locations are estimated in relation to a unit and origin. It is common in
educational assessment to employ tests in order to assess different groups of students with the intention of establishing a common scale by equating the origins, and when appropriate also the units, of the scales obtained from response data from the different tests. The process is referred to as equating or test equating. In item response theory, two different kinds of equating are horizontal and vertical equating. Vertical equating refers to the process of equating tests administered to groups of students with different abilities, such as students in different grades (years of schooling). Horizontal equating refers the equating of tests administered to groups with similar abilities; for example, two tests administered to students in the same grade in two consecutive calendar years. Different tests are used to avoid practice effects. In terms of item response theory, equating is just a special case of the more general process of
scaling, applicable when more than one test is used. In practice, though, scaling is often implemented separately for different tests and then the scales subsequently equated. A distinction is often made between two methods of equating;
common person and
common item equating. Common person equating involves the administration of two tests to a common group of persons. The mean and standard deviation of the scale locations of the group on the two tests are equated using a linear transformation. Common item equating involves the use of a set of common items referred to as the
anchor test embedded in two different tests. The mean item location of the common items is equated. == Observed-score equating ==