While L-estimators are not as efficient as other statistics, they often have reasonably high relative efficiency, and show that a large fraction of the information used in estimation can be obtained using only a few points – as few as one, two, or three. Alternatively, they show that order statistics contain a significant amount of information. For example, in terms of efficiency, given a
sample of a
normally-distributed numerical parameter, the
arithmetic mean (average) for the
population can be estimated with maximum efficiency by computing the
sample mean – adding all the members of the sample and dividing by the number of members. However, for a large
data set (over 100 points) from a symmetric population, the mean can be estimated reasonably efficiently relative to the best estimate by L-estimators. Using a single point, this is done by taking the
median of the sample, with no calculations required (other than sorting); this yields an efficiency of 64% or better (for all
n). Using two points, a simple estimate is the
midhinge (the 25%
trimmed mid-range), but a more efficient estimate is the 29% trimmed mid-range, that is, averaging the two values 29% of the way in from the smallest and the largest values: the 29th and 71st percentiles; this has an efficiency of about 81%. For three points, the
trimean (average of median and midhinge) can be used, though the average of the 20th, 50th, and 80th percentile yields 88% efficiency. Using further points yield higher efficiency, though it is notable that only 3 points are needed for very high efficiency. For estimating the standard deviation of a normal distribution, the scaled
interdecile range gives a reasonably efficient estimator, though instead taking the 7% trimmed range (the difference between the 7th and 93rd percentiles) and dividing by 3 (corresponding to 86% of the data of a normal distribution falling within 1.5 standard deviations of the mean) yields an estimate of about 65% efficiency. For small samples, L-estimators are also relatively efficient: the midsummary of the 3rd point from each end has an efficiency around 84% for samples of size about 10, and the range divided by \sqrt{n} has reasonably good efficiency for sizes up to 20, though this drops with increasing
n and the scale factor can be improved (efficiency 85% for 10 points). Other heuristic estimators for small samples include the range over
n (for
standard error), and the range squared over the median (for the chi-squared of a
Poisson distribution). ==See also==