Duncan's new multiple range test

Assumptions: 1.A sample of observed means m_{1},m_{2},...,m_{n} , which have been drawn independently from n normal populations with "true" means, \mu_{1},\mu_{2},...,\mu_{n} respectively. 2.A common standard error \sigma . This standard error is unknown, but there is available the usual estimate s_{m} , which is independent of the observed means and is based on a number of degrees of freedom, denoted by n_{2} . (More precisely, S_{m}, has the property that \frac{n_{2}\cdot S_{m}^2}{\sigma^2_{m}} is distributed as \chi^2 with n_2 degrees of freedom, independently of sample means). The exact definition of the test is: The difference between any two means in a set of n means is significant provided the range of each and every subset which contains the given means is significant according to an \alpha_{p} level range test where \alpha_p=1-\gamma_p , \gamma_p =(1-\alpha)^{(p-1)} and p is the number of means in the subset concerned. Exception: The sole exception to this rule is that no difference between two means can be declared significant if the two means concerned are both contained in a subset of the means which has a non-significant range. Procedure The procedure consists of a series of pairwise comparisons between means. Each comparison is performed at a significance level \alpha_{p} , defined by the number of means separating the two means compared (\alpha_p for p-2 separating means). The test are performed sequentially, where the result of a test determines which test is performed next. The tests are performed in the following order: the largest minus the smallest, the largest minus the second smallest, up to the largest minus the second largest; then the second largest minus the smallest, the second largest minus the second smallest, and so on, finishing with the second smallest minus the smallest. With only one exception, given below, each difference is significant if it exceeds the corresponding shortest significant range; otherwise it is not significant. Where the shortest significant range is the significant studentized range, multiplied by the standard error. The shortest significant range will be designated as R_{(p,\alpha)} , where p is the number means in the subset. The sole exception to this rule is that no difference between two means can be declared significant if the two means concerned are both contained in a subset of the means which has a non-significant range. An algorithm for performing the test is as follows: 1.Rank the sample means, largest to smallest. 2. For each m_{i} sample mean, largest to smallest, do the following: 2.1 for each sample mean, (denoted m_{j}), for smallest up to m_{(i-1)} . 2.1.1 compare m_i -m_j to critical value \sigma_m \cdot R_{(p,\alpha)} , P=i-j, \alpha=\alpha_p 2.1.2 if m_i-m_j does not exceed the critical value, the subset (m_j , m_{j+1},...,m_{I}) is declared not significantly different: 2.1.2.1 Go to next iteration of loop 2. 2.1.3 Otherwise, keep going with loop 2.1 Critical values Duncan's multiple range test makes use of the studentized range distribution in order to determine critical values for comparisons between means. Note that different comparisons between means may differ by their significance levels- since the significance level is subject to the size of the subset of means in question. Let us denote Q_{(p,\nu,\gamma_{(p,\alpha)})} as the \gamma_{\alpha} quantile of the studentized range distribution, with p observations, and \nu degrees of freedom for the second sample (see studentized range for more information). Let us denote r_{(p,\nu,\alpha)} as the standardized critical value, given by the rule: If p=2 r_{(p,\nu,\alpha)}= Q_{(p,\nu,\gamma_{(p,\alpha)})} Else r_{(p,\nu,\alpha)}= max( Q_{(p,\nu,\gamma_{(p,\alpha)})}, r_{(p-1,\nu,\alpha)} ) The shortest critical range, (the actual critical value of the test) is computed as : R_{(p,\nu,\alpha)}=\sigma_{m} \cdot r_{(p,\nu,\alpha)}. For \nu->∞, a tabulation exists for an exact value of Q (see link). A word of caution is needed here: notations for Q and R are not the same throughout literature, where Q is sometimes denoted as the shortest significant interval, and R as the significant quantile for studentized range distribution (Duncan's 1955 paper uses both notations in different parts). == Numeric example ==