6. 3 A Nonpararnetric Alternative The Wilcoxon Rank Sum Test The two-sample t test of the previous section was based on several conditions: independent samples, normality, and equal variances. When the conditions of normality and equal variances are not valia but the sample sizes are large, the ~.. . ~ . Wilcoxon rank sum test – ; . . ::::[h, 1 FIGURE

6. 7 Skewed population distributions identical in shape but shifted 0. 08 0. 06 0. 04 0. 02 0. 0 0 10 20 30 ” y, value ofrandom vasiable results using a r (or 1′) test are approximately correct.

There IS, however, an alternative test procedure that requires less stringent conditions. This procedure, — the Wilcoxon rank sum test, IS discussed here. called The assumptions for this test are that we have independent random samples8 taken from two populations whose distributions are identical except that one distribution may be shifted to the right of the other distribution, as shown in1 Figure

6. 7. T h e Wilcoxon rank sum test does not require that populations have( normal distributions.

Thus, we have removed one of the three conditions that/ were required of the t-based procedures. The other conditions, equal variancesi and independence of the random samples, are still required for the Wilcoxon rank sum test. Because the two population distributions are assumed to be identical. under the null hypothesis, independent random samples from the two populations1 should be similar if the null hypothesis is true. Because we are now allowing thei population distributions to be nonnormal, the rank sum procedure must deal with1 the possibility of extreme observations in the data.

One way to handle samples containing extreme values is t o replace each data value with its rank (from lowest to highest) in the combined sample-that is, the sample consisting of the data’ from both populations. T h e smallest value in the combined sample is assigned the rank of 1 and the largest value is assigned the rank of N = n, + ni.

The ranks are not affected by how far the smallest (largest) data value is from next smalles4 I (largest) data value. Thus, extreme values in data sets do not have a strong e f f e a I i on the rank sum statistic as they did in the 1-based procedures.

~~ i I 1 T h e calculation of the rank sum statistic consists of the following steps: 1. List the data values for both samples from smallest to largest. 2 In the next column, assign the numbers 1 to N to the data values . 1 to the smallest value and N to the largest vaiue. These are the ran of the observations. 3. If there are ties-that is, duplicated values-in the combined data set the ranks for the observations in a tie are taken to be the average of the ranks for those observations. 4. Let T denote the sum of the ranks for the observations from population 1. i I I ranks

If the null hypothesis of identical population distributions is true, the n , ranks from population 1 are just a random sample from the iV integers 1, . . . , N. Thus, under the null hypothesis, the distribution of the sum of the ranks Tdepends only on the sample sizes, n , and n ~ and does not depend on the shape of the , population distributions. Under the null hypothesis, the sampling distribution of T has mean and variance given by Intuitively, if T is much smaller (or larger) than py. we have evidence that the null hypothesis is false and in fact the population distributions are not equal.

The rejection region for the rank sum test specifies the size of the difference between T and pr for the null hypothesis to be rejected. Because the distribution of T under the null hypothesis does not depend on the shape of the population distributions, Table 5 provides the critical values for the test regardless of the shape of the population distribution. The Wilcoxon rank sum test is summarized here. L W)koxon Rank Sum Test* Ho: The two populations are identical. H. : 1. Population 1 is shifted to the right of population 2. 2. Population 1is shifted to the left of population

2.3. Populations I and 2 are shifted from each other. 3 (n, 5 10, n2 c 10) T. S. : T, the sum of the ranks in sample 1 R. R. : For a = . 05, use Table 5 in the Appendix to find critical values for Tu and TL; 1. Reject H i if T > T,. 2. Reject Ho if T < TL. 3. Reject HOif T > Tu or T < TL. Wilcoxon Rank Sum Test: n, > 10 and n, > 10 T-,LT >\ T. S. : z = . where Xdenotes the sum of the ranks in sample 1: ’71 R. R. : For a specified value of a : , 1. Reject HOif 7 2 z . 2. Reject Ho if 7 c -2.. , EXAMPLE 6. 4 3. Reject HOif z Placebo O. 90 2 z. ,~. b 0. 3 1. 45 1. 63 1. 76 0. 83

11 . 1 0. 95 1. 11 0. 78 3. 0: 0. 86 0. 98 0. 61 1. 77 0. 38 1. 46 2. 36 i Many states are considering lowering the blood-alcohol level at which a driver is a legislative committee designed the following test to study the effect of alcohol 1 designated as driving under the influence (DUI) of alcohol. A n investigator for 1 / on reaction time. Ten participants consumed a specified amount of alcohol. An1 I i other group of ten participants consumed the same amount of a nonalcoholic drink, a placebo. The two groups did not know whether they were receiving 1 alcoholor the placebo.

The twenty participants’ average reaction times (inseconds) / to a series of simulated driving situations are reported in the following table. Does / it appear that alcohol consumption increases reaction time? a. Why is the t test inappropriate for analyzing the data in this study’? b. Use the Wilcoxon rank sum test to test the hypotheses: Ho: The distributions of reaction times for [he placebo and alto- hol populations are identical. .f o Ha: The distribution of reaction times for the placebo consumption population is shifted to the left of the distribution for the alcohol population.

(Larger reaction times are associated with the consumption of alcohol. ) Solution 2 0 Placebo population Alcohol a. A boxplot of the two samples is given here. The plots indicate that the population distributions are skewed to the right, because 10% of the data values are large outliers and the upper whiskers are longer than the lower whiskers. The sample sizes are both small, and hence the t test may be inappropriate for analyzing this study. b. The Wilcoxon rank sum test will be conducted to evaluate whether alcohol consumption increases reaction time. Table 6.

6 contains the ordered data for the combined samples, along with their associated ranks. We will designate observations from the placebo Eroup as 1 an from the alcohol group as 2. For a = . 05, reject Ho if T < 83, using Table 5 in the Appendix with a = . 05, one-tailed. and n, = n2 = 10. The value of T is compute by summing the ranks from group 1: T = l + 2 + 3 + 4 + 5 + 6 + , 7 + 8 + 16 + 18 = 70. Because 70 is less than 83, we reject Ho and conclude there is significant evidence that the placebo population has: . . . . . smaller reaction times than the population of alcohol consumers. .