The Wilcoxon Two-sample Statistic on Strongly Mixing processes

Kenneth H. Davis; R. E. Folsom; F. Ivy Carroll; Phillip C. Cooley; Lyle M. Retzlaff; Michael F. Weeks; H. Koo; Herbert H. Seltzman; E. Pellizzari; Deborah W. McFadden; Roger D. Austin; Martin B. Lee; Judith T. Lynch; Debra M. Fleischmann; Carol Place; Stephen D. Cooper; Rick L. Williams; Lillie B. Barber; Doris J. Rouse; Ivey A. McDaniel; Charlotte O. Scheper; Paul Kizakevich; Donna M. Jewell; Marion E. Deerhake; Cora B. Parker; Eugene P. Brantly; Phyllis D. Elkins; Patricia A. Cunningham; Robert S. Truesdale; Sara C. Wheeless; Robert M. Bray; Cynthia A. Salmons; Gary B. Howe; Coleen M. Northeim; Susan K. Myers; Gordon M. Cressman; Josephine A. Mauskopf; Tim J. Gabel; Luis Arturo Crouch; Celia D. Keller; Lisa E. Packer; Pamela B. Lamb; Keith White Little; Nathaniel F. Rodman; M. Owen; James P. Hayes; Daniel L. Winfield; Deborah A. Gibbs; Janice E. Kelly-Reid; Wayne G. Winstead; Cindy O. McClintock; Terrence K. Pierson; Johnny R. Albritton; Sherry L. Black; Jeffrey B. Coburn; Joe B. Simpson; Ed E. Rickman; James T. Hanley; R. Suresh; Barbara L. Kroner; K. Heller; James C. Blake; Barri B. Burrus; Anthony C. Clayton; Donna S. Womack; S. Mascarella; Scott A. Guthrie; Sheryl C. Cates; Albert D. Bethke; Suson F. VonLehmden; Kathryn L. Dowd; Daniel J. Pratt; Lisa J. McQuay; Matthew A. Koch; Jonathan T. Ennis; Robert A. Zerbonia; Nick L. Kinsey; Susan H. Kinsey; Christopher P. Carson; J. Newsome; S. Keesling; James A. O'Rourke; Kathryn R. Batts; Craig R. Hollingsworth; Priya Suresh; Joseph Wendell Wilson; Vorapranee Wickelgren; E. Andrew Jessup; Diana S. Goss; Sheri E. Fehnel; Gayle S. Bieler; Donna P. Coleman; R. Crawford; Jeanne Ann Snodgrass; Nellie I. Hansen; Michelle Lang; Deirdre M. Mladsi; Gary A. Zarkin; Rachel A. Caspar; Carol L. Woodell; RJ Serfling

The Wilcoxon Two-sample Statistic on Strongly Mixing processes

Serfling, RJ. (1968). The Wilcoxon Two-sample Statistic on Strongly Mixing processes. Annals of Mathematical Statistics, 39(4), 1202-1209.

Copy citation

Abstract

On the basis of independent samples $\{X_1, \cdots, X_m\}$ and $\{Y_1, \cdots, Y_n\}$ with distributions $F$ and $G$, respectively, the hypothesis that $F \equiv G$ may be tested. Given the functional forms $F(x_1, \cdots, x_m)$ and $G(y_1, \cdots, y_n)$ of the sampling distributions except for values of certain parameters, the likelihood ratio approach, for example, can be used. In this case it is not crucial to assume that the samples are random, i.e., that $F(x_1, \cdots, x_m) = F(x_1) \cdots F(x_m)$ and $G(y_1, \cdots, y_n) = G(y_1) \cdots G(y_n)$, although such a simplification is useful whenever realistic. However, the nonparametric treatment of the problem has relied heavily on the assumption of random samples. Yet if the samples arise as realizations of two stochastic processes, the assumption of randomness is not realistic except in the case of renewal processes. Thus it is desirable to extend the scope of established nonparametric procedures to more general applications. The present paper deals with the Wilcoxon two-sample statistic. Among the desirable features of this statistic, when defined on independent random samples, is its asymptotically normal distribution, which for large samples facilitates a test of the hypothesis that $F \equiv G$ and a calculation of the power for any alternative $(F, G)$. It shall be seen that these aspects are true also when the samples arise from stochastic processes belonging to a wide class, including strictly stationary strongly mixing processes. Assume that the samples $\{X_1, \cdots, X_m\}$ and $\{Y_1, \cdots, Y_n\}$ are independent of each other, but let the random variables within a sample be possibly dependent. Assume that the functions $F(\cdot)$ and $G(\cdot)$ are continuous. The hypothesis $H: F \equiv G$ may be tested (conservatively) by testing the hypothesis $H_0: \gamma = 0$, where $\gamma = 2P\{Y > X\} - 1$. A representation of the Wilcoxon two-sample statistic is the $U$-statistic with sign function as kernel, \begin{equation*}\tag{1.1}U = (mn)^{-1}\sum^m_{i=1} \sum^n_{j=1} s(Y_j - X_i),\end{equation*} where $s(u) = -1, 0, 1$ according as $u < 0, = 0, > 0$. Since $Es(Y - X) = \gamma$, the statistic $U$ affords a natural basis for testing $H_0$. Under appropriate conditions, the statistic $Z = m^{\frac{1}{2}}(U - \gamma)$ has a limiting normal distribution with mean 0 and variance \begin{equation*}\tag{1.2}A^2 = 4 \lim_{k\rightarrow\infty} k^{-1} \operatorname{Var}\lbrack\sum^k_{i=1} G(X_i)\rbrack + 4c \lim_{k\rightarrow\infty}k^{-1} \operatorname{Var}\lbrack\sum^k_{i=1} F(Y_i)\rbrack,\end{equation*} as $m$ and $n \rightarrow \infty$ such that $m/n$ has a limit $c \neq 0$. The main conclusions of this nature are given in Theorems 3.1 and 3.2. Some areas of application are indicated in Section 4. The business of dealing with the quantity $A^2$ is discussed in Section 5. The limiting behavior of $Z$ is obtained by consideration of a statistic asymptotically equivalent in distribution but more amenable to the direct application of central limit theory, an approach put forth by Hoeffding [3] in dealing with a wide class of $U$-statistics as defined on a single sample of mutually independent rv's. The present contribution adapts the method to a single, but important, (two-sample) $U$-statistic with dependence allowed within samples. Define: \begin{equation*}\tag{1.3}W = m^{-\frac{1}{2}} \sum^m_{i=1} \lbrack f_{10}(X_i) - \gamma\rbrack + m^{\frac{1}{2}}n^{-1} \sum^n_{j=1} \lbrack f_{01} (Y_j) - \gamma\rbrack,\end{equation*} where $f_{10}(t) = Es(Y - t) = 1 - 2G(t)$ and $f_{01}(t) = Es(t - X) = 2F(t) - 1$. Since $Ef_{10}(X) = Ef_{01}(Y) = \gamma$, we have $EW = E(Z - W) = 0$. In Section 2 we find conditions such that $E(Z - W)^2 \rightarrow 0$, in which case it follows by Chebyshev's inequality that $(Z - W) \rightarrow 0$ in probability and hence that the statistics $Z$ and $W$ have the same limiting distribution (if any). The application of central limit theory to $W$ is through the sums $\sum^m_1 f_{10}(X_i)$ and $\sum^n_1 f_{01}(Y_j)$, or equivalently through $m^{-\frac{1}{2}} \sum^m_1 G(X_i)$ and $n^{-\frac{1}{2}}\sum^n_1 F(Y_j)$. If each of these independent normed sums has a limiting normal distribution, then $W$ is asymptotically normal, as $m$ and $n \rightarrow \infty$ such that $m/n \rightarrow c \neq 0$. Relevant central limit theorems for sums of dependent variables are utilized in Section 3

Meet the Experts

Navigate to Barbara L. Kroner

James Blake

Recent Publications

Article

Communicating therapeutic indication information in direct-to-consumer television ads for prescription cancer drugs

March 01, 2025

Article

Patient experience with acute hepatic porphyria before and after long-term givosiran treatment in a qualitative interview study

March 01, 2025

Article

Satisfaction with and adherence to off-label corticosteroids in adolescents and adults with eosinophilic esophagitis

February 01, 2025

Article

Situating public management's contributions to social equity

February 01, 2025

View All Publications

The Wilcoxon Two-sample Statistic on Strongly Mixing processes

Abstract

Meet the Experts

Barbara L. Kroner

Barri Burrus

Dan Pratt

Donna Womack

Gary A. Zarkin

Gayle Bieler

J. Todd Ennis

James Blake

Recent Publications

Communicating therapeutic indication information in direct-to-consumer television ads for prescription cancer drugs

Patient experience with acute hepatic porphyria before and after long-term givosiran treatment in a qualitative interview study

Satisfaction with and adherence to off-label corticosteroids in adolescents and adults with eosinophilic esophagitis

Situating public management's contributions to social equity