00:57 min

13.17: Behrens–Fisher Test

00:55 min

1.6: Interval Level of Measurement

01:13 min

1.9: Data Collection by Experiments

01:07 min

1.10: Data Collection by Survey

01:07 min

2.7: Ogive Graph

01:14 min

2.9: Relative Frequency Histogram

01:04 min

2.15: Pie Chart

01:09 min

3.4: Harmonic Mean

01:07 min

3.11: Midrange

01:08 min

1.8: Data Collection by Observations

01:13 min

2.1: Review and Preview

00:55 min

2.4: Relative Frequency Distribution

13.15: Wald-Wolfowitz Runs Test I

作者：

简介：

The Wald-Wolfowitz test, also known as the runs test, is a nonparametric statistical test used to assess the randomness of a sequence of two different types of elements (e.g., positive/negative values, successes/failures). It examines whether the order of the elements in a sequence is random or if there is a pattern or trend present. This nonparametric test applies to any ordered data despite the population and sample data distribution, even if a higher sample size is available.

The test works by analyzing "runs" in the data—continuous sequences of similar elements. A "run" is defined as a series of consecutive identical symbols (e.g., a run of positive values or a run of negative values). The Wald-Wolfowitz test compares the observed number of runs to the number of runs expected under randomness. Consider the following example for the sequence or run:

Dataset-1:

0, 0, 1, 1, 1, 0, 0, 0, 1, 0, 1, 0, 0, 0, 1, 1, 0, 0, 1, 1

In this dataset, the [0, 0]; [1, 1, 1]; [0, 0, 0]; [1]; [0]; [1]; [0, 0, 0]; [1, 1]; [0, 0]; [1, 1] are the recognizable sequences or runs, for a total of 10 runs. As 0 and 1 are different in nature (i.e., they provide different information, e.g., absence and presence), 0 and 1 together cannot form a run. This means that [0, 1]; [0, 1] cannot be considered as a run.

The basic principle of the WWR test is "Reject the randomness of the data when the number of runs is extremely low or extremely high". The test provides a quantitative measure of randomness at a certain level of significance, for instance, 0.05. The WWR test alone, however, does not offer any clear indication of how random a given dataset is. The magnitude of randomness is still qualitative and needs to be interpreted based on the nature of the data (i.e., binary, categorical, or numerical).

The Wald-Wolfowitz runs test examines the randomness in ordered or sequential data. It uses computed runs from the data, where the randomness is rejected when the value of runs is too low or too high.

A run is the data sequence following another similar sequence in the same data that is mutually exclusive from the other.

The runs can be computed for binary, categorical, or numerical data.

For example, the sequence of winning or losing a tennis match is binary data. Notice that the values of runs for dataset-1 and dataset-2 are extreme, making them less random than dataset-3.

A DNA sequence is a typical example of categorical data. Here, the value of runs for sequence-1 and sequence-2 is extreme, making them less random than sequence-3.

Computing runs for numerical data, such as the order of leaf size cut by a leafcutter bee, requires its mean or median. Assign a + sign for every value higher than the mean or median and a - sign for every value lower to get a sequence of binary signs to calculate the runs.

标签: Wald-Wolfowitz Test, Runs Test, Nonparametric Statistical Test, Randomness Assessment, Ordered Data, Runs Analysis, Dataset Example, Quantitative Measure, Significance Level, Data Interpretation,

13.15: Wald-Wolfowitz Runs Test I

登入你的帐号

注册一个新帐号