Estimating the p-value in Permutation Tests

Permutation Tests vs Traditional Parametric Tests

P-values and its Interpretation in Permutation Tests

Permutation tests offer a powerful approach for hypothesis testing without relying on stringent assumptions about data distributions. Here’s a systematic method to evaluate the p-value using permutation tests:

Step 1: Choose a Test Statistic

Begin by selecting a suitable test statistic that captures the essence of the hypothesis being tested. Common test statistics include the difference in means, correlation coefficient, or any other relevant measure depending on the nature of the data and the research question.

Step 2: Shuffle the Data (Force the Null Hypothesis to be True)

Under the null hypothesis , assume that there is no meaningful difference or association between groups or variables. To simulate this scenario, shuffle or permute the observed data while maintaining the group labels or the relationship between variables intact. This shuffling process ensures that any observed differences or associations are due to random chance alone.

Step 3: Create a Null Sampling Distribution of the Test Statistic (under )

Generate multiple permutations of the shuffled data and compute the test statistic for each permutation. This results in a null sampling distribution of the test statistic, representing the distribution of values expected under the null hypothesis. The number of permutations should be large enough to adequately approximate the null distribution.

Step 4: Find the Observed Test Statistic on the Null Sampling Distribution and Compute the p-value

Compare the observed test statistic, calculated from the original unshuffled data, with the null sampling distribution. Determine the proportion of permutations where the test statistic is as extreme as or more extreme than the observed value. This proportion represents the p-value.

For a one-sided test, the p-value corresponds to the proportion of permutations where the test statistic is greater than or equal to the observed value (if testing for significance in one direction). For a two-sided test, it includes both extremes (greater and smaller values) depending on the directionality of the hypothesis.

Permutation tests in Machine Learning

Permutation tests become quite useful in these situations, offering a reliable and adaptable substitute for hypothesis testing. The basic idea behind permutation testing is to generate a null distribution by randomly permuting the observed data. This allows for inference to be made without making strict assumptions about the distribution of the data. This article aims to demystify permutation tests in machine learning.

Table of Content

What are Permutation Tests?
Permutation Tests vs Traditional Parametric Tests
Estimating the p-value in Permutation Tests
P-values and its Interpretation in Permutation Tests
Permutation Test in Python
Benefits and Limitations of Permutation Test
Applications of Permutation Tests in Machine Learning