Fisher's Exact Test

 

Menu location: Analysis_Exact_Fisher.

 

Like the chi-square test for fourfold (2 by 2) tables, Fisher's exact test examines the relationship between the two dimensions of the table (classification into rows vs. classification into columns). The null hypothesis is that these two classifications are not different.

 

The P values in this test are computed by considering all possible tables that could give the row and column totals observed. A mathematical short cut relates these permutations to factorials; a form shown in many textbooks. StatsDirect uses the hypergeometric distribution for the calculation (Conover 1999). The test statistic that is hypergeometrically distributed is the expected value of the first count A.

 

This exact treatment of the fourfold table should be used instead of the chi-square test when any expected frequency is less than 1 or 20% of expected frequencies are less than or equal to 5. With StatsDirect, it is reasonable to use Fisher's exact test by default because the computational method used can cope with large numbers.

 

StatsDirect uses the definition of a two sided P value described by Bailey (1977) (P values for all possible tables with P less than or equal to that for the observed table are summed). Some authors prefer simply to double the one sided P value (Armitage and Berry, 1994; Bland, 2000).

 

Consider using mid-P values and intervals when you have several similar studies within an overall investigation (Armitage and Berry, 1994; Barnard, 1989). Mid-P results are not shown for very large tables; if you want to calculate mid-P for large numbers then please use the odds ratio confidence interval function.

 

Assumptions:

·   each observation is classified into exactly one cell

·   the row and column totals are fixed, not random

 

The assumption of fixed marginal (row/column) totals is controversial and causes disagreements such as the best approach to two sided inference from this test.

 

DATA INPUT:

Observed frequencies should be entered as a standard fourfold table:

 

  feature present feature absent
outcome positive: a b
outcome negative: c d

 

Example

From Armitage and Berry (1994, p. 138).

 

The following data compare malocclusion of teeth with method of feeding infants.

 

  Normal teeth Malocclusion
Breast fed: 4 16
Bottle fed: 1 21

 

To analyse these data in StatsDirect you must select Fisher's exact test from the exact tests section of the analysis menu. Enter the frequencies into the contingency table on screen as shown above.

 

For this example:

 

Rearranged table:

4 1 5
     
16 21 37
     
20 22 42

 

Expectation of A = 2.380952

 

One sided (upper tail) P = 0.1435 (doubled one sided P = 0.2871)

Two sided (by summation) P = 0.1745

 

One sided mid-P = 0.0809

Two sided mid-P = 0.1618

 

Here we cannot reject the null hypothesis that there is no association between these two classifications, i.e. between feeding method and malocclusion.

 

P values

confidence intervals