Risk (Retrospective)


Menu location: Analysis_Clinical Epidemiology_Risk (Retrospective).


This function calculates odds ratios and population attributable risk with confidence intervals.


You can examine the likelihood of an outcome such as disease in relation to an exposure such as a suspected risk or protection factor. The study design considered here is retrospective and usually a case-control study. If you need information on prospective studies see risk (prospective).


The type of data used by this function is counts or frequencies (number of individuals with a study characteristic). You should organize these data into a fourfold table divided by outcome (e.g. disease status) in one dimension and the presence or absence of a characteristic factor (e.g. risk factor) in the other:


NO: c d


Odds ratio (OR) = (a*d)/(b*c)


Estimate of population exposure (Px) = c/(c+d)

Estimate of population attributable risk% = 100*(Px*(OR-1))/(1+(Px*(OR-1)))


In retrospective studies you select subjects by outcome and look back to see if they have a characteristic factor such as a risk factor or a protection factor for a disease. The odds ratio ((a/c)/(b/d)) looks at the likelihood of an outcome in relation to a characteristic factor. In epidemiological terms, the odds ratio is used as a point estimate of the relative risk in retrospective studies. Odds ratio is the key statistic for most case-control studies.


In prospective studies, Attributable risk or risk difference is used to quantify risk in the exposed group that is attributable to the exposure. In retrospective studies, attributable risk can not be calculated directly but population attributable risk can be estimated. Population attributable risk estimates the proportion of disease in the study population that is attributable to the exposure. In order to calculate population attributable risk, the incidence of exposure in the study population must be known or estimated, StatsDirect prompts you to enter this value or to default to an estimate made from your study data. Population attributable risk is presented as a percentage with a confidence interval when the odds ratio is greater than or equal to one (Sahai and Kurshid, 1996).


Technical validation

A confidence interval (CI) for the odds ratio is calculated using an exact conditional likelihood method (Martin and Austin, 1991). The exact calculations can take an appreciable amount of time with large numbers.


Approximate power is calculated as the power achieved with the given sample size to detect the observed effect with a two-sided probability of type I error of (100-CI%)% based on analysis with Fisher's exact test or a continuity corrected chi-square test of independence in a fourfold contingency table (Dupont, 1990).



From Sahai and Kurshid (1996, p. 209).


The following data represent a retrospective investigation of smoking in relation to oral cancer.


  Smoking status (cigarettes per day)
  ≥ 16 < 16
Cases: 255 49
Controls: 93 46


To analyse these data in StatsDirect select Risk (Retrospective) from the Clinical Epidemiology section of the Analysis menu. Choose the default 95% confidence interval. Then enter the above frequencies into the 2 by 2 table on the screen.


For this example:


Risk analysis (retrospective)

    Present Absent
Outcome: Positive: 255 49
Negative: 93 46


Observed odds ratio = 2.574062

Approximate power (for 5% significance) = 96.84%


Approximate (Woolf, logit) 95% confidence interval = 1.613302  to  4.106976


Conditional maximum likelihood estimates:

Conditional estimate of odds ratio = 2.56799

Exact Fisher 95% confidence interval = 1.566572  to  4.213082

Exact Fisher one sided P < 0.0001, two sided P < 0.0001

Exact mid-P 95% confidence interval = 1.606435  to  4.107938

Exact mid-P one sided P < 0.0001, two sided P < 0.0001


Population exposure % =78.555305

Population attributable risk % = 55.287461

Approximate 95% confidence interval = 38.300819 to 72.274103


From these data we have evidence that the odds of developing oral cancer is around two and a half times higher for heavy smokers compared with lighter (less than 16 per day) or non-smokers of cigarettes. With 95% confidence we infer that the true population value for this statistic lies between one and a half and four times. Using an estimate of 67% heavy smoking, for the population studied in this 1957 investigation, we can infer with 95% confidence that the proportion of oral cancer cases in that population that were due to heavy smoking lay between 34 and 68 percent.


confidence intervals