Brief solution to Problem on matching,
Epidemiology Course VHM 811 at AVC - Fall Semester 2008

  1. As a start, we compute risks for the four population groups in Table 10-1 (table numbers refer to Rothman & Greenland, 1998):

    Table 10-1MalesFemales
    ExposedUnexposedExposedUnexposed
    No. cases in 1 year45005010090
    Total population900,000100,000 100,000900,000
    Risk (%)0.50.050.10.01

    The table shows confirms that: within sex, exposed have 10 times the risk of unexposed, and within exposure levels, males have five times the risk of females. The crude risk for exposed subjects is (4500+100)/1000000=0.46%, and the crude risk for unexposed subjects is (50+90)/1000000=0.014%, giving a risk ratio of 32.9.

  2. The proportion of males among exposed subjects is 9/10=0.9, and among 100,000 exposed subjects we therefore expect 90,000 males and 10,000 females. When matching the sex distribution for the unexposed subjects to the exposed subjects, we try to achieve the same distribution of males and females, i.e., 90,000 males and 10,000 females in the unexposed group as well. The expected numbers of cases in the four groups are given in Table 10-2 below. They were computed from the risk ratios in Table 10-1, e.g. for exposed males: 0.005*90000=450.

    Table 10-2MalesFemales
    ExposedUnexposedExposedUnexposed
    Expected cases45045101
    Total study90,00090,000 10,00010,000

    We immediately see that the risk ratio equals 10 within each sex, and the crude risk ratio is (450+10)/(90000+10000) / (45+1)/(90000+10000) = 10. Therefore, there is no confounding by sex.

  3. The total number of cases is 4740. The number of male cases is 4550, and when matching the sex distribution for the controls to the cases, we try to achieve the same distribution of males and females, i.e., 4550 males and 190 females. The expected numbers of exposed and unexposed subjects in the four groups are given in Table 10-4 below. The numbers for the cases we had already, in Table 10-1. For the controls, the distribution of exposed and unexposed subjects within sex in the population from Table 10-1 were used, e.g. for exposed males: 4550*9/10=4095.

    Table 10-4MalesFemales
    ExposedUnexposedTotalExposedUnexposedTotal
    Cases4500504550 10090190
    Controls40954554550 19171190

    Within both males and females, the (expected) odds-ratio equals 10 (e.g., (4500*455)/(50*4095)=10). However, the crude odds-ratio is (4600*626)/(140*4114)=5.0. This shows that an analysis stratified on sex will produce the correct odds-ratio but the unstratified analysis will not. Note that the bias introduced is in the opposite direction of the confounding bias in the population.

    The selection bias for the exposure is seen from the proportion of exposed subjects among the selected controls being (4095+19)/4740=0.868, whereas the population proportion is 0.5. Note that the proportion among the cases is 4600/4740=0.97, so the proportion among the controls is indeed between these two numbers.

  4. The desired sex distribution within controls is the same as within cases, i.e. a proportion of 4550/4740=0.96 males. Among 1000 cases this corresponds to 960 males and 40 females. We therefore aim for 960 males and 40 females among the controls as well. The expected numbers for these samples are given in the table.

    Sample of 1000 cases/controlsMalesFemales
    ExposedUnexposedTotalExposedUnexposedTotal
    Cases949.4510.55960 21.0518.9540
    Controls86496960 43640


    Henrik Stryhn (hstryhn@upei.ca) 2008-11-18