Page 69 - Special Topic Session (STS) - Volume 4
P. 69

STS563 Patrick Graham et al.
            missing data absent from the list are the indicators for inclusion in the target

            population. If we could determine the over-coverage,  01 , then since the list
            total,  , is directly observed, we could immediately  obtain  the number of
                   
            people  both  in  the  target  population  and  on  the  list  as
             =   –  . On the other hand, an unknown   individuals are in the target
                   
                        01
                                                           10
              11
            population but not on the list. This group represents the “under-coverage” of
            the list with respect to the target population. If we could estimate  , then
                                                                                11
            given  an  estimate, ̂ 10   of  10  we  could  obtain  an  estimate  of  the  target
            population total   as
                              

                                                      ̂
                                    ̂
                                    = ̂ 11  + ̂ 10  =  − ̂ 01  + ̂ .
                                                       
                                                                  10
                                     

            Ideally, we would like to estimate not just the total population size   but the
                                                                               
            number of people in the target population by characteristics such as age, sex,
            ethnic group and area. Therefore, we assume a structure such as Table 1 for
            each combination of these variables. We let X denote the covariates of interest
            and X = x a particular combination of these variables.
                Allowing for dependence on the covariates, Table 2, describes a probability
            model underpinning the cross-tabulation of the target population and the list.
            The probabilities for the three occupied cells in Table 2 sum to one. Under this
            model, an individual in the target population-list union, with covariates x is
            allocated to one of the three possible cells with the probabilities given in Table
            2. Thus, at the unit level, we posit a multinomial model,with one trial. Given the
            cell probabilities from Table 2 we can define the under-coverage probability,

            Pr( not on list |in Target, X = x) as   () =  (x)/(  (x) +  (x)) and
                                                                     11
                                                             10
                                                                              10
            the over coverage probability for the list, Pr( not in Target|on list, X = x) as
              () =   (x)/(   (x) +  (x)). Since   (x) +  (x) +  (x) = 1 we
                                                         11
                                           01
                                                                           01
                                  11
                         01
                                                                  10
            need specify only two of the cell probabilities to fully specify the multinomial
            model implied by Table 2. A convenient approach is to model   () and
             (x). The remaining cell probabilities can then be obtained as   (x) = (1 –
                                                                            11
              01
             (x))(1 –   ()),  (x) = (1 –  (x))   ().
              01
                                                 01
                                    10

                  Table 3: Cell-probabilities for the sample-list union at setting x of the covariates
                                                List
                                                  1                          0
               Sample      1                  λ(x)φ11(x)                 λ(x)φ10(x)
                           0            (1 − λ(x))φ11(x) +φ01(x)       (1 − λ(x))φ10(x)






                                                                58 | I S I   W S C   2 0 1 9
   64   65   66   67   68   69   70   71   72   73   74