Page 30 - Contributed Paper Session (CPS) - Volume 7
P. 30

CPS2020 Honeylet T. S.
                  4.  Discussion and Conclusion
                      Simulation  results  presented  above  summarized  the  effects  of  various
                  likely settings in statistical matching with the objective of estimating a count
                  regression  model.  Bootstrap-based  approaches  for  estimation  of  synthetic
                  data created from various imputation methods are in general at par with or
                  better than the benchmark regression approach. It might be of interest for
                  future studies to include a  donation class to  improve the estimates of the
                  model when synthetic datasets are created using random hot deck imputation.
                      Simulations  show  that  MCMC  imputation  and  Poisson  regression
                  imputation  produce  comparable  results  when  it  comes  to  accuracy  of
                  estimates.  Particularly,  MCMC  imputation  yields  lowest  RBIAS.  In  terms  of
                  predictive  ability  of  estimated  models,  Poisson  regression  imputation  and
                  MCMC  imputation  still  produce  comparable  results  although  Poisson
                  regression imputation has most of the lowest MAE values. Under random hot
                  deck  imputation,  RBIAS  and  MAE  values  are  larger  compared  to  those
                  produced by the other two matching methods.
                      Moreover, under Poisson regression imputation and MCMC imputation,
                  the three estimation procedures yield similar results. Accuracy of estimates
                  and predictive ability are comparable. In terms of predictive ability, the three
                  estimation procedures also yield similar results when using Poisson regression
                  imputation and MCMC imputation although bootstrap within and bootstrap
                  across produce lower MAE values compared to Poisson regression without
                  bootstrapping.
                       The predictive ability of the models estimated using bootstrap within is
                  good for all sample sizes and ratios of data sources (to total sample) in the
                  scenario  settings.  Moreover,  regardless  of  the  matching  method  used,
                  bootstrap within produces low RBIAS and MAE values. For instance, if random
                  hot deck is really necessary as the matching method because of the nature of
                  the data and no donation classes were specified, bootstrap within can yield
                  lower  RBIAS  and  MAE  values  compared  to  Poisson  regression  without
                  bootstrapping and bootstrap across method. It is interesting for further study
                  to incorporate donation classes in random hot deck imputation and evaluate
                  the  performance  of  bootstrap  within  and  bootstrap  across  estimation
                  methods.









                                                                      19 | I S I   W S C   2 0 1 9
   25   26   27   28   29   30   31   32   33   34   35