Page 371 - Contributed Paper Session (CPS)

Page 371 - Contributed Paper Session (CPS) - Volume 6

P. 371

CPS1969 Janna M. De Veyra
3.3 Both y and z have 5 categories
Unlike in the first two sections, the size of the test when both y and z have 5
categories will be correctly sized even without the application of the bootstrap
procedure in the test when random error was included in the imputation.
Though the application of bootstrap procedure would still produce a correctly
sized test upon the inclusion of random error in the imputation, the test has a
higher power when bootstrap procedure was not applied. Similar as in the first
two sections, power in this case slightly increases when MCMC was applied in
the imputation.

Table 3. Average Size and Power of the Test when both y and z have 5 categories
Regression MCMC Regression Stochastic MCMC Stochastic

Imputation Imputation Imputation Imputation
Evaluation w/o Bootstrap Bootstrap within Bootstrap across w/o Bootstrap Bootstrap within Bootstrap across w/o Bootstrap Bootstrap within Bootstrap across w/o Bootstrap Bootstrap within Bootstrap across

Power 0.61 0.53 0.43 0.61 0.54 0.43 0.80 0.61 0.38 0.82 0.61 0.38
Size 0.55 0.39 0.16 0.54 0.39 0.16 0.03 0.00 0.00 0.03 0.00 0.00

4. Discussion and Conclusion
This paper analyzed the effect of matching categorical variables to test for
their independence using different simulation scenario on these matching
techniques (1) Logistic Regression, (2) MCMC on Logistic Regression, (3)
Logistic Regression with the inclusion of random error, and (4) MCMC on
Logistic Regression with the inclusion of random error. The test here is to be
obtained by (1) not resampling from the synthetic data (without bootstrap),
(2) resampling with replacement within the synthetic data (bootstrap within),
and (3) resampling with replacement across the synthetic data (bootstrap
across). These methods were evaluated by computing for the size and power
of the test.
Simulation shows that the use of MCMC slightly increases the power of the
test. The increase in power can evidently be seen when random error was
included in the imputation model. Bootstrap, on the other hand, produces a
correctly sized test when applied in an imputation procedure that has a
random error in the model. Among the two bootstrap approaches that were
considered, bootstrap within yields a higher power than bootstrap across.
However, when the variables of interest both have 5 categories, the test is
already correctly sized upon the inclusion random error in the model even
without applying the bootstrap procedure. Hence, it is safe to say that the
bootstrap procedure is more useful when the number of categories for both y
and z is small.

360 | I S I W S C 2 0 1 9

366 367 368 369 370 371 372 373 374 375 376