Page 146 - Contributed Paper Session (CPS)

Page 146 - Contributed Paper Session (CPS) - Volume 6

P. 146

CPS1851 Hee Young Chung et al.
model. We also suggest the optimal number of substrata that can be used in
practice based on the simulation results.

2. Estimation of bias using informative sampling technique
a. Review of the exponential response rate function
i. Review of informative sampling technique
The informative sampling is a sampling design in which there is a sample
selection mechanism which is the inclusion probability is influenced by the
values of the variable of interest and there is a super population model which
is the model between the variable of interest and the auxiliary variable.
Pfeffermann et al. (1988) showed that under the informative sampling, we have

∗
( | , ) = ( | ∈ , ) = Pr( ∈ | , ) ( |, ) where ∗ is a

Pr( ∈ | )

function of θ . Also with Pr( ∈ | , ) = ( | , ) and Pr( ∈ | ) =

( | ), we have the following relationship,

( | , ) ( | )

( | ) = ( | ) (1)

where ( | ) is the population distribution, ( | ) is the sample

distribution and ( | , ) is the probability that the datum will be included

in the sample when , are given. Whenever ( | , ) = ( | ), the

population distribution is the same as the sample distribution. When the super
population model is a simple regression model and the response rate is
exponential, we have the following results:

2
( | ) = ( + , ) (2)

( | , ) = ( + ) (3)

0
1

where ( | ) is the population distribution. Now substituting (2) and (3) into

equation (1) yields the following sample distribution.

( | ) = ( + + , ) (4)
2
2
1
0

1
2
Therefore, by comparing (2) and (4), we can confirm the of bias.
1
ii. Parameter estimation of the exponential response rate function
2
The magnitude of the bias calculated in equation (4) is . Since
1
informative sampling uses known , , the magnitude of bias can be
0
1
calculated by estimating form the regression model which is made by the
2
variable of interest and auxiliary variable. However, in the case of the response
rate model, should be estimated. For this, Chung and Shin (2017) estimated
1
by a method dividing the given stratum or population into equally spaced
1
135 | I S I W S C 2 0 1 9

141 142 143 144 145 146 147 148 149 150 151