Page 321 - Contributed Paper Session (CPS) - Volume 7
P. 321

CPS2110 Johann Sebastian B. C. et al.
               not  be  able  to  reach  a  very  high statistical power  as  the  true  form  of  the
               variance of the count data is, in fact, a transcendental function, in which a finite
               combination of polynomials can only give an approximation to an extent of
               error.
                   As  stated  in  the  previous  section,  the  following  table  presents  the
               proportion of samples where the estimated coefficient  is negative, which
               may result in a negative variance of the count data:

                            Form        Constant   Linear   Quadratic   Polynomial
                        Equidispersed      64%      65%       69%          69%
                            Weak           53%      53%       58%          58%
                           Strong          0%        0%        0%          0%
                       Transcendental      30%      32%       40%          40%
                                    Table 5. Proportion of negative estimates

                   From  the  table,  for  the  forms  of  weak  to  no  overdispersion,  the  four
               assumed variance models estimate a negative coefficient more than half of
               the  time,  which  diminishes  as  the  true  form  of  overdispersion  becomes
               stronger. For the transcendental form, however, a negative estimate happens
               only a third of the time. Nonetheless, because the true form of overdispersion
               is not made known to the researcher, the current method of modelling and
               testing the variance of count data can be said to be prone to producing an
               erroneous,  negative  variance.  This  has  also  been  observed  in  the  two
               pioneering studies of Cameron and Trivedi (1986) (1998).

               4.  Discussion and Conclusion
                   The  assumption  of  a  certain  form  of  the  variance  and  testing  its
               significance using the parametric -distribution is the commonly used method
               of testing for overdispersion in Poisson regression. However, because of the
               dissimilarities  of  the  size  and  power  of  the  test  under  the  parametric  and
               nonparametric paradigms, it is indicative that the -distribution may not be
               the true distribution of the score test statistic, and thus, is an inappropriate
               choice when a regression-based test for overdispersion is to be executed.
                   Moreover,  assuming  the  form  of  the  variance  is  very  prone  to  model
               misspecification  and  the  poor  performance  of  the  model:  using  less
               parameters in the variance model will result in a parsimonious model which
               gives a good statistical size when the appropriate distribution is used but fails
               to capture the extra-Poisson variation when the null hypothesis is false, thus
               resulting in low statistical power of the test. On the other hand, adding more
               parameters to the variance model will results in a model which may be able to
               capture more of the variation in the data, but fails to be simplified when the


                                                                  308 | I S I   W S C   2 0 1 9
   316   317   318   319   320   321   322   323   324   325   326