Page 102 - Contributed Paper Session (CPS) - Volume 3
P. 102

CPS1954 Vincent C. et al.
                  obtained directly from the regression coefficients because   represents the
                                                                             
                  rate of change in the HAZ between years  and   +1  .
                                                           
                      So  far,  we  have  not  mentioned  any  distributional  assumption  on  the
                  growth velocity vector   = (  , . . .  ) . Anderson et al. (2018) and Lee et
                                                          
                                          
                                                0
                                                       
                  al. (2018) model    as realisations from a multivariate ( , Σ ) distribution
                                                                            
                                                                               
                                    
                  with mean vector   and covariance matrix Σ . This signifies a homogeneous
                                                              
                                     
                  population model where individual growth profiles largely follow the trend of
                  a  global trajectory and the variability of  deviation from this mean curve is
                  determined by Σ . On average, the rate of growth is the same for all children
                                  
                  in the population. However, this is rarely the case in practice. For example,
                  Goode  et  al.  (2014)  find  that  higher  socio-economic  status  has  a  positive
                  impact  on  the  HAZ  through  greater  health  consciousness  and  better
                  household  sanitation  system.  Therefore,  we  consider  a  normal  mixture
                  distribution where



                  for positive weights   summing to 1 in order to accommodate for a more
                                       
                  complex  composition  in  the  population.  Each  mixture  component  in  (2.3)
                  corresponds to a particular type of growth pattern and each child belongs to
                  one of these  subgroups. By clustering the children into different subgroups,
                  further  analysis  can  then  be  done  to  identify  risk  factors  which  cause  the
                  manifestation of certain growth behaviour.
                      Equation (2.3) requires the specification of the number of subgroups ,
                  which is often not known a priori in practice. Therefore, we employ a Bayesian
                  non-parametric  approach  to  circumvent  the  model  selection  procedure  in
                  modelling the distribution of the growth velocity  . Conceptually, the number
                                                                  
                  of parameters in a Bayesian non-parametric model is set to infinity and a prior
                  distribution  is  posited  on  the  infinite  dimensional  parameter  space Θ.  The
                  complexity of the model (referring to the value of  in our setting) is then
                  adapted to the amount of information available in the dataset. One such prior
                  which has been widely used in various applications (da Silva, 2007; Blunsom et
                  al., 2008) is the Dirichlet process (DP) prior established in Ferguson (1973).
                      In  order  to  illustrate  the  usefulness  of  the  DP  prior,  we  formulate  our
                  problem of modelling the distribution of   by a mixture distribution in terms
                                                           
                  of a DP mixture model (Antoniak, 1974) in which




                  for   =  1, . . . ,  where   = (  ,  ) is the parameter of a normal distribution
                                                   
                                                
                                          
                  specifying  the  mixture  component  associated  with  child    and  (,  )
                                                                                           0
                                                                      91 | I S I   W S C   2 0 1 9
   97   98   99   100   101   102   103   104   105   106   107