Page 176 - Contributed Paper Session (CPS) - Volume 1
P. 176

CPS1255 Tsung-Jen Shen et al.
                  community ecology (Kunin & Gaston 1993). In this context, the development
                  of robust statistical methods for accurately predicting the occurrence of new
                  rare species in additional samples is urgent and necessary.

                  2.  Methodology
                      Assume  that  an  initial  sample  of  n  individuals  is  collected  from  a
                  metacommunity  of  S  species  in  which  the  species  relative  abundance
                  distribution is given by  p , p ,… , p . Let the binary random variable  Z =1,
                                           1  2     S                                  i, j
                  signifying that the jth selected individual is identified as species i; otherwise
                  Z = 0, where j = 1, 2, …, n. ( X , X ,… ,X ) represents species counts with
                    i, j
                                                    2
                                                 1
                                                           S
                        
                   = ∑ 
                           , in the sample and follows a multinomial distribution with a
                   
                       =1
                  grand total of n and occurrence probabilities of  p , p ,… , p . Let  F (m) be
                                                                    1  2     S       k
                  the expected number of newly found species absent in the first sample but
                  have exactly k individuals detected in an additional sample of size m. Similarly,
                  the binary variable  Z  (j = n+1, n+2, …, n+m) can be used to describe the
                                       i, j
                  sampling outcome for the additional sample. Mathematically, we can express
                  F (m) by
                    k
                                é  S        æ     ö            ù
                      F (m) = Eê å I(X = 0)ç   m  ÷ p (1- p ) m-k ú
                                                     k
                                                  ÷
                       k        ê i=1  i    ç  k ø  i     i    ú
                                            è
                                ë
                                                               û
                              æ    ö S   æ                n+m       ö æ n+m      ö ,      (1)
                                         ç
                            =ç  m  ÷ å P Z ,Z ,… ,Z   i,n+m å Z = k ÷ Pç å Z = k÷
                                   ÷
                              ç
                                                                    ÷ ç
                                                                                 ÷
                                         ç
                              è  k ø i=1  è  i,1  i,2     j=1  i, j  ø è  j=1  i, j  ø
                  where I(A) is an indicator function such that I(A) = 1 if statement A is true and
                                                               k
                  I(A)  =  0  if  untrue.  Note  that  the  term  p (1- p ) n+m-k   in  F (m)  can  be
                                                               i    i           k
                  equivalently interpreted as exactly k individuals of species i coming out of
                  n+m  individuals  in  the  combined  original  and  additional  samples.  The
                  derivation from the second equality to the last equality in Eq. 1 is based on the
                  fact  that  Z   entities  can  be  regarded  as  independent  random  Bernoulli
                             i, j
                  variates with success rates  p  while the sampling outcome is that there are k
                                              i
                  successful outcomes out of  n+m trials.
                      Incorporating Bayesian weights, we proposed an estimator of  F (m) as
                                                                                     k
                  (Shen & Chen 2018)








                                                                     165 | I S I   W S C   2 0 1 9
   171   172   173   174   175   176   177   178   179   180   181