Page 308 - Contributed Paper Session (CPS) - Volume 4
P. 308

CPS2233 Sharon Lee
                  by manual analysis. The CCR was calculated separately for each data and the
                  results are shown in Table 1, together with the results by HDPGMM (Cron et
                  al., 2013) and FLAME (Pyne et al., 2009). The later method adopts a cluster
                  matching step in a post-hoc manner. It can be observed from Table 1 that
                  Hcyto obtained a higher CCR than HDPGMM and FLAME for most of the 16
                  data. This is also supported by the average CCR across the batch, where Hcyto
                  obtained  0.929  compared  to  0.796  and  0.538  obtained  by  HDPGMM  and
                  FLAME, respectively.

                  4.  Discussion and Conclusion
                      The clustering and alignment of cell populations across multiple data is an
                  interesting and challenging problem. The proposed Hcyto method adopts a
                  hierarchical approach to automatically segment and match these clusters, with
                  implicit models that can directly handle non-normal distributional features.
                  The methodology is motivated and demonstrated by cytometric data analysis,
                  but is applicable to other types of data with similar structure. Results from the
                  real example shows that Hcyto provides improved accuracy compared to other
                  algorithms  that  adopt  intuitive  approaches  such  as  pooling  and  post-hoc
                  cluster  matching.  Future  work  may  look  at  the  scalability  of  the  Hcyto
                  framework for larger data and extend it for use in downstream analyses such
                  as  identification  of  discriminatory  features,  supervised  classification  of
                  unlabelled data, and longitudinal modelling of batches.

                  References
                  1.  Aghaeepour N, Nikolic R, Hoos HH, Brinkman RR. (2011). Rapid cell
                      population identification in flow cytometry data. Cytometry Part A
                      79A:6–13.
                  2.  Aghaeepour, N., Finak, G., The FLOWCAP Consortium, The DREAM
                      Consortium, Hoos, H., Mosmann, T., Gottardo, R., Brinkman, R.R.,
                      Scheuermann, R.H. (2013). Critical assessment of automated ow
                      cytometry analysis techniques. Nature Methods 10, 228-238
                  3.  Aghaeepour, N., Chattopadhyay, P.K., Chikina, M., Van Gassen, S., Kurs,
                      M., Malek, M., McLachlan, G.J., Qui, P., Saeys, Y., Stanton, R., Tong, D.,
                      Wang, K., Nolan, G., Finak, G., Gottardo, R., Mossman, T., Scheurmann R.,
                      and Brinkman, R. (2016). Benchmark for evaluation of algorithms for
                      identification of cellular correlates of clinical outcomes. Cytometry: Part
                      A 89A, 16-21
                  4.  Azzalini, A., Dalla Valle, A.(1996). The multivariate skew-normal
                      distribution. Biometrika 83, 715726 Cron A, Gouttefangeas C, Frelinger J,
                      Lin L, Singh SK, et al. (2013). Hierarchical modeling for rare event
                      detection and cell subset alignment across flow cytometry samples. PLoS
                      Computational Biology 9.

                                                                     297 | I S I   W S C   2 0 1 9
   303   304   305   306   307   308   309   310   311   312   313