Page 297 - Special Topic Session (STS) - Volume 1
P. 297

STS430 Eustasio D.B. et al.
            which measures the spread of the distributions. Then, to measure closeness to
            a deformation model, we take a look at the minimal variation among warped
            distributions, a quantity that we could consider as a minimal alignment cost.
            Under some mild conditions, a deformation model holds if and only if this
            minimal  alignment  cost  is  null  and  we  can  base  our  assessment  of  a
            deformation model on this quantity.
                As in [16, 26], we provide results (a Central Limit Theorem and bootstrap
            versions) that enable to reject that the minimal alignment cost exceeds some
            threshold, and hence to conclude that it is below that threshold. Our results
            are given in a setup of general, nonparametric classes of warping functions.
            We also provide results in the somewhat more restrictive setup where one is
            interested in the more classical goodness-of-fit problem for the deformation
            model.  Note  that  a  general  Central  Limit  Theorem  is  available  for  the
            Wasserstein distance in [19]. This works is published in a long version in [9].

            2.  Wasserstein variation and deformation models for distributions
                Much recent work has been conducted to measure the spread or the inner
            structure of a collection of distributions. In this paper, we define a notion of
            variability  which  relies  on  the  notion  of  Fréchet  mean  for  the  space  of
            probabilities endowed with the Wasserstein metrics, of which we will recall the
                                                                                   
            definition hereafter. First, for any integer d ≥ 1, consider the set  (ℝ ) of
                                                                               
                                                                      
            probabilities with finite rth moment. For  and  in  (ℝ ), we denote by
                                                                  
                                                                                      
                                                                                
            ∏(, ) the set of all probability measures  over the product set ℝ × ℝ
            with  first  (respectively  second)  marginal    (respectively ) .  The  
                                                                                      
            transportation cost between these two measures is defined as
                                       
                                                           
                                (, ) =  inf  ∫‖ − ‖ (, ).
                                 
                                            ∏(,)
                                                                                
                This transportation cost makes it possible to endow the set  (ℝ ) with
                                                                             
            the metric  (, ). More details on Wasserstein distances and their links with
                        
            optimal transport problems can be found, e.g., in [27, 35].
                Within this framework, we can define a global measure of separation of a
                                                                            
            collection of probability measures as follows. Given  , …   (ℝ ), let
                                                                     
                                                                1
                                                                         
                                                                    1 ⁄
                                                                    
                                                    1
                                                           
                              ( , … ,  ) =  inf  { ∑  (  , )}
                                       
                              
                                                              
                                 1
                                                           
                                                 
                                             (ℝ )  
                                                      =1
            be the Wasserstein r-variation of  , … ,   or the variance of the  s.
                                                                            
                                              1
                                                    
                The special case r = 2 has been studied in the literature. The existence of
                                           2
                                                           2
            a minimizer of the map  ↦ { ( , ) + ⋯ +  ( , )}/ is proved in [1], as
                                           2
                                                              
                                              1
                                                           2
            well as its uniqueness under some smoothness assumptions. Such a minimizer,
             , is called a barycenter or Fréchet mean of  , … ,  . Hence,
                                                         1
                                                               
              
                                                               286 | I S I   W S C   2 0 1 9
   292   293   294   295   296   297   298   299   300   301   302