Page 220 - Contributed Paper Session (CPS) - Volume 4
P. 220

CPS2201 Mikhail L.
                  treated  active  learning  for  the  Cook  -  Torrance  model  Cook  and  Torrance
                  (1981). Analysis of BRDF data with statistical and machine learning methods
                  was  discussed  in  Langovoy  (2015b),  Langovoy  (2015a),  Sole  et  al.  (2018),
                  Doctor and Byers (2018).

                  2.  Active learning and design of experiments
                     In  general,  BRDF  is  a  5-dimensional  manifold,  having  4  angular  and  1
                  wavelength dimension. Note that even a set of 1-dimensional manifolds is
                  infinite-dimensional (and k-dimensional manifolds are not to be confused with
                  parametric  −dimensional families of functions). At the same time, a typical
                  measuring device only takes between 50 and 1000 points for all the BRDF
                  layers together. In view of this, the available measurement points are indeed
                  very scarce for a complicated problem such as BRDF estimation. Therefore, an
                  efficient sampling strategy is required when performing the measurements.
                  Since sets of BRDF measurements are, in fact, observed random manifolds, we
                  are dealing here with manifold-valued data.
                     Statistical  design  of  experiments  (see  Fisher  et  al.  (1960),  Cox  and  Reid
                  (2000))  is  a  well  developed  area  of  quantitative  data  analysis.  However,
                  previous  research  in  this  field  was  often  more  concerned  with  (important)
                  topics  such  as  manipulation  checks,  interactions  between  factors,  delayed
                  effects, repeatability, among many others. This shifted the focus away from
                  considering design of  statistical experiments on structured, constrained, or
                  infinite-dimensional data. In contrast, BRDF measurements are carried out in
                  strictly defined settings and by qualified experts. Therefore, there is less room
                  for  human  or  random  errors  and  influences.  On  the  other  hand,  BRDF
                  measurements are collections of points representing manifolds, so defining
                  even the simplest statistical quantities in this case turns out to be a nontrivial
                  and conceptual task.
                     Overall, our methodology represents a far-reaching generalization of the
                  active machine learning framework, also generalizing the proactive learning
                  setup of Donmez and Carbonell (2008). Active learning, as a special case of
                  semi-supervised machine learning, oftentimes deals with finite sets of labels
                  and aims at solving classification or clustering problems with a finite number
                  of classes. While there have been a number of promising practical applications,
                  most  of  the  existing  theory  deals  with  analysis  of  performance  of  specific
                  algorithms  (query  by  committee,    algorithm,  or  importance  weighted
                                                       2
                  approach, among a few others) under rather restrictive conditions on the loss
                  functions,  incoming  distributions,  and  other  components  of  the  learning
                  model. For recent developments, we refer to Agarwal et al. (2013), Beygelzimer
                  et al. (2009), Dasgupta and Hsu (2008).



                                                                     209 | I S I   W S C   2 0 1 9
   215   216   217   218   219   220   221   222   223   224   225