Page 147 - Special Topic Session (STS) - Volume 4
P. 147

STS571 Ossi Nurmi et al.
            1.  Introduction
                During 2016 up to 2018, Statistics Finland carried out the work in two
            phases within the context of Eurostat’s ESSNet Big Data –project. The focus in
            these projects was on three statistical domains: inbound tourism, outbound
            tourism and seasonal population. The first phase in 2017 and 2017 focused on
            negotiations with national authorities and MNOs in order to set up such a
            process  that  is  feasible  from  a  legislative  and  technical  point  of  view.  The
            second phase was to carry out the process, collect data from each MNO and
            analyze the results. The chosen approach relies on the operators to process
            the  data  and  aggregate  it  for  Statistics  Finland.  In  the  current  Finnish
            legislation,  only  the  operator  is  allowed  to  process  the  raw  data  using
            automatic means. The size of the raw data is also massive, with annual data
            consisting of several billions of events per operator.























                      Figure 1 Process from raw microdata to aggregated trips

            The  starting  point  for  each  operator  are  the  raw  data  of  each  of  their
            subscribers. The subscriptions are associated with sim cards found on mobile
            devices. Machine-to-machine sim cards are excluded from the data as they do
            not represent the movement of people.

            2.  Methodology for raw data: processing roaming events to tourism
                trips
                In case of outbound trips, the raw data consists of roaming events (calls,
            sms, mobile data) taking place outside of the subscriber’s home network, in
            other words, a foreign country where the event took place. Based on the time
            gaps  between  these  events,  individual  trips  of  each  subscriber  can  be
            recognized.







                                                               136 | I S I   W S C   2 0 1 9
   142   143   144   145   146   147   148   149   150   151   152