Page 122 - Special Topic Session (STS) - Volume 2
P. 122

STS466 Sasongko Y.
                  need  to  measure  the  market  sentiment  based  on  people  opinion  and
                  transform  it  into  scoring  variable  to  be  calculated  together  in  the  stock
                  predictive model.

                  1.1 Unstructured Data
                     Data can  be categorised into structured and unstructured based on its
                  characteristic. Structured data is data established in a structured format and
                  pattern that has been defined properly. Tables, worksheets, are typical format
                  that are used to store the data. For so many years organizations have been
                  putting more attention to the structured data. They invest million of dollars
                  developing framework to manage the structured data. Relational database,
                  data  warehouse,  business  intelligence,  are  frameworks  have  been  used  by
                  organization in managing structured data.
                     The unstructured data is data without pattern and definitive format. It can
                  be a narrative text or binary code without any descriptive provided. Content
                  of books, content of documents, memos, contracts, images, movies are sample
                  of  unstructured  data.  This  kind  of  data  receives  less  attention  from  the
                  organization for data processing as the characteristic of the data makes them
                  challenging to process. The common processing done to the unstructured
                  data are indexing and searching.
                     Unfortunately, 90% of data in the world is in unstructured format. It has
                  been a challenge for many organizations to engage with unstructured data
                  processing. Putting the context on the data, extracting the object based on
                  language  grammar,  clustering  the  content  for  similarity,  are  research
                  conducted as part of unstructured data processing. It will be a great impact
                  for organization if they are able to manage the unstructured data, so better
                  knowledge can be acquired, and better decision making can be established.

                  1.2 The Internet Era
                     Internet has been a commodity since the booming of dot com during end
                  of nineties. The cost required by public is getting more affordable every year.
                  Internet penetration reports are also showing positive trend in many countries.
                  With the smart phone as everybody’s communication platform, internet access
                  become a basic need for everyone.
                     The social media also contributes significant impact on digitalization of
                  asset. More content is produced every day in digital format and store them in
                  the  internet-based  platform  like  clouds.  It  makes  the  internet  become  the
                  giant platform of data sources which can be one of organization directions in
                  seeking more sources for information.




                                                                     111 | I S I   W S C   2 0 1 9
   117   118   119   120   121   122   123   124   125   126   127