Page 122 - Special Topic Session (STS) - Volume 2
STS466 Sasongko Y.
need to measure the market sentiment based on people's opinions and
transform it into a scoring variable to be calculated together with the other
inputs in the stock predictive model.
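As a minimal sketch of how opinion text might be turned into a scoring variable, the example below uses a simple word-list approach. The word lists, the function names `sentiment_score` and `daily_sentiment`, and the averaging step are all assumptions made for illustration; the scoring method actually used in this work is not described in this section.

```python
import re

# Hypothetical word lists, for illustration only (not taken from the paper).
POSITIVE = {"gain", "growth", "profit", "strong", "bullish"}
NEGATIVE = {"loss", "decline", "weak", "risk", "bearish"}


def sentiment_score(text: str) -> float:
    """Return a score in [-1, 1]: (positive - negative) / matched words."""
    words = re.findall(r"[a-z']+", text.lower())
    pos = sum(w in POSITIVE for w in words)
    neg = sum(w in NEGATIVE for w in words)
    matched = pos + neg
    # Text with no sentiment words is treated as neutral.
    return 0.0 if matched == 0 else (pos - neg) / matched


def daily_sentiment(opinions: list[str]) -> float:
    """Average the per-opinion scores into a single model variable."""
    if not opinions:
        return 0.0
    return sum(sentiment_score(o) for o in opinions) / len(opinions)
```

A value such as `daily_sentiment(opinions_for_one_day)` could then be placed alongside price-based features as one more input column; how it is weighted is left to the predictive model.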
1.1 Unstructured Data
Data can be categorised as structured or unstructured based on its
characteristics. Structured data is data stored in a properly defined format
and pattern. Tables and worksheets are typical formats used to store such
data. For many years, organizations have paid more attention to structured
data. They invest millions of dollars in developing frameworks to manage it.
Relational databases, data warehouses, and business intelligence are
frameworks that organizations have used to manage structured data.
Unstructured data is data without a pattern or definitive format. It can
be narrative text or binary code without any description provided. The
contents of books and documents, memos, contracts, images, and movies are
examples of unstructured data. This kind of data receives less attention from
organizations because its characteristics make it challenging to process.
The most common processing applied to unstructured data is indexing and
searching.
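To illustrate what indexing and searching can look like for free text, the following is a minimal sketch of an inverted index, which maps each word to the documents that contain it. The function names `build_index` and `search`, and the rule that every query word must match, are assumptions chosen for the example rather than a description of any particular product.

```python
import re
from collections import defaultdict


def build_index(documents: dict[str, str]) -> dict[str, set[str]]:
    """Map each lowercased word to the set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for word in re.findall(r"\w+", text.lower()):
            index[word].add(doc_id)
    return index


def search(index: dict[str, set[str]], query: str) -> set[str]:
    """Return the ids of documents that contain every word in the query."""
    terms = re.findall(r"\w+", query.lower())
    if not terms:
        return set()
    # Copy the first posting set so the index itself is never modified.
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result
```

For example, after indexing a small set of memos with `build_index`, a call such as `search(index, "contract memo")` returns only the documents that mention both words.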
Unfortunately, 90% of the data in the world is in unstructured format.
Processing unstructured data has been a challenge for many organizations.
Research on unstructured data processing includes putting the data in
context, extracting objects based on language grammar, and clustering
content by similarity. Organizations that are able to manage unstructured
data stand to benefit greatly, since they can acquire better knowledge and
make better decisions.
1.2 The Internet Era
The internet has been a commodity since the dot-com boom at the end of
the nineties. The cost of access for the public becomes more affordable every
year. Internet penetration reports also show a positive trend in many
countries. With the smartphone as everybody’s communication platform,
internet access has become a basic need for everyone.
Social media also contributes significantly to the digitalization of
assets. More content is produced in digital format every day and stored on
internet-based platforms such as the cloud. This makes the internet a giant
platform of data sources, which organizations can turn to when seeking more
sources of information.
111 | ISI WSC 2019