What is Data?
The
quantities, characters, or symbols on which operations are performed by a
computer, which may be stored and transmitted in the form of electrical signals
and recorded on magnetic, optical, or mechanical recording media. These are treated as data.
What is Big Data?
Big
Data is also data but with a huge size. Big Data is a term used to describe a
collection of data that is huge in size and yet growing exponentially with
time. In short such data is so large and complex that none of the traditional
data management tools are able to store it or process it efficiently.
(Or)
Big
Data refers to the datasets too large and complex for traditional systems to
store and process. The major problems faced by Big Data majorly falls under
three Vs. They are volume, velocity, and variety.
Give Some Examples Of Big Data
Stock
Exchange :
The
New York Stock Exchange generates about one terabyte of new trade data per day.
Social
Media :
The statistic shows that 500+terabytes of new data get ingested into the databases
of social media site Facebook, every day. This data is mainly generated in
terms of photo and video uploads, message exchanges, putting comments, etc.
Jet
Engine:
A single Jet engine can generate 10+terabytes of data in 30 minutes of flight
time. With many thousand flights per day, the generation of data reaches up to many
Petabytes.
What are Types Of Big Data?
BigData'
could be found in three variations of forms. They are:
1.
Structured
2.
Unstructured
3.
Semi-structured
Structured
Any
data that can be stored, accessed, and processed in the form of fixed-format is
termed as a 'structured' data.
Ex:
Data stored in RDBMS(ORACLE – Employee Table, Student Table etc.,)
But,
over the period of time, talent in computer science has achieved greater
success in developing techniques for working with such kind of data (where the
format is well known in advance) and also deriving value out of it. However,
nowadays, we are foreseeing issues when the size of such data grows to a huge
extent, typical sizes are being in the rage of multiple zettabytes. Do you
know? 1021 bytes equal to 1 zettabyte or one billion terabytes forms a
zettabyte. Looking at these figures one can easily understand why the name Big
Data is given and imagine the challenges involved in its storage and
processing.
Unstructured
Any
data with unknown form or structure is classified as unstructured data. In
addition to the size being huge, unstructured data poses multiple challenges
in terms of its processing for deriving value out of it. A typical example of
unstructured data is a heterogeneous data source containing a combination of
simple text files, images, videos etc. Now day organizations have wealth of
data available with them but unfortunately, they don't know how to derive value
out of it since this data is in its raw form or unstructured format.
Ex: The output returned by 'Google Search'

No comments:
Post a Comment