We differentiate Big Data from traditional data by one or more of the four V’s: Volume, Velocity, Variety and Variability.
Volume is the amount of data generated that must be understood to make data-based decisions.
A text file is a few kilobytes, a sound file is a few megabytes, and a full-length movie is a few gigabytes.
Example: Amazon processes clickstream data from 15 million customers per day to recommend products.
An extremely large volume of data is a major characteristic of big data.
Velocity measures how fast data is produced and modified, and the speed with which it needs to be processed. An increased number of data sources, both machine and human generated, drives velocity.
Example: 72 hours of video are uploaded to YouTube every minute. This upload rate is the velocity.
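To get a feel for what that rate means over time, the per-minute figure can be scaled up to a full day. The sketch below is only arithmetic on the 72 hours per minute quoted above; the function and constant names are illustrative, and a 365-day year is assumed for the conversion.

```python
# Upload rate quoted in the text: 72 hours of video per minute.
HOURS_UPLOADED_PER_MINUTE = 72
MINUTES_PER_DAY = 60 * 24


def hours_uploaded_per_day(hours_per_minute: int) -> int:
    """Scale a per-minute upload rate to a full day."""
    return hours_per_minute * MINUTES_PER_DAY


daily_hours = hours_uploaded_per_day(HOURS_UPLOADED_PER_MINUTE)

# Express the daily total as years of continuous playback (365-day year assumed).
daily_years = daily_hours / (24 * 365)

print(f"{daily_hours} hours of video per day")
print(f"about {daily_years:.1f} years of footage per day")
```

At that rate, a system that processes uploads has to keep up with more than a decade of footage arriving every day.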
An extremely high velocity of data is another major characteristic of big data.
Variety describes data coming from new sources, both inside and outside of an enterprise. It can be structured, semi-structured or unstructured.
Structured data is typically found in tables with columns and rows of data. The intersection of the row and the column in a cell has a value and is given a “key,” by which it can be referred to in queries. Because there is a direct relationship between the column and the row, these databases are commonly referred to as relational databases. A retail outlet that stores its sales data (name of person, product sold, amount) in an Excel spreadsheet or CSV file is an example of structured data.
Example: A Product table in a database is an example of structured data.
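As a minimal sketch of what such a Product table might look like, the snippet below builds one in an in-memory SQLite database and looks up a row by its key. The table layout and the `id` column are assumptions made for illustration; the two rows reuse the Pen and Paper products from this article's XML example.

```python
import sqlite3

# In-memory database, so the example leaves no file behind.
conn = sqlite3.connect(":memory:")

# Hypothetical schema: every row has the same typed columns.
conn.execute(
    "CREATE TABLE product ("
    "id INTEGER PRIMARY KEY, name TEXT NOT NULL, price REAL NOT NULL)"
)
conn.executemany(
    "INSERT INTO product (id, name, price) VALUES (?, ?, ?)",
    [(1, "Pen", 7.95), (2, "Paper", 8.95)],
)

# The key identifies the row; the column names identify the cells.
row = conn.execute(
    "SELECT name, price FROM product WHERE id = ?", (2,)
).fetchone()
print(row)

conn.close()
```

Because the structure is fixed in advance, a query can address any value by row key and column name without inspecting the data first.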
Semi-structured data also has an organization, but the table structure is removed so the data can be more easily read and manipulated. XML files or an RSS feed for a webpage are examples of semi-structured data.
Example: XML file
<product>
  <name>Pen</name>
  <price>$7.95</price>
</product>
<product>
  <name>Paper</name>
  <price>$8.95</price>
</product>
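The tags carry the organization that table columns would otherwise provide, so a program can still pull out individual fields. Here is a rough sketch using Python's standard `xml.etree.ElementTree` module on the two product elements from the example. Since an XML document needs a single root element, the fragment is wrapped in a `<catalog>` element, which is an assumption added for this example and not part of the original data.

```python
import xml.etree.ElementTree as ET

# The two <product> elements from the example.
fragment = """
<product> <name>Pen </name> <price>$7.95</price> </product>
<product> <name>Paper </name> <price>$8.95</price> </product>
"""

# Wrap the fragment in a hypothetical <catalog> root so it parses as one document.
root = ET.fromstring(f"<catalog>{fragment}</catalog>")

products = []
for node in root.findall("product"):
    # Strip stray whitespace and the currency symbol before converting.
    name = node.findtext("name", default="").strip()
    price = float(node.findtext("price", default="0").strip().lstrip("$"))
    products.append((name, price))

print(products)
```

Note that the parser needs no schema in advance: the field names come from the data itself, which is what makes the format semi-structured.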
Unstructured data generally has no organizing structure, and Big Data technologies use different ways to add structure to this data. A typical example of unstructured data is a heterogeneous data source containing a combination of simple text files, images, videos, etc.
Example: The output returned by a Google search.
Variability refers to the inconsistency that data can show at times, which hampers the ability to handle and manage the data effectively.
You can see that a few values are missing in the table below.
| Department | Year | Minimum sales | Maximum sales |
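Gaps of this kind are usually located before any analysis is run. The following sketch scans a small table with the same four columns and reports which cells are empty. The department names and sales figures are hypothetical values invented for this illustration and are not taken from the table above.

```python
import csv
import io

# Hypothetical rows for illustration only; empty fields stand for missing values.
raw = """Department,Year,Minimum sales,Maximum sales
Toys,2019,1200,
Books,,800,4500
Garden,2020,950,3100
"""

reader = csv.DictReader(io.StringIO(raw))

missing = []
for row_number, record in enumerate(reader, start=1):
    for column, value in record.items():
        # DictReader yields "" for an empty field and None for a short row.
        if value is None or value.strip() == "":
            missing.append((row_number, column))

print(missing)
```

Each reported pair gives the data row and the column whose value is absent, which is the information needed to decide whether to fill, flag or drop the record.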
Available data can sometimes get messy and may be difficult to trust. With the wide variety of big data types generated, quality and accuracy are difficult to control.
Example: A Twitter post has hashtags, typos and abbreviations.
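A small sketch of how such a post might be cleaned up before analysis is shown below. The post text and the abbreviation table are made up for this example, and the approach (regular expressions plus a lookup dictionary) is one simple option among many, not a standard method.

```python
import re

# Hypothetical post, written to contain hashtags and abbreviations.
post = "Gr8 deal on pens 2day!! #BigData #sale pls RT"

# Pull out the hashtags without the leading '#'.
hashtags = re.findall(r"#(\w+)", post)

# Hypothetical lookup table; a real one would be far larger.
ABBREVIATIONS = {"gr8": "great", "2day": "today", "pls": "please", "rt": "retweet"}

# Drop the hashtags, split the rest into words, and expand known abbreviations.
body = re.sub(r"#\w+", "", post)
words = re.findall(r"[A-Za-z0-9]+", body)
normalized = [ABBREVIATIONS.get(word.lower(), word.lower()) for word in words]

print(hashtags)
print(" ".join(normalized))
```

Typos are harder to handle than abbreviations, because a lookup table only covers variants that were anticipated in advance.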