“With Big Data, things that once seemed impossible are now at hand.” José Ruiz Big Data Manager at ParadigmaBack
The massive amounts of data moving on the Internet (terabytes, petabytes…) have created the need for a technology that allows companies to extract value from this data efficiently in time and money. This technology is what is known as Big Data.
Companies can follow the behavior of their customers to get useful information that can be transformed into economic benefits. Any data can be analyzed, stored both consciously or not.
One of the most important aspects of Big Data is the possibility to treat a variety of data, many until recently ignored because of the difficulty involved in accessing them. These data types are not only structured:
Big Data: a whole internet for every user
Knowing the client thanks to the analysis of a huge volume of data of great variety and speed
Most experts speak of three important concepts (3 “Vs” volume, velocity and variety). Other trends add four more to this “V” classification (visualization, veracity, variability and value), as proposed in 2013 by Mark Van Rijmenam.
The digital universe continues to grow. Much of this information may seem irrelevant now, but maybe in the future it may be useful. To give us an idea, in the last two years 90% of all data created worldwide has been generated and is expected to grow over the next decade by 44. The Internet-mobile, e-commerce, social media … revolution generate continuous stream of information.
It refers to the speed at which data is generated as well as its processing, storage and analysis by databases.
Although there is general agreement regarding the potential value of Big Data, these data are worthless if they are not correct.
The current purpose, the extraction of the value of all the stored information. And the potential is huge value.
It refers to data whose meaning is constantly changing.
After processing the data, it is necessary to present it visibly and accessible. This is one of the great challenges of Big Data.
In relation to the three types of data previously mentioned (structured, unstructured and semi-structured). 90% of the total belongs to unstructured.