“Big Data Era” is one of the most discussed topics in technology and business today. Big Data is described both as the “challenge of organizations” and as a “magic” solution that will make a business more competitive. Many companies are betting their future on Big Data, and some even believe they have already implemented it. The truth, however, is that very few people know this set of technologies thoroughly, and what most companies actually have are data marts or other decision support systems. The question “Do I have Big Data or not?” often becomes more of a philosophical debate than a matter of technology or architecture.
The term “Big Data” was first used in July 1997 in an article by NASA researchers Michael Cox and David Ellsworth. They described and visualized the problem that the exponential growth of data poses for information systems, and in the article they called it “the problem of Big Data”.
Today Big Data, or “the problem of Big Data” as Cox and Ellsworth described it, is not limited to large volumes of data. It also covers the speed at which data arrives and is stored, and the variety of formats in which it is received. These three features, known as the “3 Vs” and first described in 2001 by Doug Laney, then an analyst at META Group (later acquired by Gartner), complete the definition of Big Data.
In addition to the “3 Vs”, other Big Data authors add two more: Veracity, meaning the accuracy of the data and the ability of the technology to validate whether the data is useful, and Value, meaning what implementing the project adds to the company. Based on my own experience with Big Data and the evolution of data in organizations, I add two further Vs that round out the objectives of Big Data: the sixth V is the visualization of information and the seventh V is advantage. These “2 Vs” refer to the strategic way of presenting past (historical) and current (real-time) information to gain an advantage over the competition.
Based on the paragraphs above, my definition of Big Data is: a unified, integrated, scalable and adaptive set of technologies that makes it possible to handle large volumes of data (Volume), arriving at “almost” real-time speed (Velocity), in a variety of structured and unstructured formats (Variety).
When implementing these technologies (software and hardware components), it is important to establish the veracity of the data in order to decide whether or not to incorporate it into the model (Veracity), and to measure the real value that implementing Big Data adds to the company (Value). It is also critical to select a very good tool for visualizing the data; it should be intuitive, fast and flexible enough to find association patterns and trends and to create new business models (Visualization), in order to gain a competitive advantage that is sustainable over time.