NYU Stern School of Business
Fall 2023
FoF
Page 3
history
• 1663: death rates during the bubonic plague; introduction of statistical data analysis
• 1865: term "business intelligence" introduced
• 1926: Nikola Tesla predicts humans will one day have access to large swaths of data via an instrument that can be carried "in [one's] vest pocket."
• 1943: Colossus, a data-processing machine (an early computer) used to decipher Nazi codes during WWII
• 1965: data center buildings to store millions of tax returns and fingerprints on magnetic tape
• 1969: ARPANET created
• 1996: digital data storage becomes more cost-effective than storing information on paper
• 1997: domain google.com registered
• 2014: more mobile devices than desktops access the internet in the US; the rest of the world follows in 2016
properties (V's)
1. volume: amount of data (data sets are growing rapidly through digitization, the internet, and mobile devices)
2. variety: diversity of data types and sources
   - structured: high degree of organization, affording easy search in relational databases (financial statements, transaction statements)
   - unstructured: compilation is time- and energy-consuming (social media posts, news articles)
   - human- vs. machine-generated
3. velocity: speed at which data is generated (e.g. high-frequency trading and electronic payment systems vs. house purchases)
4. veracity: data quality and accuracy (noisy, incomplete, inconsistent, or duplicated data)
5. variability: fluctuations and inconsistencies, e.g. seasonal, cyclical, sudden spikes
6. value: economic and strategic benefits
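The veracity property above (incomplete, inconsistent, or duplicated data) can be made concrete with a small check. The sketch below is only an illustration, not part of the course material: the records, the field names (`id`, `amount`, `currency`), and the helper `veracity_report` are all hypothetical, and treating a lowercase currency code as "inconsistent" is an assumption made for the example.

```python
# Hypothetical transaction records; every value here is illustrative only.
records = [
    {"id": 1, "amount": 120.0, "currency": "USD"},
    {"id": 2, "amount": None, "currency": "USD"},   # incomplete: missing amount
    {"id": 1, "amount": 120.0, "currency": "USD"},  # exact duplicate of the first record
    {"id": 3, "amount": 75.5, "currency": "usd"},   # inconsistent: lowercase currency code
]


def veracity_report(rows):
    """Count incomplete, duplicated, and inconsistently coded rows."""
    seen = set()
    report = {"incomplete": 0, "duplicated": 0, "inconsistent": 0}
    for row in rows:
        # Build a hashable fingerprint of the row, ordered by field name.
        key = tuple(sorted(row.items(), key=lambda kv: kv[0]))
        if key in seen:
            report["duplicated"] += 1
        seen.add(key)

        # A row is incomplete if any field is missing a value.
        if any(value is None for value in row.values()):
            report["incomplete"] += 1

        # Assumed convention for this sketch: currency codes are uppercase.
        currency = row.get("currency")
        if isinstance(currency, str) and currency != currency.upper():
            report["inconsistent"] += 1
    return report


print(veracity_report(records))
```

On real data sets the same checks would usually run in a database or a data-frame library; the point here is only that each veracity problem named in the list corresponds to a simple, testable rule.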