Which of the following is an example of big data utilized in action today?
Wi-Fi Networks
Individual, Unconnected Hospital Databases
Social Media
The Internet
What reasoning was given for the following: why is the "data storage to price ratio" relevant to big data?
Lower prices mean larger storage becomes easier to access for everyone, creating bigger amounts of data for client-facing services to work with.
Companies can't afford to own, maintain, and spend the energy to support large data storage unless the cost is sufficiently low.
Larger storage means easier accessibility to big data for every user because it allows users to download in bulk.
It isn't, it was just an arbitrary example of big data usage.
What is the best description of personalized marketing enabled by big data?
Being able to use personalized data from every single customer for personalized marketing needs.
Marketing to each customer on an individual level and suiting to their needs.
Being able to obtain and use customer information for groups of consumers and utilize them for marketing needs.
Of the following, which are some examples of personalized marketing related to big data?
Google ordering ads to show items based on recent and past search results.
A survey that asks your age and markets to you a specific brand.
News outlets gathering information from the internet in order to report them to the public.
What is the workflow for working with big data?
Theory -> Models -> Precise Advice
Extrapolation -> Understanding -> Reproducing
Big Data -> Better Models -> Higher Precision
Which is the most compelling reason why mobile advertising is related to big data?
Mobile advertising benefits from data integration with location which requires big data.
Mobile advertising allows massive cellular/mobile texting to a wide audience, thus providing large amounts of data.
Since almost everyone owns a cell/mobile phone, the mobile advertising market is large and thus requires big data to contain all the information.
Mobile advertising in and of itself is always associated with big data.
What are the three types of diverse data sources?
Information Networks, Map Data, and People
Machine Data, Organizational Data, and People
Machine Data, Map Data, and Social Media
Sensor Data, Organizational Data, and Social Media
What is an example of machine data?
Sorted data from Amazon regarding customer info.
Weather station sensor output.
Social Media
What is an example of organizational data?
Social Media
Disease data from Center for Disease Control.
Satellite Data
Of the three data sources, which is the hardest to implement and streamline into a model?
People
Organizational Data
Machine Data
Which of the following summarizes the process of using data streams?
Integration -> Personalization -> Precision
Big Data -> Better Models -> Higher Precision
Theory -> Models -> Precise Advice
Extrapolation -> Understanding -> Reproducing
Where does the real value of big data often come from?
Combining streams of data and analyzing them for new insights.
Size of the data.
Having data-enabled decisions and actions from the insights of new data.
Using the three major data sources: Machines, People, and Organizations.
What does it mean for a device to be "smart"?
Must have a way to interact with the user.
Having a specific processing speed in order to keep up with the demands of data processing.
Connect with other devices and have knowledge of the environment.
What does the term "in situ" mean in the context of big data?
In the situation
Accelerometers.
The sensors used in airplanes to measure altitude.
Bringing the computation to the location of the data.
Which of the following are reasons mentioned for why data generated by people are hard to process? Choose all that apply.
Skilled people to analyze the data are hard to come by.
The velocity of the data is very high.
They cannot be modeled and stored.
Very unstructured data.
What is the purpose of retrieval and storage; pre-processing; and analysis in order to convert multiple data sources into valuable data?
Designed to work like the ETL process.
To enable ETL methods.
To allow scalable analytical solutions to big data.
Since the multi-layered process is built into the Neo4j database connection.
Which of the following are benefits of organization-generated data? Choose all that apply.
Higher Sales
Improved Safety
High Velocity
Customer Satisfaction
Better Profit Margins
What are data silos and why are they bad?
Data produced from an organization that is spread out. Bad because it creates unsynchronized and invisible data.
A giant centralized database to house all the data production within an organization. Bad because it hinders opportunity for data generation.
Highly unstructured data. Bad because it does not provide meaningful results for organizations.
A giant centralized database to house all the data produces within an organization. Bad because it is hard to maintain as highly structured data.
Which of the following are benefits of data integration? Choose all that apply.
Reduce data complexity.
Adds value to big data.
Monitoring of data.
Unify your data system.
Increase data availability.
Increase data collaboration.