Amazon has been collecting review data for a particular product. They have realized that almost 90% of the reviews were mostly a 5/5 rating. However, of the 90%, they realized that 50% of them were customers who did not have proof of purchase or customers who did not post serious reviews about the product. Of the following, which is true about the review data collected in this situation?
Low Veracity
High Veracity
Low Volume
Low Valence
High Valence
High Volume
As mentioned in the slides, what are the challenges to data with a high valence?
Complex Data Exploration Algorithms
Difficult to Integrate
Reliability of Data
Which of the following are the 6 V's in big data?
Volume
Vision
Variety
Valence
Velocity
Value
Veracity
What is the veracity of big data?
The size of the data.
The abnormality or uncertainties of data.
The connectedness of data.
The speed at which data is produced.
What are the challenges of data with high variety?
Hard to perform emergent behavior analysis.
Hard in utilizing group event detection.
Hard to integrate.
The quality of data is low.
Which of the following is the best way to describe why it is crucial to process data in real-time?
More accurate.
More expensive to batch process.
Prevents missed opportunities.
Batch processing is an older method that is not as accurate as real-time processing.
What are the challenges with big data that has high volume?
Speed Increase in Processing
Storage and Accessibility
Effectiveness and Cost
Cost, Scalability, and Performance