Difference between Big Data and Data Science

Big Data

  1. Hugh volumes of data which cannot be handled using traditional database programming.
  2. Characterized by volume, variety, and velocity.

Data Science

  1. A data-focused on scientific activity.
  2. Approaches to process big data.
  3. Harnesses the potential of big data for business decisions.
  4. Similar to data mining.

Concept

Big Data

  1. Diverse data types generated from multiple data sources.
  2. Includes all types and formats of data.

Data Science

  1. A specialized area involving scientific programming tools, models and techniques to process big data.
  2. Provides techniques to extract insights and information from large datasets.
  3. Supports organizations in decision making.

Basic of formation

Big Data

  1. Internet users/traffic
  2. Electronic devices (sensors, RFID, etc.)
  3. Audio/video streams including live feeds.
  4. Online discussion forums.
  5. Data generated in organizations (transactions, DB, spreadsheets, emails, etc.)
  6. Data generated from system logs.

Data Science

  1. Applies scientific methods to extract knowledge from big data.
  2. Related to data filtering, preparation, and analysis.
  3. Capture complex patterns from big data and develop models.
  4. Working apps are created by programming developed models.

Application areas

Big Data

  • Financial services
  • Telecommunications
  • Optimizing business process
  • Performance optimization
  • Health and sports
  • Improving commerce
  • Research and development
  • Security and law enforcement

Data Science

  • Internet search
  • Digital advertisement
  • Search recommenders
  • Image/speech recognition
  • Fraud, risk detection
  • Web development
  • Other miscellaneous areas/utilities

Approach

Big Data

  • To develop business agility
  • To gain competitiveness
  • Leverage datasets for business advantage
  • Establish realistic metrics and ROI
  • To achieve sustainability
  • To understand markets and gain new customers

Data Science

  • Involves extensive use of mathematics, statistics, and other tools
  • State-of-the-art technique/algorithms for data mining
  • Programming skills (SQL, NoSQL), Hadoop platforms
  • Data acquisition, preparation, processing, publishing, preserve or destroy
  • Data visualization, prediction

Popular Posts

Spread the knowledge
 
  

Leave a Reply

Your email address will not be published. Required fields are marked *