
Data Science and Machine Learning

 Basic Concepts of Data Science and Machine Learning 


Data Science is the extraction and analysis of relevant information from data.

Machine Learning is the part of Data Science that enables a system to process datasets autonomously, without human intervention, by applying algorithms to the massive volumes of data generated and extracted from numerous sources.
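To make the idea of "learning from data" concrete, here is a minimal sketch in plain Python. It fits a straight line to a handful of points by ordinary least squares and then predicts a new value. The data (advertising spend versus units sold) and the function name fit_line are made up for illustration; real projects would normally use a library rather than hand-written code.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope is covariance(x, y) divided by variance(x).
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    slope = cov_xy / var_x
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical training data: advertising spend vs. units sold.
spend = [1.0, 2.0, 3.0, 4.0, 5.0]
sales = [3.0, 5.0, 7.0, 9.0, 11.0]

slope, intercept = fit_line(spend, sales)

# Use the learned parameters to predict sales for an unseen spend value.
predicted = slope * 6.0 + intercept
print(slope, intercept, predicted)
```

No rule such as "sales = 2 × spend + 1" is written anywhere in the code; the parameters are estimated from the examples, which is the core idea behind Machine Learning.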


Benefits:

Data Science
  • Helps in finding and refining target audiences
  • Ensures better communication between service providers and service users
  • Improves business value and enables better risk analysis

Machine Learning

  • Supports marketing and produces more accurate sales forecasts
  • Helps in making accurate medical diagnoses
  • Helps eliminate duplicate and erroneous data
  • Helps in spam detection
  • Provides appropriate product recommendations

Required Expertise

Data Science 

  • Programming Skills
  • Data Warehousing 
  • Statistics
  • Mathematics
  • Software Engineering
  • Data visualization and communication 

Machine Learning

  • Basic Programming Skills
  • Statistics
  • Mathematics
  • System Design
  • Software Engineering

Applications:

Data Science

  • Recommender Systems
  • Internet Search Engine
  • Image recognition
  • Speech Recognition
  • Gaming
  • Airline Route Planning
  • Comparative price analysis
  • Fraud and risk detection
  • Robotics
  • Self-driving cars

Machine Learning

  • Virtual Personal Assistant
  • Video Surveillance
  • Online Fraud detection
  • Social Media Services
  • Email Spam and Malware filtering 
  • Operational Client Support
  • Product recommendation
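
A product recommendation feature can be sketched very simply by counting which products are bought together. The example below is a co-occurrence heuristic ("customers who bought X also bought Y"), not a trained model, and the baskets and function names are hypothetical. It is only meant to show the flavour of the idea.

```python
from collections import Counter
from itertools import combinations


def build_cooccurrence(baskets):
    """Count how often each ordered pair of products shares a basket."""
    pairs = Counter()
    for basket in baskets:
        for a, b in combinations(sorted(set(basket)), 2):
            pairs[(a, b)] += 1
            pairs[(b, a)] += 1
    return pairs


def recommend(product, pairs, top_n=2):
    """Return up to top_n products most often bought with `product`."""
    scores = Counter({b: n for (a, b), n in pairs.items() if a == product})
    return [item for item, _ in scores.most_common(top_n)]


# Hypothetical purchase history, invented for this sketch.
baskets = [
    ["laptop", "mouse", "bag"],
    ["laptop", "mouse"],
    ["laptop", "bag"],
    ["laptop", "mouse", "keyboard"],
    ["phone", "case"],
]

pairs = build_cooccurrence(baskets)
suggestions = recommend("laptop", pairs)
print(suggestions)
```

Real recommender systems usually go further, for example with collaborative filtering or learned embeddings, but they build on the same kind of purchase or rating data.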

Top Tools

Data Science

  • Python
  • R (a language for statistical computing and graphics)
  • Jupyter Notebook
  • Tableau
  • Keras

Machine Learning 

  • Python
  • C++
  • R (a language for statistical computing and graphics)
  • Jupyter Notebook
  • Tableau


Any comments or suggestions are welcome.

Thanks for reading this blog. 


 
