ML Summary
depth
2
This document contains summary notes about various core concepts.
Core concepts employed by data scientist:
- Supervised Learning
- Unsupervised Learning
- Semi-Supervised Learning
- Reinforcement Learning
- Statistical Analysis and Methods
- Basic Machine Learning Approaches
- Natural Language Processing (NLP)
- Deep Learning
- Visualization and Communication
- Business Intelligence Integration
Various ML Libraries
Definition: A machine learning paradigm where the model is trained on labeled data (input-output pairs).
Applications:
Key Algorithms:
Definition: A paradigm where the model identifies patterns in unlabeled data.
Applications:
Key Algorithms:
Definition: Combines a small amount of labeled data with a large amount of unlabeled data.
Applications: Useful when labeling data is expensive or time-consuming (e.g., text classification in NLP)
Definition: A learning paradigm where an agent learns by interacting with an environment to maximize rewards.
Applications: Optimizing supply chains, dynamic pricing, and recommendation systems.
Applications:
Applications: Leveraging neural networks for tasks like:
For visual insights tools like:
Combining predictive analytics, dashboards, and automated reporting systems to facilitate data-driven decision-making.
Various ML libraries used from over all least complext to most complex order are given below.
Numpy and Pandas are Foundational Libraries which provide basic utilities for numerical operations and data manipulation.
Statistical Analysis libraries focus on exploratory data analysis and statistical hypothesis testing.
Core libraries for classical and advanced machine learning algorithms. Core libraries for classical and advanced machine learning algorithms.
Core libraries for classical and advanced machine learning algorithms.
Core libraries for classical and advanced machine learning algorithms.
Core libraries for classical and advanced machine learning algorithms.
Key Features:
Specialized frameworks for designing and training neural networks.
Libraries dedicated to text processing and linguistic analysis.
Libraries for domain-specific tasks or scaling.
Scope: Computer vision tasks. Key Goals: Image processing, object detection. Complexity: High.
Library/Tool |
Scope |
Key Goals |
---|---|---|
NumPy, Pandas |
Data manipulation |
Data cleaning, numerical ops |
Matplotlib, Seaborn |
Data visualization |
Static and statistical plots |
Scikit-learn |
Classical ML |
Train ML models (SVM, RF) |
TensorFlow, PyTorch |
Deep learning |
Neural networks, DL models |
Gensim, spaCy |
NLP |
Topic modeling, linguistic ops |
XGBoost, LightGBM |
Boosting models |
High-performance ML |
OpenCV, SHAP |
Specialized tools |
Vision tasks, model explainers |
Library/Tool | Scope | Key Goals |
---|---|---|
NumPy, Pandas | Data manipulation | Data cleaning, numerical ops |
Matplotlib, Seaborn | Data visualization | Static and statistical plots |
Scikit-learn | Classical ML | Train ML models (SVM, RF) |
TensorFlow, PyTorch | Deep learning | Neural networks, DL models |
Gensim, spaCy | NLP | Topic modeling, linguistic ops |
XGBoost, LightGBM | Boosting models | High-performance ML |
OpenCV, SHAP | Specialized tools | Vision tasks, model explainers |