Contents Menu Expand Light mode Dark mode Auto light/dark, in light mode Auto light/dark, in dark mode Skip to content
Zero to AI
Logo
Zero to AI

Roadmaps

  • Overview
  • Core Systems
  • Advanced Topics
  • End-to-End Flows

Foundations

  • Course Setup
    • AI Model Landscape: March 6, 2026
    • Troubleshooting Guide
  • Python Fundamentals
  • Data Science Foundations
    • How To Become a Data Engineer
    • NumPy Examples - Consolidated & Deduplicated
      • πŸ“Š Content Statistics
      • πŸ’‘ Study Tips
      • πŸ“– Additional Resources
      • πŸ—ΊοΈ Next Steps
      • 01 Basics
        • Data types
        • Comparison Operators
        • Logic Operators
        • if, elif, else Statements
        • for Loops
        • while Loops
        • range()
        • List Comprehension
        • Functions
        • Lambda Expressions
        • map and filter
        • Methods
        • Great Job!
        • Exercises
        • Great job!
        • Load in NumPy
        • The Basics
        • Accessing/Changing Specific Elements, Rows, Columns
        • 3-D Array Example
        • Initializing Different Types of Arrays
        • Be Careful When Copying Arrays
        • Mathematics
        • Linear Algebra
        • Statistics
        • Reorganizing Arrays
        • Miscellaneous
        • NumPy Exercises
        • Great Job!
        • NumPy Operations
        • Great Job!
        • NumPy
        • Numpy Arrays
        • Great Job!
        • NumPy Indexing and Selection
        • Great Job!
      • 02 Intermediate
        • Copy vs. View in NumPy
        • Filtering NumPy Arrays with Boolean Indexing
        • NumPy Universal Functions (ufuncs)
        • Square Root – np.sqrt()
        • Absolute Value – np.absolute()
        • Exponential – np.exp()
        • Min and Max – np.min() / np.max()
        • Sign Function – np.sign()
        • Introduction to NumPy Arrays
        • Array Creation Methods
        • Iterating Over NumPy Arrays
        • Iterating a 1-D Array
        • Iterating 2-D and 3-D Arrays
        • Sorting NumPy Arrays
        • Sorting Numbers
        • Sorting Strings Alphabetically
        • Sorting Booleans
        • Searching NumPy Arrays with np.where()
        • Retrieving Search Results
        • Finding Even and Odd Numbers
        • Reshaping NumPy Arrays
        • 2-D Array Shape
        • Reshaping from 1-D to 2-D
        • Slicing 1-D NumPy Arrays
        • Slicing 2-D NumPy Arrays
      • 03 Exercises
        • 100 numpy exercises
        • 100 numpy exercises
        • Random Sampling
        • Random Sampling – Solutions
        • Set Routines
        • Set Routines – Solutions
        • Sorting, Searching, and Counting
        • Sorting, Searching, and Counting – Solutions
        • Statistics
        • Statistics – Solutions
        • Array Creation Routines
        • Array Creation Routines – Solutions
        • Array Manipulation Routines
        • Array Manipulation Routines – Solutions
        • String Operations
        • Comparison
        • String Information
        • String Operations – Solutions
        • Comparison
        • String Information
        • NumPy-Specific Help Functions
        • NumPy-Specific Help Functions β€” Solutions
        • Input and Output
        • Input and Output – Solutions
        • Linear Algebra
        • Linear Algebra – Solutions
        • Discrete Fourier Transform
        • Complex Numbers
        • Discrete Fourier Transform
        • Window Functions
        • Complex Numbers
        • Discrete Fourier Transform
        • Window Functions
        • Logic Functions
        • Logic Functions – Solutions
        • Mathematical Functions
        • Mathematical Functions – Solutions
      • 04 Advanced
        • Numpy Tutorials
          • Determining Moore’s Law with real data in NumPy
          • Pairing Jupyter notebooks and MyST-NB
          • Saving and sharing your NumPy arrays
          • Analyzing the impact of the lockdown on air quality in Delhi, India
          • Deep learning on MNIST
          • Deep reinforcement learning with Pong from pixels
          • Masked Arrays
          • Sentiment Analysis on notable speeches of the last decade
          • Plotting Fractals
          • Determining Static Equilibrium in NumPy
          • Learn to write a NumPy tutorial
          • Linear algebra on n-dimensional arrays
          • X-ray image processing
          • NLP from Scratch Tutorial Data
    • Pandas Examples - Consolidated & Organized
      • Pandas Examples Consolidation Summary
      • πŸ“Š What You’ll Learn
      • πŸ’‘ Pro Tips for Success
      • πŸ“– Essential pandas Operations
      • πŸ“ˆ Progress Tracking
      • πŸ”— Additional Resources
      • πŸ“Š Collection Statistics
      • πŸŽ“ Ready to Begin?
      • 01 Basics
        • Differences between Shared Methods
        • Selecting One Column from a DataFrame
        • Select Two or More Columns in DataFrame
        • Add New Column to DataFrame
        • Broadcasting Operations
        • A Review of the .value_counts() Method
        • Drop Rows with Null Values
        • Fill in Null Values with the .fillna() Method
        • The .astype() Method
        • Sort a DataFrame with the .sort_values() Method, Part 1
        • Sort a DataFrame with the .sort_values() Method, Part 2
        • Sort a DataFrame with the .sort_index() Method
        • Rank Values with the .rank() Method
        • Filter a DataFrame Based on a Condition
        • Filter with More than One Condition (AND)
        • Filter with More than One Condition (OR)
        • The .isnull() and .notnull() Methods
        • The .between() Method
        • The .duplicated() Method
        • The .drop_duplicates() Method
        • The .unique() and .nunique() Methods
        • set_index() and reset_index() Methods
        • Retrieve Rows by Index Label with .loc[]
        • Retrieve Row(s) by Index Position with iloc
        • The Catch-All .ix[] Method
        • Second Argument to .loc[], .iloc[], and .ix[] Methods
        • Set New Values for Specific Cell or Row
        • Set Multiple Values in DataFrame
        • Rename Index Labels or Columns in a DataFrame
        • Delete Rows or Columns from a DataFrame
        • Create Random Sample
        • The .nsmallest() and .nlargest() Methods
        • Filtering with the where Method
        • The .query() Method
        • A Review of the .apply() Method on Single Columns
        • The .copy() Method
        • Data Cleaning in Pandas
        • EDA in Pandas
        • Filtering and Ordering
        • Group by and Aggregating
        • Indexing
        • Merge, Join, and Concatenate
        • Pandas Series and DataFrames
        • Reading In Files
        • Pandas Visualization
        • Pandas Numpy Lessons
          • Lesson1
            • Exploratory Analysis
            • Create a Pandas Dataframe
            • Introduction to Pandas
            • Loading data into Pandas
            • Load a CSV from a local file
            • Load JSON from a local file
            • You can read from many formats
            • Writing data from Pandas Dataframes
            • Write to many other destinations
            • Copy/Paste into other formats
          • Lesson2
            • Applying Functions
            • Common Dataframe operations
            • Manipulating text in DataFrames
            • Visualizing data
          • Lesson3
            • Common array operations
            • Introduction to NumPy arrays
            • More array operations
      • 02 Intermediate
        • Apple Health Data – Real-World Data Analysis Project
        • DataFrame Fundamentals – Inspection, Selection, and Cleaning
        • DataFrame Operations – Filtering, Deduplication, and Unique Values
        • DataFrame Operations – Indexing, Renaming, Querying, and Applying Functions
        • Electronic Production in India – Data Analysis and Visualization
        • GroupBy – Split-Apply-Combine for Aggregation
        • Input and Output – Reading and Writing Data
        • Merge, Join, and Concatenate – Combining DataFrames
        • MultiIndex – Hierarchical Indexing, Pivoting, and Reshaping
        • Options and Settings – Controlling Pandas Display Behavior
        • Data Cleaning in Pandas
        • EDA in Pandas
        • Filtering and Ordering
        • Group by and Aggregating
        • Indexing
        • Merge, Join, and Concatenate
        • Pandas Series and DataFrames
        • Reading In Files
        • Pandas Visualization
        • Panels – Three-Dimensional Data Structures (Deprecated)
        • India Electronic Production – Interactive Visualization with Plotly
        • Reading CSV and Working with Series
        • Creating Series in Python
        • Tamil Nadu Population Literacy Analysis
        • Visualization – Plotting Stock Data with Pandas and Matplotlib
        • Working with Date and Time
        • Working with Text Data
        • Geographic Visualization – Flight Path Maps with Plotly
      • 03 Exercises
        • 100 pandas puzzles
        • 100 pandas puzzles
      • 05 Real World Projects
        • Apple Health Data – Real-World Analysis Project
        • Electronic Production in India – Data Analysis and Visualization
        • India Production Data – Advanced Plotly Visualizations
    • Data Science Examples
      • πŸ“– Study Schedule Examples
      • πŸ’‘ Success Tips
      • πŸŽ“ Skill Progression
      • πŸ“Š Track Your Progress
      • πŸ”— Essential Resources
      • 🎯 Your Next Steps
      • πŸ’ͺ Motivation
      • πŸš€ Ready to Begin?
      • Data Science for Beginners - A Curriculum
      • Are you a student?
      • Getting Started
        • Microsoft Open Source Code of Conduct
        • Contributing
        • Security
        • Reporting Security Issues
        • Preferred Languages
        • Policy
        • Support
        • Contribute by translating lessons
        • For Educators
        • Using the repo as is
        • Included in this curriculum:
        • Please give us your thoughts!
        • Introduction to Data Science
          • Defining Data Science
            • Assignment: Data Science Scenarios
            • Challenge: Analyzing Text about Data Science
            • Solution
              • Assignment: Data Science Scenarios
              • Challenge: Analyzing Text about Data Science
          • Introduction to Data Ethics
          • Assignment
            • Write A Data Ethics Case Study
            • Instructions
            • Rubric
          • Defining Data
            • Classifying Datasets
          • A Brief Introduction to Statistics and Probability
            • Introduction to Probability and Statistics
            • Assignment
            • Introduction to Probability and Statistics
            • Solution
              • Introduction to Probability and Statistics
              • Assignment
        • Working with Data
          • Working with Data: Relational Databases
            • Displaying airport data
          • Working with Data: Non-Relational Data
            • Soda Profits
          • Working with Data: Python and the Pandas Library
            • Assignment for Data Processing in Python
            • Estimation of COVID-19 Pandemic
            • Analyzing COVID-19 Papers
            • Basic Pandas Examples
            • DataFrame
            • Printing and Plotting
            • R
              • Pandas Usecase in R
              • Series
              • DataFrame
              • Printing and Plotting
          • Working with Data: Data Preparation
            • Assignment: Evaluating Data from a Form
            • Data Preparation
        • Visualizations
          • Visualizing Quantities
            • Lines, Scatters and Bars
            • Let’s learn about birds
            • Solution
              • Let’s learn about birds
          • Visualizing Distributions
            • Apply your skills
            • Bird distributions
            • Solution
              • Bird distributions
          • Visualizing Proportions
            • Try it in Excel
            • πŸ„ Mushroom Proportions
            • Solution
              • πŸ„ Mushroom Proportions
              • Pie chart
              • Donut chart
              • Waffle chart
          • Visualizing Relationships: All About Honey 🍯
            • Dive into the beehive
            • Visualizing Honey Production 🍯 🐝
            • Solution
              • Visualizing Honey Production 🍯 🐝
          • Making Meaningful Visualizations
            • Build your own custom vis
            • Dangerous Liaisons data visualization project
            • Dangerous Liaisons data visualization project
          • R
            • Visualizing Quantities
              • Lines, Scatters and Bars
            • Visualizing Distributions
              • Apply your skills
            • Visualizing Proportions
            • Visualizing Relationships: All About Honey 🍯
            • Making Meaningful Visualizations
        • The Data Science Lifecycle
          • Introduction to the Data Science Lifecycle
            • Assessing a Dataset
            • NYC Taxi data in Winter and Summer
          • The Data Science Lifecycle: Analyzing
            • NYC Taxi data in Winter and Summer
            • Use the cells below to do your own Exploratory Data Analysis
            • Analyzing Data
          • The Data Science Lifecycle: Communication
          • Introduction
          • Effective Communication
          • Communication Case Study
          • Conclusion
            • Tell a story
        • Data Science in the Cloud
          • Introduction to Data Science in the Cloud
            • Market Research
          • Data Science in the Cloud: The β€œLow code/No code” way
            • Low code/No code Data Science project on Azure ML
          • Data Science in the Cloud: The β€œAzure ML SDK” way
            • Data Science project using Azure ML SDK
            • Data Science in the Cloud: The β€œAzure ML SDK” way
            • Solution
              • Data Science in the Cloud: The β€œAzure ML SDK” way
        • Data Science in the Wild
          • Data Science in the Real World
            • Explore a Planetary Computer Dataset
        • Docs
        • Quizzes
        • Credits
      • Core Data Science Topics
        • Matplotlib Reference
          • Visualization with Matplotlib
          • Simple Line Plots
          • Simple Scatter Plots
          • Visualizing Errors
          • Density and Contour Plots
          • Histograms, Binnings, and Density
          • Customizing Plot Legends
          • Customizing Colorbars
          • Multiple Subplots
          • Text and Annotation
          • Customizing Ticks
          • Customizing Matplotlib: Configurations and Stylesheets
          • Three-Dimensional Plotting in Matplotlib
          • Geographic Data with Basemap
          • Visualization with Seaborn
          • Further Resources
          • matplotlib-applied
          • matplotlib
        • Numpy Reference
          • Introduction to NumPy
          • Understanding Data Types in Python
          • The Basics of NumPy Arrays
          • Computation on NumPy Arrays: Universal Functions
          • Aggregations: Min, Max, and Everything In Between
          • Computation on Arrays: Broadcasting
          • Comparisons, Masks, and Boolean Logic
          • Fancy Indexing
          • Sorting Arrays
          • Structured Data: NumPy’s Structured Arrays
          • NumPy
        • Pandas Reference
          • Data Manipulation with Pandas
          • Introducing Pandas Objects
          • Data Indexing and Selection
          • Operating on Data in Pandas
          • Handling Missing Data
          • Hierarchical Indexing
          • Combining Datasets: Concat and Append
          • Combining Datasets: Merge and Join
          • Aggregation and Grouping
          • Pivot Tables
          • Vectorized String Operations
          • Working with Time Series
          • High-Performance Pandas: eval() and query()
          • Further Resources
          • Pandas
      • Machine Learning Examples
        • Kaggle Notebooks
          • Kaggle Machine Learning Competition: Predicting Titanic Survivors
        • Scikit Learn Reference
          • Density Estimation: Gaussian Mixture Models
          • Introduction to scikit-learn
          • Machine Learning Models Cheat Sheet
          • The Estimator API
          • The Iris Dataset
          • K-Nearest Neighbors Classifier
          • K-Means Clustering with scikit-learn
          • K-Means on the Iris Dataset
          • The K-Means Algorithm: Expectation Maximization
          • Linear Regression with scikit-learn
          • Linear Regression
          • Principal Component Analysis (PCA) with scikit-learn
          • PCA on the Iris Dataset
          • Dimensionality Reduction: Principal Component Analysis in-depth
          • Random Forests with scikit-learn
          • Decision Trees: The Building Block
          • Creating a Decision Tree
          • Decision Trees and over-fitting
          • Ensembles of Estimators: Random Forests
          • Random Forest Limitations
          • Support Vector Machines (SVM) with scikit-learn
          • Linear SVM Classifier
          • Support Vector Machine with Kernels Classifier
          • Validation and Model Selection
          • Fig Code
            • scikit-learn
      • Deep Learning Examples
        • Tensorflow Keras
          • Deep Dream
            • Deep Dreams (with Caffe)
          • Keras Tutorial
            • Outline (Draft)
            • Yam Peleg, Valerio Maggio
            • Goal of this Tutorial
            • (Tentative) Schedule
            • Requirements
            • How to set up your environment
            • Recreate the Conda Environment
            • Test if everything is up&running
            • Consulting Material
            • Introduction to Deep Learning
            • Artificial Neural Networks (ANN)
            • Building Neural Nets from scratch
            • Addendum
            • Theano
            • Symbolic variables
            • Evaluating expressions
            • Other tensor types
            • Automatic differention
            • Shared Variables
            • Updates
            • About the data
            • Keras
            • β€œData Sciencing” this example a little bit more
            • A simple implementation of ANN for MNIST
            • Convolutional Neural Network
            • The Problem Space
            • Convolutional Layer
            • Going Deeper Through the Network
            • CNN in Keras
            • ConvNet HandsOn with Keras
            • Basic data analysis on the dataset
            • Convolution Nets for MNIST
            • A simple CNN
            • Adding more Dense Layers
            • Adding Dropout
            • Adding more Convolution Layers
            • Exercise
            • Batch Normalisation
            • Practical Deep Learning
            • VGG16
            • Fine Tuning of a Pre-Trained Model
            • Hands On:
            • Unsupervised learning
            • Natural Language Processing using Artificial Neural Networks
            • Word Embeddings
            • Convolutional Neural Networks for Sentence Classification
            • Another Example
            • Recurrent Neural networks
            • Sentence Generation using RNN(LSTM)
            • RNN using LSTM
            • Using TFIDF Vectorizer as an input instead of one hot encoder
            • Sentence Generation using LSTM
            • Conclusions
            • Trained image classification models for Keras
          • Tensor Flow Examples
            • Download and Setup
            • Notebooks
              • 1 Intro
                • Basic Operations in TensorFlow
              • 2 Basic Classifiers
                • Linear Regression in TensorFlow
                • Logistic Regression in TensorFlow
                • Nearest Neighbor in TensorFlow
              • 3 Neural Networks
                • AlexNet in TensorFlow
                • Convolutional Neural Network (CNN) in TensorFlow
                • Multilayer Perceptron (MLP) in TensorFlow
                • Recurrent Neural Network (LSTM) in TensorFlow
              • 4 Multi Gpu
                • Basic Multi-GPU Computation in TensorFlow
              • 5 Ui
                • Graph Visualization with TensorBoard
                • Run the command line
                • Open http://localhost:6006/ into your web browser
                • Loss Visualization with TensorBoard
                • Run the command line
                • Open http://localhost:6006/ into your web browser
          • Exercises
            • Deep Learning with TensorFlow
            • Deep Learning with TensorFlow
            • Deep Learning with TensorFlow
            • Deep Learning with TensorFlow
            • Deep Learning with TensorFlow
            • Deep Learning with TensorFlow
          • Theano Tutorial
            • Intro Theano
              • Introduction to Theano
              • Graph definition and Syntax
              • Graph Transformations
              • Advanced Topics
              • Logistic Regression in Theano
            • Rnn Tutorial
              • Introduction
              • Recurrent Neural Networks in Theano
              • Generating sequences
            • Scan Tutorial
              • Introduction to Scan in Theano
            • Theano Mlp
              • Multilayer Perceptron in Theano
      • Reference Notebooks
        • Aws
          • Amazon Web Services (AWS)
        • Spark
          • HDFS
          • Spark
    • Matplotlib
      • Matplotlib interactive examples
      • Plot Types Jupyter
        • 3D
          • 3D Bar Charts
          • 3D Fill Between
          • 3D Line Plots
          • 3D Quiver (Vector Field) Plots
          • 3D Scatter Plots
          • 3D Stem Plots
          • 3D Surface Plots
          • Triangulated Surface Plots
          • Voxel Plots
          • 3D Wireframe Plots
        • Arrays
          • Wind Barb Plots
          • Contour Line Plots
          • Filled Contour Plots
          • Image Display
          • Pseudocolor Mesh Plots
          • 2D Vector Field Plots
          • Streamline Plots
        • Basic
          • Bar Charts
          • Fill Between Curves
          • Line and Marker Plots
          • Scatter Plots
          • Stacked Area Charts
          • Step Function Plots
          • Stem Plots
        • Stats
          • Box and Whisker Plots
          • Empirical Cumulative Distribution Functions
          • Error Bar Plots
          • Event Plots
          • Hexagonal Binning
          • 2D Histograms
          • Histograms
          • Pie Charts
          • Violin Plots
        • Unstructured
          • Contour Lines on Unstructured Grids
          • Filled Contours on Unstructured Grids
          • Pseudocolor Plots on Unstructured Grids
          • Triangulation Visualization
    • Scikit-Learn Examples
      • Applications
        • Plot Cyclical Feature Engineering
        • Plot Digits Denoising
        • Plot Face Recognition
        • Plot Model Complexity Influence
        • Plot Out Of Core Classification
        • Plot Outlier Detection Wine
        • Plot Prediction Latency
        • Plot Species Distribution Modeling
        • Plot Stock Market
        • Plot Time Series Lagged Features
        • Plot Tomography L1 Reconstruction
        • Plot Topics Extraction With Nmf Lda
        • Wikipedia Principal Eigenvector
      • Bicluster
        • Plot Bicluster Newsgroups
        • Plot Spectral Biclustering
        • Plot Spectral Coclustering
      • Calibration
        • Plot Calibration
        • Plot Calibration Curve
        • Plot Calibration Multiclass
        • Plot Compare Calibration
      • Classification
        • Plot Classification Probability
        • Plot Classifier Comparison
        • Plot Digits Classification
        • Plot Lda
        • Plot Lda Qda
      • Cluster
        • Plot Adjusted For Chance Measures
        • Plot Affinity Propagation
        • Plot Agglomerative Clustering Metrics
        • Plot Hierarchical Clustering Dendrogram
        • Plot Birch Vs Minibatchkmeans
        • Plot Bisect Kmeans
        • Plot Cluster Comparison
        • Plot Coin Segmentation
        • Plot Coin Ward Segmentation
        • Plot Dbscan
        • Plot Dict Face Patches
        • Plot Digits Agglomeration
        • Plot Digits Linkage
        • Plot Face Compress
        • Plot Feature Agglomeration Vs Univariate Selection
        • Demo of HDBSCAN Clustering Algorithm
        • Plot Inductive Clustering
        • Plot Kmeans Assumptions
        • Plot Kmeans Digits
        • Plot Kmeans Plusplus
        • Plot Kmeans Silhouette Analysis
        • Plot Kmeans Stability Low Dim Dense
        • Plot Linkage Comparison
        • Plot Mean Shift
        • Plot Mini Batch Kmeans
        • Plot Optics
        • Plot Segmentation Toy
        • Plot Ward Structured Vs Unstructured
      • Compose
        • Plot Column Transformer
        • Plot Column Transformer Mixed Types
        • Plot Compare Reduction
        • Plot Digits Pipe
        • Plot Feature Union
        • Plot Transformed Target
      • Covariance
        • Plot Covariance Estimation
        • Plot Lw Vs Oas
        • Robust Covariance Estimation and Mahalanobis Distances
        • Robust vs Empirical Covariance Estimate
        • Plot Sparse Cov
      • Cross Decomposition
        • Plot Compare Cross Decomposition
        • Plot Pcr Vs Pls
      • Datasets
        • Plot Random Multilabel Dataset
      • Decomposition
        • Plot Faces Decomposition
        • Plot Ica Blind Source Separation
        • Plot Ica Vs Pca
        • Plot Image Denoising
        • Plot Incremental Pca
        • Plot Kernel Pca
        • Plot Pca Iris
        • Plot Pca Vs Fa Model Selection
        • Plot Pca Vs Lda
        • Plot Sparse Coding
        • Plot Varimax Fa
      • Developing Estimators
        • Sklearn Is Fitted
      • Ensemble
        • Plot Adaboost Multiclass
        • Plot Adaboost Regression
        • Plot Adaboost Twoclass
        • Plot Bias Variance
        • Plot Ensemble Oob
        • Plot Feature Transformation
        • Plot Forest Hist Grad Boosting Comparison
        • Plot Forest Importances
        • Plot Forest Iris
        • Plot Gradient Boosting Categorical
        • Plot Gradient Boosting Early Stopping
        • Plot Gradient Boosting Oob
        • Plot Gradient Boosting Quantile
        • Plot Gradient Boosting Regression
        • Plot Gradient Boosting Regularization
        • Plot Hgbt Regression
        • Plot Isolation Forest
        • Plot Monotonic Constraints
        • Plot Random Forest Embedding
        • Plot Random Forest Regression Multioutput
        • Plot Stack Predictors
        • Plot Voting Decision Regions
        • Plot Voting Regressor
      • Feature Selection
        • Plot F Test Vs Mi
        • Plot Feature Selection
        • Plot Feature Selection Pipeline
        • Plot Rfe Digits
        • Plot Rfe With Cross Validation
        • Plot Select From Model Diabetes
      • Frozen
        • Plot Frozen Examples
      • Gaussian Process
        • Plot Compare Gpr Krr
        • Plot Gpc
        • Plot Gpc Iris
        • Plot Gpc Isoprobability
        • Plot Gpc Xor
        • Plot Gpr Co2
        • Plot Gpr Noisy
        • Plot Gpr Noisy Targets
        • Plot Gpr On Structured Data
        • Plot Gpr Prior Posterior
      • Impute
        • Plot Iterative Imputer Variants Comparison
        • Plot Missing Values
      • Inspection
        • Plot Causal Interpretation
        • Plot Linear Model Coefficient Interpretation
        • Plot Partial Dependence
        • Plot Permutation Importance
        • Plot Permutation Importance Multicollinear
      • Kernel Approximation
        • Plot Scalable Poly Kernels
      • Linear Model
        • Plot Ard
        • Plot Bayesian Ridge Curvefit
        • Plot Elastic Net Precomputed Gram Matrix With Weighted Samples
        • Plot Huber Vs Ridge
        • Plot Lasso And Elasticnet
        • Plot Lasso Dense Vs Sparse Data
        • Plot Lasso Lars Ic
        • Plot Lasso Lasso Lars Elasticnet Path
        • Plot Lasso Model Selection
        • Plot Logistic L1 L2 Sparsity
        • Plot Logistic Multinomial
        • Plot Logistic Path
        • Plot Multi Task Lasso Support
        • Plot Nnls
        • Plot Ols Ridge
        • Plot Omp
        • Poisson Regression and Non-Normal Loss
        • Plot Polynomial Interpolation
        • Plot Quantile Regression
        • Plot Ransac
        • Plot Ridge Coeffs
        • Plot Ridge Path
        • Plot Robust Fit
        • Plot Sgd Early Stopping
        • Plot Sgd Iris
        • Plot Sgd Loss Functions
        • Plot Sgd Penalties
        • Plot Sgd Separating Hyperplane
        • Plot Sgd Weighted Samples
        • Plot Sgdocsvm Vs Ocsvm
        • Plot Sparse Logistic Regression 20Newsgroups
        • Plot Sparse Logistic Regression Mnist
        • Plot Theilsen
        • Tweedie Regression on Insurance Claims
      • Manifold
        • Plot Compare Methods
        • Plot Lle Digits
        • Plot Manifold Sphere
        • Plot Mds
        • Plot Swissroll
        • Plot T Sne Perplexity
      • Miscellaneous
        • Plot Anomaly Comparison
        • Plot Display Object Visualization
        • Plot Estimator Representation
        • Plot Isotonic Regression
        • The Johnson-Lindenstrauss Bound for Random Projections
        • Plot Kernel Approximation
        • Plot Kernel Ridge Regression
        • Plot Metadata Routing
        • Plot Multilabel
        • Plot Multioutput Face Completion
        • Plot Outlier Detection Bench
        • Plot Partial Dependence Visualization Api
        • Plot Pipeline Display
        • Plot Roc Curve Visualization Api
        • Plot Set Output
      • Mixture
        • Plot Concentration Prior
        • Plot Gmm
        • Plot Gmm Covariances
        • Plot Gmm Init
        • Plot Gmm Pdf
        • Plot Gmm Selection
        • Plot Gmm Sin
      • Model Selection
        • Plot Confusion Matrix
        • Plot Cost Sensitive Learning
        • Plot Cv Indices
        • Plot Cv Predict
        • Plot Det
        • Plot Grid Search Digits
        • Plot Grid Search Refit Callable
        • Plot Grid Search Stats
        • Plot Grid Search Text Feature Extraction
        • Plot Learning Curve
        • Plot Likelihood Ratios
        • Plot Multi Metric Evaluation
        • Plot Nested Cross Validation Iris
        • Plot Permutation Tests For Classification
        • Plot Precision Recall
        • Plot Randomized Search
        • Plot Roc
        • Plot Roc Crossval
        • Plot Successive Halving Heatmap
        • Plot Successive Halving Iterations
        • Plot Train Error Vs Test Error
        • Plot Tuned Decision Threshold
        • Plot Underfitting Overfitting
      • Multiclass
        • Plot Multiclass Overview
      • Multioutput
        • Plot Classifier Chain Yeast
      • Neighbors
        • Approximate Nearest Neighbors
        • Plot Caching Nearest Neighbors
        • Plot Classification
        • Plot Digits Kde Sampling
        • Plot Kde 1D
        • Plot Lof Novelty Detection
        • Plot Lof Outlier Detection
        • Plot Nca Classification
        • Plot Nca Dim Reduction
        • Plot Nca Illustration
        • Plot Nearest Centroid
        • Plot Regression
        • Plot Species Kde
      • Neural Networks
        • Plot Mlp Alpha
        • Plot Mlp Training Curves
        • Plot Mnist Filters
        • Plot Rbm Logistic Classification
      • Preprocessing
        • Plot All Scaling
        • Plot Discretization
        • Plot Discretization Classification
        • Plot Discretization Strategies
        • Plot Map Data To Normal
        • Plot Scaling Importance
        • Plot Target Encoder
        • Plot Target Encoder Cross Val
      • Release Highlights
        • Plot Release Highlights 0 22 0
        • Release Highlights for scikit-learn 0.23
        • Release Highlights for scikit-learn 0.24
        • Release Highlights for scikit-learn 1.0
        • Release Highlights for scikit-learn 1.1
        • Release Highlights for scikit-learn 1.2
        • Release Highlights for scikit-learn 1.3
        • Release Highlights for scikit-learn 1.4
        • Release Highlights for scikit-learn 1.5
        • Release Highlights for scikit-learn 1.6
        • Release Highlights for scikit-learn 1.7
        • Release Highlights for scikit-learn 1.8
      • Semi Supervised
        • Plot Label Propagation Digits
        • Plot Label Propagation Digits Active Learning
        • Plot Label Propagation Structure
        • Plot Self Training Varying Threshold
        • Plot Semi Supervised Newsgroups
        • Plot Semi Supervised Versus Svm Iris
      • Svm
        • Plot Custom Kernel
        • Plot Iris Svc
        • Plot Linearsvc Support Vectors
        • Plot Oneclass
        • Plot Rbf Parameters
        • Plot Separating Hyperplane
        • Plot Separating Hyperplane Unbalanced
        • Plot Svm Anova
        • Plot Svm Kernels
        • Plot Svm Margin
        • Plot Svm Regression
        • Scaling the Regularization Parameter for SVCs
        • Plot Svm Tie Breaking
        • Plot Weighted Samples
      • Text
        • Plot Document Classification 20Newsgroups
        • Plot Document Clustering
        • Plot Hashing Vs Dict Vectorizer
      • Tree
        • Plot Cost Complexity Pruning
        • Plot Iris Dtc
        • Plot Tree Regression
        • Plot Unveil Tree Structure
  • Mathematics for ML
    • 3Blue1Brown Visual Mathematics
      • Calculus (3Blue1Brown)
        • Chapter 1: The Essence of Calculus
        • Chapter 2: The Paradox of the Derivative
        • Chapter 3: Derivative Formulas through Geometry
        • Chapter 4: Visualizing the Chain Rule and Product Rule
        • Chapter 5: Derivatives of Exponential
        • Chapter 6: Implicit Differentiation
        • Chapter 7: Limits and L’Hopital’s Rule
        • Chapter 8: Integration and the Fundamental Theorem
        • Chapter 9: What Does Area Have to Do with Slope?
        • Chapter 10: Higher Order Derivatives
        • Chapter 11: Taylor Series
        • Chapter 12: What Makes \(e^x\) So Special?
      • Chapter 1: Differential Equations - Introduction
      • Chapter 2: The Heat Equation
      • Chapter 3: Solving the Heat Equation
      • Chapter 4: Fourier Series and Complex Exponential
      • Chapter 5: Laplace Transforms
      • Chapter 6: Understanding the Laplace Transform
      • Chapter 7: Resonance and Forced Oscillations
      • Chapter 8: Matrix Exponents
      • Chapter 1: Abstract Vector Spaces
      • Chapter 2: Linear combinations, span, and basis vectors
      • Chapter 3: Linear transformations and matrices
      • Chapter 4: Matrix multiplication as composition
      • Chapter 5: Three-dimensional linear transformations
      • Chapter 6: The determinant
      • Chapter 7: Inverse matrices, column space and null space
      • Chapter 8: Nonsquare matrices as transformations between dimensions
      • Chapter 9: Dot products and duality
      • Chapter 10: Three-Dimensional Linear Transformations
      • Chapter 12: Cramer’s rule, explained geometrically
      • Chapter 13: Change of basis
      • Chapter 16: Abstract Vector Spaces
      • Chapter 1: But What is a Neural Network?
      • Chapter 2: Gradient Descent
      • Chapter 3: What is Backpropagation?
      • Chapter 4: Backpropagation Calculus
      • Chapter 5: GPT and Large Language Models
      • Chapter 6: Attention in Transformers
      • Chapter 7: Attention Mechanism Deep Dive
      • Chapter 8: How GPT Stores Facts
      • Chapter 9: Diffusion Models
    • Advanced Mathematics for Machine Learning
      • 1. The Learning Problem
      • 2. Generalization: The Core Challenge
      • Advanced Statistical Learning Theory
      • 3. The Bias-Variance Tradeoff
      • 4. Sample Complexity
      • 5. PAC Learning Framework
      • 6. Summary
      • 7. Exercises
      • References
      • 1. The Sampling Problem
      • 2. Markov Chain Basics
      • 3. Metropolis-Hastings Algorithm
      • 4. Gibbs Sampling
      • 5. Convergence and Diagnostics
      • 6. Summary
      • 1. Rademacher Complexity Definition
      • 2. Generalization Bound via Rademacher Complexity
      • 3. Rademacher Complexity for Different Hypothesis Classes
      • 4. Connection to Generalization
      • 5. Comparison: Rademacher Complexity vs VC Dimension
      • 6. Advanced: Gaussian Complexity
      • Summary
      • 1. PAC-Bayes Framework
      • 2. Proof Sketch
      • 3. Application: Gaussian Posterior over Weights
      • 4. Training with PAC-Bayes Bound
      • 5. Advantages of PAC-Bayes
      • Summary
      • 1. Motivation: Neural Networks as Kernel Methods
      • 2. Linearized Training Dynamics
      • 3. Computing NTK for Simple Networks
      • 4. Empirical NTK via Jacobian
      • 5. Training Dynamics: Kernel Gradient Descent
      • 6. NTK Evolution During Training
      • 7. Theoretical Implications
      • Summary
      • 1. Motivation: The Intractability Problem
      • 2. Evidence Lower Bound (ELBO)
      • 3. Mean-Field Approximation
      • 4. Example: Bayesian Gaussian Mixture (Simple Case)
      • 5. Black Box Variational Inference (BBVI)
      • Summary
      • 1. Motivation: Infinite Flexibility
      • 2. Dirichlet Distribution Review
      • 3. Dirichlet Process (DP)
      • 4. Chinese Restaurant Process (CRP)
      • 5. DP Mixture Model for Clustering
      • Summary
      • 1. The EM Framework
      • 2. Mathematical Derivation
      • 3. Convergence Proof
      • 4. Gaussian Mixture Model (GMM)
      • 5. EM for GMM
      • 6. Visualizing the E and M Steps
      • 7. Summary
      • 1. Function Properties
      • 2. Gradient Descent Algorithm
      • 3. Convergence for L-Smooth Functions
      • 4. Strong Convexity: Linear Convergence
      • 5. Summary of Convergence Rates
      • Summary
      • 1. State Space Model
      • 2. Kalman Filter Algorithm
      • 3. Kalman Smoother (RTS)
      • Summary
      • 1. Motivation
      • 2. Sklar’s Theorem
      • 3. Common Copula Families
      • 4. Application: Constructing Joint Distributions
      • Summary
      • 1. Motivation: Diversity
      • 2. DPP Definition
      • 3. L-ensemble DPP
      • 4. Sampling Algorithm
      • 5. Application: Diverse Subset Selection
      • Summary
      • 1. The Johnson-Lindenstrauss Lemma
      • 2. Random Projection
      • 3. Implementation
      • 4. Sparse Random Projections
      • 5. Application: Nearest Neighbors
      • Summary
      • 1. Primal Problem
      • 2. Dual Problem
      • 3. KKT Conditions
      • 4. Application: SVM Dual
      • Summary
      • 1. Problem: Solve \(Ax = b\)
      • 2. Conjugate Directions
      • 3. Convergence Analysis
      • 4. Preconditioned CG
      • Summary
      • 1. Matrix Bernstein Inequality
      • 2. Matrix Chernoff Bound
      • 3. Random Matrix Theory
      • 4. Marchenko-Pastur Law
      • 5. Application: Compressed Sensing
      • Summary
    • Stanford CS229 Machine Learning
      • CS229: Machine Learning Course (Stanford University)
        • Lecture 1: Introduction & Linear Regression
        • Lecture 2: Linear Regression and Gradient Descent
        • Lecture 3: Locally Weighted Regression (LWR)
        • Lecture 3 & 4: Logistic Regression & Classification
        • Lecture 5 & 6: Generative Learning Algorithms
        • Lecture 6-7: Support Vector Machines
        • Lecture 8: Bias, Variance, and Regularization
        • Lecture 9: Learning Theory
        • Lecture 10: Decision Trees and Ensemble Methods
        • Lecture 11: Introduction to Neural Networks
        • Lecture 12: Backpropagation & Deep Learning
        • Lecture 13: Advice for Applying Machine Learning
        • Lecture 14: Expectation Maximization & Clustering
        • Lectures 15-17: EM, Factor Analysis, PCA & ICA
        • Lectures 18-20: Reinforcement Learning & MDPs
        • CS229 Practice Problems
        • Anomaly Detection
        • Recommender Systems
    • Practice Labs: Deep Learning Interviews (Kashani)
      • Lab 01: Logistic Regression from Scratch
      • Lab 02: Information Theory & Entropy
      • Lab 03: Calculus, Gradients & Backpropagation
      • Lab 04: Probability & Bayesian Deep Learning
      • Lab 05: Neural Network Ensembles
      • Lab 06: CNN Feature Extraction & Deep Learning Fundamentals
    • Foundational Mathematics
      • Setup: Install Required Packages
      • 1. NumPy: Numerical Python
      • 2. Matplotlib: Visualization Basics
      • 3. Seaborn: Beautiful Statistical Plots
      • 4. SciPy: Scientific Computing
      • 5. scikit-learn: Machine Learning Library
      • Quick Reference Cheat Sheet
      • Practice Exercise
      • Bonus: Next Steps Examples
      • πŸŽ“ Your Learning Journey
      • Linear Algebra Fundamentals
      • Calculus & Derivatives
      • Probability & Statistics
      • Gradient Descent
      • Information Theory
      • Statistical Inference
      • Neural Network Mathematics
      • Advanced Linear Algebra
      • The Architecture of Mathematics: Analytical vs Numerical Approaches
      • AI Foundations: Symbolic vs Non-Symbolic AI & Control Theory
      • Markov Models & Hidden Markov Models (HMMs)
      • Optimization from Scratch: Gradient Descent & Adam
    • Introduction to Statistical Learning with Python (ISLP)
      • Statistics & Probability for AI/ML
      • Chapter 1: Introduction to Statistical Learning
      • Chapter 2: Statistical Learning
      • Chapter 3: Linear Regression
      • Chapter 4: Classification
      • Chapter 5: Resampling Methods
      • Chapter 6: Linear Model Selection and Regularization
      • Chapter 7: Moving Beyond Linearity
      • Chapter 8: Tree-Based Methods
      • Chapter 9: Support Vector Machines
      • Chapter 10: Deep Learning
      • Chapter 11: Survival Analysis and Censored Data
      • Chapter 12: Unsupervised Learning
      • Chapter 13: Multiple Testing
      • ISLP Practice Exercises
      • Part 1: Probability Fundamentals
      • Part 2: Descriptive Statistics
      • Part 3: Probability Distributions
      • Part 4: Statistical Inference
      • Part 5: Hypothesis Testing
      • Part 6: Correlation & Causation
      • Part 7: ML Applications
      • 🎯 Summary & Key Takeaways
      • πŸ“š Additional Resources
    • ML Problem Solving
    • Machine Learning: A Probabilistic Perspective
      • 1. Basic Probability Rules
      • 2. Bayes’ Rule
      • 3. Common Probability Distributions
      • 4. Monte Carlo Sampling
      • 5. Information Theory Basics
      • Summary
      • Exercises
      • 1. Naive Bayes Classifier
      • 2. Gaussian Discriminant Analysis (GDA)
      • 3. Generative vs Discriminative Models
      • 4. Spam Detection with Naive Bayes
      • 5. Effect of the Naive Assumption
      • Summary
      • Key Takeaways
      • Exercises
      • 1. Multivariate Gaussian Distribution
      • 2. Maximum Likelihood Estimation
      • 3. Gaussian Mixture Models (GMM)
      • 4. Expectation-Maximization (EM) Algorithm
      • 5. Missing Data Imputation with EM
      • Summary
      • Key Takeaways
      • Exercises
      • 1. Bayesian Inference Fundamentals
      • 2. Conjugate Priors
      • 3. Posterior Predictive Distribution
      • 4. Bayesian Decision Theory
      • 5. Empirical Bayes
      • Summary
      • Key Takeaways
      • Exercises
      • 1. Linear Regression
      • 2. Ridge Regression (L2 Regularization)
      • 3. Bayesian Linear Regression
      • 4. Logistic Regression for Binary Classification
      • 5. Multinomial Logistic Regression (Softmax)
      • 6. Model Selection and Regularization Comparison
      • Summary
      • Key Takeaways
      • Exercises
      • 1. Lasso Regression (L1 Regularization)
      • 2. Regularization Path: Ridge vs Lasso
      • 3. Elastic Net: Combining L1 and L2
      • 4. Feature Selection with Lasso
      • 5. Sparse Logistic Regression
      • Summary
      • Key Takeaways
      • Exercises
      • 1. Kernel Functions and the Kernel Trick
      • 2. Kernel Ridge Regression
      • 3. Support Vector Machines (SVM)
      • 4. Gaussian Processes for Regression
      • 5. GP Hyperparameter Optimization
      • Summary
      • Key Takeaways
      • When to Use
      • Exercises
      • 1. Bayesian Network Basics
      • 2. Conditional Independence and d-Separation
      • 3. Naive Bayes Classifier
      • 4. Markov Chains
      • Summary
      • Key Takeaways
      • Exercises
      • 1. Hidden Markov Model Definition
      • 2. Forward Algorithm (Filtering)
      • 3. Backward Algorithm
      • 4. Viterbi Algorithm (Decoding)
      • 5. Baum-Welch Algorithm (Learning)
      • 6. Application: Part-of-Speech Tagging
      • Summary
      • Key Takeaways
      • Computational Complexity
      • Limitations
      • Extensions
      • Exercises
      • 1. Monte Carlo Basics
      • 2. Metropolis-Hastings Algorithm
      • MCMC Theory: Advanced Mathematical Foundations
      • 3. Gibbs Sampling
      • 4. Convergence Diagnostics
      • 5. Bayesian Linear Regression with MCMC
      • Summary
      • Key Takeaways
      • Comparison
      • Modern MCMC
      • Exercises
      • 1. Gaussian Mixture Models (GMM)
      • 2. K-Means as Hard EM
      • 3. EM Algorithm Theory
      • 4. EM for GMM
      • 5. Model Selection (BIC, AIC)
      • 6. Mixture of Bernoullis
      • Summary
      • Key Takeaways
      • Extensions
      • Exercises
      • 1. Principal Component Analysis (PCA)
      • 2. Probabilistic PCA
      • 3. Factor Analysis (FA)
      • 4. Independent Component Analysis (ICA)
      • Summary
      • Comparison Table
      • Key Insights
      • Practical Tips
      • Exercises
      • 1. K-Means Clustering (Detailed)
      • 2. Hierarchical Clustering
      • 3. Spectral Clustering
      • 4. DBSCAN (Density-Based)
      • 5. Affinity Propagation
      • 6. Cluster Evaluation Metrics
      • Summary
      • Algorithm Comparison
      • Choosing an Algorithm
      • Practical Tips
      • Exercises
    • Mathematics for Machine Learning (MML)
      • Course
        • 2.1 Systems of Linear Equations
        • 2.2 Matrices
        • 2.3 Solving Systems of Linear Equations
        • 2.4 Vector Spaces
        • 2.5 Linear Independence
        • 2.6 Basis and Rank
        • 2.7 Linear Mappings
        • 2.8 Affine Spaces
        • Summary
        • 3.1 Norms
        • 3.2 Inner Products
        • 3.3 Lengths and Distances
        • 3.4 Angles and Orthogonality
        • 3.5 Orthonormal Basis
        • 3.8 Orthogonal Projections
        • 3.9 Rotations
        • Summary
        • 4.1 Determinant and Trace
        • 4.2 Eigenvalues and Eigenvectors
        • 4.3 Cholesky Decomposition
        • 4.4 Eigendecomposition and Diagonalization
        • 4.5 Singular Value Decomposition (SVD)
        • 4.6 Matrix Approximation
        • Summary
        • 5.1 Differentiation of Univariate Functions
        • 5.2 Partial Differentiation and Gradients
        • 5.3 Gradients of Vector-Valued Functions
        • 5.6 Backpropagation and Automatic Differentiation
        • 5.7 Higher-Order Derivatives
        • 5.8 Linearization and Multivariate Taylor Series
        • Summary
        • 6.1 Probability Space
        • 6.2 Discrete and Continuous Probabilities
        • 6.3 Sum Rule, Product Rule, and Bayes’ Theorem
        • 6.4 Summary Statistics and Independence
        • 6.5 Gaussian Distribution
        • Summary
        • 7.1 Gradient Descent
        • Advanced Optimization Theory for Deep Learning
        • 7.2 Constrained Optimization and Lagrange Multipliers
        • 7.3 Convex Optimization
        • Summary
        • 9.1 Problem Formulation
        • 9.2 Maximum Likelihood Estimation (MLE)
        • 9.3 Regularization: Ridge and Lasso
        • 9.4 Bayesian Linear Regression
        • 9.5 Model Selection and Evaluation
        • Summary
        • 10.1 Problem Setting
        • 10.2 Maximum Variance Perspective
        • 10.3 Projection Perspective
        • 10.4 PCA Algorithm
        • 10.5 PCA on Real Data: Handwritten Digits
        • 10.6 PCA for Data Preprocessing
        • Summary
        • 11.1 Gaussian Mixture Model
        • 11.2 Expectation-Maximization (EM) Algorithm
        • 11.3 Soft vs Hard Clustering
        • 11.4 Model Selection: Choosing K
        • 11.5 GMM Applications and Limitations
        • Summary
        • 12.1 Separating Hyperplanes
        • 12.2 Maximum Margin
        • 12.3 Primal Optimization Problem
        • 12.4 Dual Problem and Lagrange Multipliers
        • 12.5 Kernel Trick
        • 12.6 Soft Margin SVM
        • Summary
      • Exercises
        • Chapter 2: Linear Algebra
        • Chapter 3: Analytic Geometry
        • Chapter 4: Matrix Decompositions
        • Chapter 5: Vector Calculus
        • Chapter 6: Probability and Distributions
        • Chapter 7: Continuous Optimization
        • 🎯 Bonus Challenge: Integration Exercise πŸ”΄πŸ”΄
        • Chapter 9: Linear Regression
        • Chapter 10: PCA (Principal Component Analysis)
        • Chapter 11: Gaussian Mixture Models
        • Chapter 12: Support Vector Machines
        • 🎯 Bonus Challenge: Complete ML Pipeline πŸ”΄πŸ”΄
        • Chapter 2: Linear Algebra Solutions
        • Chapter 3: Analytic Geometry Solutions
        • Chapter 4: Matrix Decompositions Solutions
        • Chapter 9: Linear Regression Solutions
        • Chapter 10: PCA Solutions
      • Practice Labs: Math for ML (MML Book)
        • Lab 01: Linear Algebra
        • Lab 02 – Analytic Geometry
        • Lab 03: Matrix Decompositions
        • Lab 04: Vector Calculus
        • Lab 05: Probability and Distributions
        • Lab 6: Continuous Optimization
        • Lab 07: Linear Regression
        • Lab 08: Dimensionality Reduction with Principal Component Analysis
        • Lab 09: Density Estimation with Gaussian Mixture Models
        • Lab 10: Classification with Support Vector Machines
    • Practice Labs: SLP (Jurafsky/Martin)
      • Lab 01: Words, Tokens & Text Processing
      • Lab 02: N-gram Language Models
      • Lab 03: Word Embeddings
      • Lab 04: Neural Networks from Scratch
      • Lab 05: Transformers & Attention
      • Lab 06: Large Language Models

Core AI

  • Tokenization
    • Production Tokenization Guide
    • Tokenization Comparison Guide
    • Integration Guide: Using Tokenizers with Popular Frameworks
    • Understanding Tokens
    • HuggingFace Tokenizers - Complete Learning Module
    • HuggingFace Tokenizers Library - Complete Learning Guide
    • Understanding Tokens: The Foundation of Language Models
    • Tokenization β€” Start Here
    • 01 Tokenizers Quickstart
    • HuggingFace Tokenizers - Quick Start Examples
    • 02 Tokenizers Training
    • HuggingFace Tokenizers - Training Examples
    • 03 Advanced Training Methods
    • Advanced Training Methods for Tokenizers
    • Setup
    • Part 1: Normalization
    • Part 2: Pre-tokenization
    • Part 3: Post-processing
    • Part 4: Decoders
    • Part 5: Complete Pipeline Examples
    • Summary
    • Sentencepiece Example
    • Tiktoken Example
    • Basic Tokenization Example
    • Token Exercises
    • Token Exercises - Interactive Practice
    • Token Exploration
    • Token Exploration - Advanced Examples
  • Embeddings
    • Quick Start Guide - Embeddings
    • Embedding Models Comparison Guide
    • Embeddings β€” Start Here
    • Embeddings Intro
    • Embeddings Introduction
    • Huggingface Embeddings
    • Openai Embeddings
    • Paraphrase Mining With Sentence Transformers
    • Semantic Search With Sentence Transformers
    • Semantic Similarity
    • Semantic Similarity Explorer
    • Semantic Textual Similarity (STS)
    • Sentence Transformers Quickstart
    • Sparse Encoders: SPLADE and Learned Sparse Representations
    • Vector Database Demo
    • Vector Database Demo
  • Neural Networks
    • Assignment: Build a Neural Network from Scratch
    • Attention Mechanism: The Breakthrough Innovation
    • Challenges: Neural Networks
    • Neural Networks: From Basics to Transformers
    • Transformer Architecture: Complete Guide
    • Verify Installation
    • 🎯 What You’ll Build
    • πŸ§ͺ Quick Neural Network Demo
    • πŸ“– Reading Material
    • πŸŽ“ Prerequisites Review
    • 🚦 Next Steps
    • πŸ“Š Progress Tracker
    • 🎯 Learning Goals
    • πŸ”— Helpful Resources
    • πŸš€ Let’s Begin!
    • What is a Neuron?
    • 2. Activation Functions
    • 3. Building a Neural Network Layer
    • 4. Building a Complete Neural Network
    • 5. Training a Neural Network
    • 6. Visualizing Decision Boundaries
    • 7. Experimenting with Architecture
    • Summary
    • The Problem
    • 2. The Chain Rule - Foundation of Backpropagation
    • 3. Backpropagation in a Simple Network
    • 4. Training with Backpropagation
    • 5. Multi-Layer Network with Matrix Operations
    • 6. Vanishing and Exploding Gradients
    • Summary
    • Creating Tensors
    • Tensor Operations
    • 2. Automatic Differentiation (Autograd)
    • 3. Building Neural Networks with nn.Module
    • 4. Training a Neural Network – The Complete Loop
    • 5. Modern Optimizers
    • 6. Real Dataset – MNIST Digit Classification
    • 7. Saving and Loading Models
    • Summary
    • Before Attention: The Bottleneck Problem
    • 2. Scaled Dot-Product Attention
    • Attention Mechanism: Mathematical Foundations
    • 3. Self-Attention Example – Understanding Context
    • 4. Multi-Head Attention
    • 5. Masked Attention - For Autoregressive Models
    • 6. Cross-Attention - Connecting Two Sequences
    • 7. Practical Application – Sequence Classification
    • Summary
    • Why Transformers Changed Everything
    • 2. Positional Encoding
    • 3. Feedforward Network
    • 4. Transformer Encoder Layer
    • 5. Complete Transformer Encoder
    • 6. Simple Classification with Transformer
    • 7. Using Pre-trained Transformers
    • 8. Fine-tuning Example
    • 9. Transformer Architecture Diagram
    • Summary
    • πŸŽ“ Congratulations!

Applied AI

  • Vector Databases
    • πŸ“– Learning Path
    • 🎯 Prerequisites
    • πŸ—„οΈ Database Comparison
    • πŸ’‘ Common Use Cases
    • πŸ”— Additional Resources
    • 🚦 Ready to Start?
    • πŸ—ΊοΈ Your Complete Learning Journey
    • 1. Understanding Vectors and Embeddings
    • 2. Similarity Metrics
    • 3. Why Vector Databases?
    • 4. Simple Vector Database Implementation
    • 5. Using Our Simple Vector Database
    • 6. Update and Delete Operations
    • 7. Real-World Example with Sentence Embeddings
    • 8. Performance Comparison: Different Metrics
    • Key Takeaways
    • Next Steps
    • Chroma – Local Vector Database
    • Qdrant – Production Vector Database
    • Weaviate – Enterprise Vector Database
    • Milvus – Large-Scale Vector Database
    • 1. Connection Setup
    • 2. Enable pgvector Extension
    • 3. Create Tables for Embeddings
    • 4. Generate and Store Embeddings
    • 5. Semantic Search
    • 6. Filtered Semantic Search
    • 7. Hybrid Search (Vector + Full-Text)
    • 8. Product Search Example
    • 9. Performance Optimization
    • 10. Monitoring and Statistics
    • 11. Best Practices
    • 12. Integration with AWS Services
    • 13. Cleanup
    • Summary
  • RAG
    • RAG Evaluation Playbook
    • RAG Technique Selection Guide
    • Assignment: Build a Production-Ready RAG System
    • Challenges: RAG Systems
    • RAG: Retrieval-Augmented Generation - START HERE
    • Basic RAG from Scratch
    • Document Processing and Chunking
    • LangChain RAG
    • LlamaIndex RAG
    • Advanced Retrieval
    • Conversational RAG
    • RAG Evaluation
    • HyDE + Reranking for RAG
    • Advanced RAG Techniques (2025-2026 State of the Art)
    • GraphRAG and Visual RAG (Microsoft GraphRAG + ColPali)
    • Section A: Microsoft GraphRAG
    • Section B: ColPali Visual Document RAG
    • Corrective RAG (CRAG-Style)
    • Parent-Child Retrieval for RAG
    • RAPTOR-Style Hierarchical Retrieval
  • MLOps
    • MLOps: Machine Learning in Production
    • Experiment Tracking with MLflow
    • Building ML APIs with FastAPI
    • Model Deployment Strategies
    • Containerizing ML Applications with Docker
    • Monitoring ML Models in Production
    • CI/CD for Machine Learning
    • Cloud Deployment for ML Models
    • LLM Infrastructure for Production (2025-2026 Essential Stack)
    • LLM Production Optimization
  • Specializations
    • AI Specializations β€” Start Here
    • AI Agents Specialization
      • AI Agents Series - Completion Summary
      • AI Agents Specialization β€” Start Here
      • Function Calling & Tool Use
      • ReAct: Reasoning + Acting Agents
      • LangGraph: Stateful Agent Workflows
      • Multi-Agent Systems
      • Memory & State Management for Agents
      • Deploying Agents to Production
    • Computer Vision Specialization
      • Computer Vision Specialization β€” Start Here
      • Image Classification with Deep Learning
      • Object Detection: YOLO, DETR & Beyond
      • CLIP: Connecting Text and Images
      • Stable Diffusion & Image Generation
      • Multimodal RAG: Text + Images
    • Advanced NLP Specialization
      • Advanced NLP Specialization β€” Start Here
      • Named Entity Recognition (NER)
      • Machine Translation
      • Text Summarization: Extractive & Abstractive
      • Sentiment Analysis at Scale
      • Information Extraction from Documents

Advanced

  • Prompt Engineering
    • Assignment: Build an Advanced Prompt Engineering System
    • Setup
    • Example 1: Basic vs. Improved Prompt
    • Example 2: Few-Shot Learning
    • Example 3: Chain-of-Thought Reasoning
    • Example 4: System Prompts
    • Example 5: Structured Output
    • Key Takeaways
    • Next Steps
    • 1. Zero-Shot Prompting
    • 2. One-Shot Prompting
    • 3. Few-Shot Prompting
    • 4. Dynamic Few-Shot Selection
    • 5. Best Practices
    • Key Takeaways
    • Model Selection (December 2025)
    • Next Steps
    • 1. The Classic Example
    • 2. Zero-Shot CoT
    • 3. Few-Shot CoT
    • 4. Self-Consistency
    • 5. Structured CoT
    • 6. CoT for Code Debugging
    • 7. Least-to-Most Prompting
    • Best Practices
    • Key Takeaways
    • Next Steps
    • Setup
    • 1. Simple ReAct Example
    • 2. ReAct Agent
    • 3. Real Tools – Wikipedia Search
    • 4. More Complex Example
    • 5. Custom Tools
    • 6. Error Handling and Self-Correction
    • Best Practices
    • Key Takeaways
    • Limitations
    • Next Steps
    • Structured LLM Outputs & Programmatic Prompting (2025-2026)
    • Long-Context Strategies: Working with 128K–1M Token Windows
  • LLM Fine-Tuning
    • LLM Fine-tuning β€” Start Here
    • Dataset Preparation for LLM Fine-tuning
    • Supervised Fine-Tuning (SFT) β€” Complete Workflow
    • LoRA Fine-tuning Basics (December 2025)
    • QLoRA β€” Memory-Efficient Fine-Tuning on Consumer GPUs
    • DPO Alignment: Teaching Models to Be Helpful and Harmless
    • Evaluating Fine-Tuned LLMs
    • Deploying Fine-Tuned LLMs to Production
    • GRPO Reasoning Training - Training R1-Style Thinking Models (2025)
    • Unsloth - 2x-5x Faster Fine-Tuning with 80% Less VRAM (2025)
    • Quantization: GPTQ, AWQ, GGUF & bitsandbytes
    • RLHF & Constitutional AI: Alignment Training
  • Multimodal AI
    • Multimodal AI β€” Start Here
    • Audio & Speech
      • Whisper: Speech Recognition & Audio Understanding
      • Text-to-Speech: TTS with OpenAI, Coqui & Edge TTS
    • Image Generation
      • Stable Diffusion: Text-to-Image Generation
      • ControlNet: Precise Control Over Image Generation
    • Vision-Language Models
      • CLIP Basics: Zero-Shot Vision with CLIP
      • Vision-Language Models: GPT-4V, LLaVA & Gemini Vision
      • Multimodal RAG: Retrieval-Augmented Generation with Images
  • Local LLMs
    • AI Toolkit for VS Code
    • Local LLMs β€” Start Here
    • Setup
    • 1. Download and Run a Model
    • 8. CLI Usage (from terminal)
    • Tips & Best Practices
    • Key Takeaways
    • Limitations
    • Next Steps
    • Part 1: Major Open Source Model Families
    • Part 2: Model Family Deep Dives
    • 🎯 Key Takeaways
    • πŸ“ Practice Exercises
    • πŸ”— Resources
    • Local RAG with Ollama
    • Local LLM Servers and APIs
    • Speculative Decoding: 2-3x Faster LLM Inference
  • AI Agents
    • AI Agents - Assignment
    • AI Agents - Challenges
    • AI Agents - Post-Quiz
    • AI Agents - Pre-Quiz
    • AI Agents β€” Start Here
    • Part 1: What is an AI Agent?
    • Part 2: Chatbot vs Agent
    • Part 4: Agent Design Patterns
    • 🎯 Summary
    • Setup
    • Part 1: Function Calling Basics
    • Part 2: Tool Schema Design
    • Part 4: Error Handling
    • Part 5: Advanced Patterns
    • Part 6: Best Practices Summary
    • 🎯 Final Knowledge Check
    • πŸš€ Next Steps
    • Part 1: What is ReAct?
    • 🎯 Final Knowledge Check
    • πŸš€ Next Steps
    • Part 1: Framework Overview (2026 Landscape)
    • Part 2: LangChain Agents
    • Part 3: LangGraph Workflows
    • Part 4: Memory Integration
    • Part 5: Framework Comparison
    • Part 6: Production Patterns
    • 🎯 Knowledge Check
    • πŸš€ Next Steps
    • Part 1: Multi-Agent Basics
    • Part 2: Agent Coordination
    • Part 3: Role-Based Teams
    • Part 4: Communication Patterns
    • Part 5: Conflict Resolution
    • Part 6: A2A Protocol & Parallel Execution
    • Part 7: Production Systems
    • 🎯 Final Knowledge Check
    • πŸš€ Next Steps
    • MCP β€” Model Context Protocol
    • OpenAI Agents SDK + LangGraph 1.0
    • Section A β€” OpenAI Agents SDK
    • Section B β€” LangGraph 1.0
    • Section C β€” Comparison and Production Guidance
    • Notebook 08: Working with Reasoning Models
    • Autonomous AI Agents in 2026
    • Notebook 10: Agent Evaluation

Supplementary

  • Model Evaluation
    • Assignment: Complete Model Evaluation Pipeline
    • Challenges: Model Evaluation & Metrics
    • Post-Quiz: Model Evaluation & Metrics
    • Pre-Quiz: Model Evaluation & Metrics
    • Model Evaluation β€” Start Here
    • Part 1: Confusion Matrix Basics
    • Part 2: Core Metrics
    • Part 3: ROC Curves & AUC
    • Part 4: Handling Imbalanced Data
    • Part 5: Multi-Class Metrics
    • Part 6: Choosing the Right Metric
    • 🎯 Knowledge Check
    • πŸš€ Next Steps
    • Part 1: Core Regression Metrics
    • Part 2: Understanding Residuals
    • Part 3: R-Squared Explained
    • Part 4: Choosing the Right Metric
    • Part 5: Outlier Handling
    • Part 6: Advanced Metrics
    • 🎯 Knowledge Check
    • πŸ“š Summary
    • πŸš€ Next Steps
    • Part 1: Introduction to LLM Evaluation
    • Part 2: BLEU Score
    • Part 3: ROUGE Metrics
    • Part 4: Perplexity
    • Part 5: Semantic Similarity (BERTScore)
    • Part 6: Human Evaluation
    • Part 7: RAG Evaluation
    • 🎯 Knowledge Check
    • πŸ“š Summary
    • πŸš€ Next Steps
    • Part 1: Understanding Bias in AI
    • Part 2: Fairness Metrics
    • Part 3: Detecting Bias
    • Part 4: Mitigation Strategies
    • Part 5: Real-World Case Studies
    • Part 6: Fairness Toolkits
    • 🎯 Knowledge Check
    • πŸ“š Summary
    • πŸš€ Next Steps
    • Part 1: Cross-Validation
    • Part 2: Comparing Multiple Models
    • Part 3: Statistical Significance Testing
    • Part 4: A/B Testing
    • Part 5: Multi-Objective Selection
    • Part 6: Model Selection Framework
    • 🎯 Knowledge Check
    • πŸ“š Summary
    • πŸš€ Next Steps
  • Debugging & Troubleshooting
    • Assignment: Debug & Optimize a Broken ML Pipeline
    • Debugging & Troubleshooting Challenges
    • Post-Quiz: Debugging & Troubleshooting
    • Pre-Quiz: Debugging & Troubleshooting
    • Debugging & Troubleshooting β€” Start Here
    • Part 1: The Debugging Workflow
    • Part 2: Sanity Checks Checklist
    • Part 3: Baseline Models
    • Part 4: Debugging Checklist
    • Part 5: Logging and Instrumentation
    • 🎯 Key Takeaways
    • πŸ“ Practice Exercise
    • πŸš€ Next Steps
    • Missing Data: The Silent Model Killer
    • Duplicate Detection: Preventing Data Leakage and Inflated Metrics
    • Outlier Detection: Distinguishing Signal from Noise
    • Label Noise Detection: When Your Ground Truth Lies
    • Distribution Shift Detection: When the World Changes Under Your Model
    • 🎯 Key Takeaways
    • πŸ“ Practice Exercise
    • CPU Profiling with cProfile: Finding Where Time Disappears
    • Vectorized Optimization: Replacing Loops with NumPy
    • Memory Profiling: Tracking Allocation and Leaks
    • Identifying Bottlenecks in ML Pipelines
    • Common Optimization Techniques for ML Code
    • Vectorization: Broadcasting Over Loops
    • Caching and Memoization: Trading Memory for Speed
    • Batch Processing: Amortizing Per-Call Overhead
    • 🎯 Key Takeaways
    • πŸ“ Optimization Checklist
    • Learning Curves: Diagnosing Bias vs. Variance
    • Overfitting vs. Underfitting: Visual Diagnosis with Polynomial Regression
    • Regularization Strategies: Constraining Model Complexity
    • Convergence Issues: When Gradient Descent Gets Lost
    • Model Complexity Trade-off: Validation Curves
    • 🎯 Key Takeaways
    • πŸ“ Debugging Checklist
    • Confusion Matrix Deep Dive: Beyond Aggregate Accuracy
    • Per-Class Error Analysis: Finding the Weakest Links
    • Failure Case Analysis: Learning from Mistakes
    • Confidence Analysis: Does the Model Know What It Does Not Know?
    • Error Analysis Report: Structured Communication of Findings
    • 🎯 Key Takeaways
    • πŸ“ Error Analysis Checklist
    • πŸŽ‰ Congratulations!
  • Low-Code AI Tools
    • Assignment: Low-Code ML Application
    • Challenges: Low-Code AI Tools
    • Post-Quiz: Low-Code AI Tools
    • Pre-Quiz: Low-Code AI Tools
    • Low-Code AI Tools β€” Start Here
    • Gradio Basics: From Python Function to Web Interface in Three Lines
    • Key Gradio Components
    • Image Classification Interface
    • Text Generation Interface: Controlling LLM Output
    • Advanced Layouts with Blocks: Beyond Simple Input-Output
    • Multi-Modal Interface: Combining Data Types
    • Sharing and Deployment: From Local to Global
    • Deployment Options
    • 🎯 Key Takeaways
    • πŸ“ Practice Exercises
    • πŸ”— Resources
    • Streamlit Basics: Python Scripts that Become Web Apps
    • Running the App
    • ML Model Deployment App: Interactive Classification with Sidebar Controls
    • Session State: Persisting Data Across Reruns
    • Caching for Performance: Avoiding Redundant Computation
    • Interactive Data Dashboard: Filters, KPIs, and Multi-Tab Visualization
    • 🎯 Key Takeaways
    • πŸ“ Practice Exercises
    • πŸ”— Resources
    • Part 1: Introduction to Hugging Face Spaces
    • Part 6: Advanced Space Configuration
    • 🎯 Key Takeaways
    • πŸ“ Practice Exercises
    • πŸ”— Resources
    • Part 1: Introduction to AutoML
    • 🎯 Key Takeaways
    • πŸ“ Practice Exercises
    • πŸ”— Resources
    • Data Preparation: Building a Realistic Churn Dataset
    • Model Training with AutoML: From Data to Tuned Model in Minutes
    • Build Gradio Interface: Making the Model Accessible
    • Create Deployable App Files: From Notebook to Production
    • Production Considerations: From Demo to Reliable System
    • 🎯 Key Takeaways
    • πŸ“ Project Extensions
    • πŸŽ“ Final Exercise
    • πŸ† Congratulations!
  • AI Safety & Red Teaming
    • Assignment: Secure AI System Implementation
    • Challenges: AI Security & Red Teaming
    • Quiz: AI Safety & Red Teaming
    • AI Safety & Red Teaming β€” Start Here
    • Part 1: Understanding Prompt Injection Attacks
    • Summary & Best Practices
    • Practice Exercises
    • OpenAI Moderation API: Production-Grade Content Classification
    • Toxicity Detection with Detoxify: Local ML-Based Analysis
    • Custom Content Filters: Domain-Specific Safety Rules
    • Multi-Layer Content Moderation: Defense in Depth for Safety
    • Moderation Policies and Response Strategies: Beyond Binary Blocking
    • Production Implementation: Integrating All Components
    • Summary & Best Practices
    • Understanding PII Types: A Risk-Based Classification
    • Basic PII Detection with Regex: Fast Pattern Matching
    • Advanced PII Detection with Presidio: ML-Powered Entity Recognition
    • Anonymization Strategies: Choosing the Right Approach
    • Privacy Compliance: GDPR and CCPA Requirements
    • Production PII Protection Pipeline: End-to-End System
    • Summary & Best Practices
    • Understanding Bias Types: Where Unfairness Enters the ML Pipeline
    • Fairness Metrics: Mathematical Definitions of Equality
    • Visualizing Bias: Making Disparities Visible
    • Bias Mitigation Strategies: Pre-processing, In-processing, and Post-processing
    • Bias in LLMs: Detecting and Measuring Language Model Bias
    • Building Fairness-Aware Systems: Runtime Monitoring
    • Summary & Best Practices
    • Part 1: Red Team Methodology
    • Summary & Best Practices
  • Real-Time & Streaming AI
    • Real-Time Streaming β€” Start Here
    • Streaming LLM Responses
    • WebSocket Connections for Real-Time AI Chat
    • Streaming RAG Pipeline
    • Production-Grade Streaming Systems

Reference

  • Quizzes
    • Neural Networks - Post-Quiz
    • Neural Networks - Pre-Quiz
    • Retrieval-Augmented Generation (RAG) - Pre-Quiz
  • References & Hands-On Labs
    • Cloud Platform Labs & Resources ☁️
    • Microsoft Hands-On Labs πŸ§ͺ
    • AI/ML Video Learning Resources πŸŽ₯
  • Glossary & Foundations
    • AI/ML Glossary

Research

  • Advanced Deep Learning
    • Advanced Deep Learning β€” Start Here
    • GAN Mathematics: Comprehensive Theory
    • Generative Adversarial Networks (GANs)
    • Wasserstein GAN (WGAN)
    • Variational Autoencoders (VAEs)
    • Neural Ordinary Differential Equations (Neural ODEs)
    • InfoGAN: Information-Maximizing Generative Adversarial Networks - Comprehensive Theory
    • Vision Transformers (ViT)
    • 1. Motivation: Multi-Scale Latent Representations
    • 2. ELBO for Hierarchical VAE
    • Summary
    • Conditional GANs (cGAN): Comprehensive Theory
    • 1. Motivation: Discrete Latent Spaces
    • 2. Vector Quantization Layer
    • Summary
    • Advanced Vector Quantized Variational Autoencoders (VQ-VAE): Theory and Practice
    • 1. Motivation: Exact Likelihood
    • 3. Composing Flows
    • 6. Modern Flow Architectures
    • Summary
    • Advanced Normalizing Flows Theory
    • 1. Motivation: Iterative Refinement
    • 2. Forward Diffusion Process
    • 3. Reverse Process & Training
    • 4. Sampling (Reverse Process)
    • Summary
    • Advanced Diffusion Models Theory
    • 1. BERT vs GPT
    • 1.5. Masked Language Modeling: Deep Dive
    • 2.5. Segment Embeddings and Special Tokens
    • 3.5. BERT Training: Advanced Techniques
    • Summary
    • 1. GPT vs BERT
    • 1.5. Causal Masking: Mathematical Foundation
    • 2.5. Positional Encoding: Theory and Variants
    • 3.5. Scaling Laws for Language Models
    • 4.5. Advanced Generation Strategies: Complete Analysis
    • Summary
    • 1. Attention Complexity Problem
    • 2. Linformer: Low-Rank Attention
    • 3. Performer: Kernel Approximation
    • Summary
    • Advanced Efficient Transformers Theory
    • 1. Message Passing Framework
    • 2. Graph Convolutional Network (GCN)
    • Advanced Message Passing Theory
    • 3. Graph Attention Network (GAT)
    • Summary
    • Advanced Graph Neural Networks Theory
    • 1. Meta-Learning Problem
    • 2. MAML Algorithm
    • Summary
    • Advanced Meta-Learning Theory
    • Advanced Meta-Learning and MAML Theory
    • 1. Few-Shot Classification
    • 2. Algorithm
    • Summary
    • Advanced Prototypical Networks Theory
    • Neural Radiance Fields (NeRF): Comprehensive Theory
    • 1. Style-Based Generator
    • Summary
    • Advanced StyleGAN: Mathematical Foundations and Architecture Deep Dive
    • 1. Contrastive Learning Framework
    • Summary
    • Advanced Contrastive Learning Theory
    • Advanced Adversarial Robustness Theory
    • 1. Adversarial Examples
    • 5. PGD Attack
    • Summary
    • 1. Knowledge Distillation
    • Summary
    • Advanced Knowledge Distillation Theory
    • 1. Point Cloud Basics
    • 2. T-Net (Transformation Network)
    • 3. PointNet Architecture
    • Summary
    • Advanced Point Cloud Networks Theory
    • 1. CycleGAN Theory
    • Summary
    • 1. Progressive Growing
    • Summary
    • Advanced Neural Network Interpretability Theory
    • 1. GradCAM Theory
    • Summary
    • Advanced Curriculum Learning Theory
    • 1. Curriculum Learning
    • Summary
    • 1. Catastrophic Forgetting
    • Summary
    • Advanced Continual Learning Theory
    • Advanced Continual Learning Theory
    • Advanced Continual Learning: Mathematical Foundations and Modern Approaches
    • Advanced Neural Architecture Search Theory
    • 1. DARTS: Differentiable Architecture Search
    • Summary
    • Advanced Neural Architecture Search Theory
    • Advanced Neural Architecture Search: Mathematical Foundations and Modern Methods
    • 1. Bahdanau Attention (Additive)
    • 2. Luong Attention (Multiplicative)
    • 3. Scaled Dot-Product Attention
    • 4. Multi-Head Attention
    • Summary
    • Advanced Attention Mechanisms: Mathematical Foundations and Modern Architectures
    • 1. Memory Networks Concept
    • 2. Content-Based Addressing
    • Summary
    • Advanced Memory Networks Theory
    • Advanced Memory Networks: Mathematical Foundations and Modern Architectures
    • 1. Capsule Networks Concept
    • 2. Dynamic Routing
    • 5. Margin Loss
    • Summary
    • Advanced Capsule Networks Theory
    • Advanced Capsule Networks Theory
    • Advanced Capsule Networks: Mathematical Foundations and Modern Architectures
    • 1. Score Matching
    • 3. Langevin Dynamics Sampling
    • 6. Annealed Langevin Dynamics
    • Summary
    • Advanced Score-Based Generative Models Theory
    • Advanced Score-Based Generative Models: Mathematical Foundations and Modern Architectures
    • 1. Energy-Based Models
    • 2. Contrastive Divergence
    • Summary
    • Advanced Energy-Based Models Theory
    • Advanced Energy-Based Models: Mathematical Foundations and Modern Architectures
    • 1. Mixture of Experts Concept
    • Summary
    • Advanced Mixture of Experts: Mathematical Foundations and Modern Architectures
    • 1. Implicit Neural Representations
    • Summary
    • Advanced Implicit Neural Representations Theory
    • 1. Gaussian Process Theory
    • 1.5. Gaussian Process Theory: Deep Mathematical Foundations
    • 7.5. Sparse Gaussian Processes: Scaling to Large Data
    • Summary
    • Advanced GP Topics and Extensions
    • Advanced Gaussian Processes Theory
    • 1. Bayesian Neural Networks
    • Bayesian Neural Networks: Deep Theory and Variational Inference
    • Summary
    • Advanced Bayesian Neural Networks Theory
  • Reinforcement Learning
    • Reinforcement Learning β€” Start Here
    • 01: Markov Decision Processes (MDPs)
    • 02: Value-Based Methods (Q-Learning)
    • 03: Deep Q-Networks (DQN)
    • 04: Policy-Based Methods (REINFORCE)
    • 05: Advanced Topics & Real-World Applications
    • 06: Practical Exercises & Implementations
  • Time Series Analysis & Forecasting
    • Time Series Analysis β€” Start Here
    • 01: Time Series Fundamentals
    • 02: Classical Statistical Methods
    • 03: Facebook Prophet
    • 04: Deep Learning for Time Series
    • 05: Advanced Techniques & Applications
    • 06: Practical Applications & Exercises
  • Causal Inference
    • Causal Inference β€” Start Here
    • 01: Causal Fundamentals
    • 02: Causal Graphs & DAGs
    • 03: Experimental Design
    • 04: Observational Methods
    • 05: Advanced Topics & Applications
    • 🎯 Quasi-Experimental Designs

Production

  • Practical Data Science
    • Data Science Interview Prep: The 30 Questions That Actually Come Up
    • Data Science Interview Prep β€” Part 2: Q16 to Q30
    • Practical Data Science
    • Computer Vision
      • Image Processing Basics: NumPy Arrays, PIL, and Computer Vision Fundamentals
      • CNNs From Scratch: Understanding Convolution, Pooling, and Classification
      • Transfer Learning: Leveraging Pretrained Models for Custom Image Tasks
      • Object Detection: From Bounding Boxes to Modern Detectors
      • Image Segmentation: Pixel-Level Understanding
    • Deep Learning & NLP
      • Transformers from Scratch: Self-Attention & Positional Encoding
      • BERT Text Classification: Fine-Tuning Transformers on Your Own Data
      • LLM Application Patterns: Building Production-Grade AI Features
      • Text Preprocessing: From Raw Text to Features
      • LLM Fine-Tuning with LoRA and QLoRA
    • Machine Learning
      • sklearn Pipelines: The Right Way to Build ML Workflows
      • Model Selection: Cross-Validation, Learning Curves & Bias-Variance
      • Ensemble Methods: Bagging, Boosting & Stacking
      • Imbalanced Datasets: Handling Class Imbalance Properly
      • Model Interpretability: SHAP, Permutation Importance & Partial Dependence
      • End-to-End ML Project: Customer Churn Prediction
    • Python for Data Science
      • Pandas Fundamentals: The Operations Every Data Scientist Must Know
      • Exploratory Data Analysis: A Systematic Framework
      • Data Visualization: From Exploratory Plots to Publication-Quality Figures
      • Data Cleaning Pipelines: From Messy Data to Model-Ready Features
      • Feature Engineering: Turning Raw Data into Model Fuel
    • Recommender Systems & Causal Inference
      • Collaborative Filtering: Building Recommender Systems from User Behavior
      • Content-Based Filtering: Recommending by Item Similarity
      • Neural Collaborative Filtering: Deep Learning for Recommendations
      • Causal Inference: Moving Beyond Correlation
      • Difference-in-Differences: Causal Inference from Natural Experiments
      • Building an A/B Testing Platform
    • Solutions
      • Solutions: Computer Vision Track
      • Solutions: Deep Learning & NLP Track
      • Solutions: Machine Learning Track
      • Solutions: Python & Data Science Track
      • Solutions: Recommender Systems & Causal Inference Track
      • Solutions: SQL & Data Engineering Track
      • Solutions: Statistics & MLOps Track
      • Solutions: Time Series & Forecasting Track
    • SQL & Data Engineering
      • Advanced SQL: Window Functions, CTEs, and Query Patterns That Actually Matter
      • SQL Query Optimization: From Slow to Fast in 10 Patterns
      • Data Pipelines with Airflow: DAGs, Operators, and Production Patterns
      • PySpark Fundamentals: Distributed Data Processing for Large Datasets
      • dbt: Data Modeling for Analytics Engineering
      • Streaming Data: Kafka, Windowed Aggregations, and Real-Time Pipelines
    • Statistics & MLOps
      • Hypothesis Testing: The Statistics Behind A/B Tests and Decisions
      • Bayesian Thinking: Updating Beliefs with Data
      • Model Deployment with FastAPI: From Notebook to Production API
      • ML Monitoring & Drift Detection: Keeping Models Healthy in Production
      • Feature Stores: From Training to Production Without Data Leakage
    • Time Series Forecasting
      • Time Series Fundamentals: Decomposition, Stationarity, and Autocorrelation
      • ARIMA & SARIMA: Statistical Forecasting for Practitioners
      • Prophet: Scalable Forecasting for Business Time Series
      • LSTM for Time Series: Sequence Modeling with Deep Learning
      • Anomaly Detection: Finding the Signal in Noisy Time Series
      • Forecasting Competition: ARIMA vs Prophet vs LSTM
  • AI Hardware & Validation
    • Section 1: Hardware Validation
    • Section 2: Kernel Validation
    • Section 3: Framework Validation
    • Section 4: Model Performance Validation
    • Section 5: End-to-End Pipeline Validation
    • Section 6: Distributed Training Validation
    • Section 7: Datacenter Validation
    • Section 8: Regression & Release Validation
    • Chapter 9: Industry AI Benchmarking & Performance Analysis
    • Part 1 β€” LLM Performance Metrics: What Gets Measured
    • Part 2 β€” API Performance Benchmarking
    • Part 3 β€” Hardware Benchmarking: AA-SLT (System Load Test)
    • Part 4 β€” Hardware Benchmarking: AA-AgentPerf
    • Part 5 β€” Intelligence Benchmarking
    • Part 6 β€” Multi-Modal Benchmarking
    • Part 7 β€” Other Industry Benchmarks
    • Exercises
    • Key Takeaways
    • Lab 01: Hardware Validation
    • Lab 02: Kernel Validation
    • Lab 03: Model Performance Validation
    • Lab 04: Regression & Release Validation Suite
    • Lab 05: Distributed Training Validation
    • Lab 06: Framework Validation
    • Lab 07 β€” GPGPU Backends: CoreML Β· DirectML Β· Vulkan
    • Part 1 β€” CoreML (Apple)
    • Part 2 β€” DirectML (Microsoft / Windows)
    • Part 3 β€” Vulkan (Cross-Platform)
    • Part 4 β€” Cross-Backend Parity Validation
    • Part 5 β€” Exercises
    • Key Takeaways
    • Lab 08 β€” Industry Benchmarking: Hands-On
    • Part 1 β€” TTFT & Output Speed Measurement
    • Part 2 β€” Mini AA-SLT (System Load Test)
    • Part 3 β€” SLO-Based Capacity Planning (AA-AgentPerf Style)
    • Part 4 – Hardware Comparison Dashboard
    • Part 5 β€” Mini Intelligence Eval Runner
    • Exercises
    • Key Takeaways
  • Inference Optimization & Model Serving
    • vLLM Quickstart: High-Throughput Model Serving

Developer Tools

  • AI-Powered Development with VS Code & GitHub Copilot
    • AI Coding Tools for ML Engineers (March 2026)
    • VS Code AI Setup Guide
    • MCP Deep Dive: Model Context Protocol
    • GitHub Copilot Customization Guide
    • VS Code + GitHub Copilot Workflows
    • Build a Simple MCP Server in Python
Back to top
View this page
Edit this page

Microsoft Open Source Code of ConductΒΆ

This project has adopted the Microsoft Open Source Code of Conduct.

Resources:

  • Microsoft Open Source Code of Conduct

  • Microsoft Code of Conduct FAQ

  • Contact opencode@microsoft.com with questions or concerns

Next
Data Science for Beginners - A Curriculum
Previous
Contributing
Copyright © MIT License 2026, Pavan Mudigonda
v0.1.0