My Minimal Implementation of Commonly-used Machine Learning Algorithms using Only Numpy
Simple-Implementation-of-ML-Algorithms
My simplest implementations of common ML algorithms from scratch.
- For ease of understanding, most of the code implements only a minimal version of each algorithm.
- The implementations use only numpy.
- Most algorithms fit within 100 lines of code.
Example Output
See the decision boundary visualizations for the implemented classifiers in decision_boundary_vis.py.
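
As a rough illustration of how such a plot can be produced (not the script itself), here is a minimal sketch assuming a fitted classifier with a `predict` method over 2-D inputs and matplotlib for display; the function and argument names are illustrative:

```python
import numpy as np
import matplotlib.pyplot as plt

def plot_decision_boundary(clf, x, y, steps=200):
    # Evaluate the fitted classifier on a dense grid spanning the 2-D data.
    x_min, x_max = x[:, 0].min() - 1, x[:, 0].max() + 1
    y_min, y_max = x[:, 1].min() - 1, x[:, 1].max() + 1
    xx, yy = np.meshgrid(np.linspace(x_min, x_max, steps),
                         np.linspace(y_min, y_max, steps))
    grid = np.c_[xx.ravel(), yy.ravel()]
    zz = clf.predict(grid).reshape(xx.shape)  # integer class labels
    # Filled contour of the predicted labels, with the training points on top.
    plt.contourf(xx, yy, zz, alpha=0.3)
    plt.scatter(x[:, 0], x[:, 1], c=y, edgecolors='k')
    plt.show()
```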

See the results from the generative models trained on the MNIST dataset:
Restricted Boltzmann Machine (RBM):

Variational Autoencoders (VAE):

Deep Convolutional Generative Adversarial Network (DCGAN):

Implemented algorithms (minimal illustrative sketches of several of these follow after the list):
- Regression Models
  - Linear Regression
    - Matrix solver
    - SGD/Adam solver
    - L1 regularization (Lasso)
    - L2 regularization (Ridge)
  - Logistic Regression
    - Multi-class prediction
  - Factorization Machines
    - Regularization
    - Classification/regression
- Bayes Models
  - Naive Bayes
    - Multinomial model
    - Document tokenizer
  - Bayesian Network
    - Conditional probability MLE
    - Bayesian inference
- Tree Models and Ensemble Learning
  - Decision Tree
    - Classification/regression
    - Different metrics
    - Feature importances
    - Sample weights
  - Random Forest
  - AdaBoost
  - Gradient Boosting Decision Tree
    - Shrinkage
    - Line search of multiplier
  - XGBoost
    - XGBoost Regression Tree
    - Shrinkage
- Deep Learning
  - Architecture
    - Multilayer Perceptron
    - Restricted Boltzmann Machine
    - Deep Belief Network
    - Variational Autoencoder (VAE)
    - Convolutional Neural Network
      - Convolutional layer with vectorized img2col and col2img
    - Recurrent Neural Network
      - Backpropagation through time (BPTT)
      - Long Short-Term Memory (LSTM)
    - Generative Adversarial Network (GAN)
    - Deep Q-Network (reinforcement learning)
  - Layers
    - Feedforward (dense) layer
    - Convolutional layer
    - Max pooling layer
    - Batch normalization layer
    - Softmax layer for classification
    - Activation layers
      - ReLU (Leaky)
      - Tanh (Leaky)
      - Sigmoid
    - WIP: Dropout layer
  - Training
    - Mini-batch
    - He initialization
    - Loss functions
      - Mean squared error for regression
      - Cross entropy for classification
      - Log loss for classification
    - L1/L2 regularization
    - Gradient check
- Optimization Algorithms (see implementations in MLP)
  - Stochastic Gradient Descent
  - Gradient Descent with Momentum
  - Nesterov Momentum
  - AdaGrad
  - RMSProp
  - Adam
- k-Nearest Neighbors
- Support Vector Machine
  - Soft boundary
  - SMO algorithm
  - Different heuristics for selecting pairs in SMO
- Genetic Algorithm
  - Training a NN model
  - Selection by fitness
  - Crossover approaches
  - Mutation rate
- Hidden Markov Model
  - Fitting by Baum-Welch
  - Prediction by Viterbi
- WIP
  - Add results to this README
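
The Linear Regression item above lists a matrix (closed-form) solver and L2 regularization (Ridge). A minimal numpy-only sketch of that combination via the regularized normal equations; the class and parameter names here are illustrative, not the repo's API:

```python
import numpy as np

class RidgeRegression:
    def __init__(self, alpha=1.0):
        self.alpha = alpha  # L2 penalty strength

    def fit(self, x, y):
        # Append a bias column, then solve (X^T X + alpha * I) w = X^T y.
        xb = np.c_[x, np.ones(len(x))]
        reg = self.alpha * np.eye(xb.shape[1])
        reg[-1, -1] = 0.0  # do not penalize the bias term
        self.w = np.linalg.solve(xb.T @ xb + reg, xb.T @ y)
        return self

    def predict(self, x):
        return np.c_[x, np.ones(len(x))] @ self.w

# toy usage
x = np.random.randn(100, 3)
y = x @ np.array([1.5, -2.0, 0.5]) + 0.1 * np.random.randn(100)
print(RidgeRegression(alpha=0.1).fit(x, y).w)  # close to [1.5, -2.0, 0.5, 0.0]
```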
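
The Naive Bayes item above uses the multinomial model over token counts. A minimal sketch with Laplace smoothing, assuming the documents are already tokenized into a count matrix (the repo's tokenizer is not reproduced here; names are illustrative):

```python
import numpy as np

class MultinomialNB:
    def fit(self, x, y, alpha=1.0):
        # x: (n_docs, n_words) token counts; y: integer class labels; alpha: Laplace smoothing.
        self.classes = np.unique(y)
        counts = np.array([x[y == c].sum(axis=0) for c in self.classes])
        self.log_likelihood = np.log((counts + alpha) /
                                     (counts + alpha).sum(axis=1, keepdims=True))
        self.log_prior = np.log(np.array([(y == c).mean() for c in self.classes]))
        return self

    def predict(self, x):
        # argmax_c [ log P(c) + sum_w count_w * log P(w | c) ]
        return self.classes[np.argmax(x @ self.log_likelihood.T + self.log_prior, axis=1)]
```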
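
The Decision Tree item above supports classification with different split metrics. A minimal sketch of a Gini-based classification tree with greedy exhaustive splits and a fixed maximum depth; the function names are illustrative, and the repo's feature importances and sample weights are omitted:

```python
import numpy as np

def gini(y):
    # Gini impurity of a vector of class labels.
    _, counts = np.unique(y, return_counts=True)
    p = counts / len(y)
    return 1.0 - np.sum(p ** 2)

def best_split(x, y):
    # Exhaustive search for the (feature, threshold) minimizing weighted child impurity.
    best = (None, None, gini(y))
    for j in range(x.shape[1]):
        for t in np.unique(x[:, j]):
            left = x[:, j] <= t
            if left.all() or not left.any():
                continue
            score = left.mean() * gini(y[left]) + (~left).mean() * gini(y[~left])
            if score < best[2]:
                best = (j, t, score)
    return best

def build_tree(x, y, depth=3):
    # y: non-negative integer class labels.
    if depth == 0 or len(np.unique(y)) == 1:
        return np.bincount(y).argmax()          # leaf: majority class
    j, t, _ = best_split(x, y)
    if j is None:                               # no split improves the impurity
        return np.bincount(y).argmax()
    left = x[:, j] <= t
    return (j, t, build_tree(x[left], y[left], depth - 1),
            build_tree(x[~left], y[~left], depth - 1))

def predict_one(tree, xi):
    while isinstance(tree, tuple):
        j, t, l, r = tree
        tree = l if xi[j] <= t else r
    return tree

# toy usage: the label is 1 exactly when the first feature exceeds 0.5
x = np.random.rand(200, 2)
y = (x[:, 0] > 0.5).astype(int)
print(predict_one(build_tree(x, y, depth=2), np.array([0.9, 0.1])))  # expect 1
```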
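
The Layers listed under Deep Learning each expose a forward and a backward pass. A minimal sketch of a feedforward (dense) layer with He initialization and a plain SGD update, showing the general shape of such an interface; the class and method names are illustrative, not the repo's:

```python
import numpy as np

class Dense:
    def __init__(self, n_in, n_out, lr=0.01):
        # He initialization, as listed under Training above.
        self.w = np.random.randn(n_in, n_out) * np.sqrt(2.0 / n_in)
        self.b = np.zeros(n_out)
        self.lr = lr

    def forward(self, x):
        self.x = x  # cache the input for the backward pass
        return x @ self.w + self.b

    def backward(self, grad_out):
        # Gradients w.r.t. the parameters and the input, then a plain SGD step.
        grad_w = self.x.T @ grad_out
        grad_b = grad_out.sum(axis=0)
        grad_in = grad_out @ self.w.T
        self.w -= self.lr * grad_w
        self.b -= self.lr * grad_b
        return grad_in
```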
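
Among the Optimization Algorithms above, Adam maintains bias-corrected moving averages of the gradient and its square. A minimal standalone sketch of the update rule (hyperparameter defaults follow the original Adam paper; the names are illustrative):

```python
import numpy as np

class Adam:
    def __init__(self, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
        self.lr, self.beta1, self.beta2, self.eps = lr, beta1, beta2, eps
        self.m = self.v = None
        self.t = 0

    def update(self, w, grad):
        # Exponential moving averages of the gradient and the squared gradient.
        if self.m is None:
            self.m, self.v = np.zeros_like(w), np.zeros_like(w)
        self.t += 1
        self.m = self.beta1 * self.m + (1 - self.beta1) * grad
        self.v = self.beta2 * self.v + (1 - self.beta2) * grad ** 2
        # Bias-corrected estimates, then the scaled step.
        m_hat = self.m / (1 - self.beta1 ** self.t)
        v_hat = self.v / (1 - self.beta2 ** self.t)
        return w - self.lr * m_hat / (np.sqrt(v_hat) + self.eps)

# toy usage: minimize ||w - 3||^2, whose gradient is 2 * (w - 3)
w, opt = np.zeros(5), Adam(lr=0.1)
for _ in range(500):
    w = opt.update(w, 2 * (w - 3))
print(w)  # close to 3
```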
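
For k-Nearest Neighbors, a minimal brute-force sketch using Euclidean distance and majority voting; the names are illustrative:

```python
import numpy as np

def knn_predict(x_train, y_train, x_test, k=5):
    # Pairwise squared Euclidean distances between test and training points.
    d = ((x_test[:, None, :] - x_train[None, :, :]) ** 2).sum(axis=-1)
    # Majority vote among the k nearest training labels (y_train: non-negative ints).
    nearest = np.argsort(d, axis=1)[:, :k]
    return np.array([np.bincount(y_train[idx]).argmax() for idx in nearest])
```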
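
The Support Vector Machine above is trained with SMO, which is too involved to sketch briefly. As a much simpler stand-in that is not the repo's approach, here is a sub-gradient descent sketch of the soft-margin (hinge-loss) objective for a linear SVM:

```python
import numpy as np

def linear_svm_sgd(x, y, lam=0.01, epochs=200, lr=0.1):
    # y must be in {-1, +1}; minimizes lam/2 * ||w||^2 + mean hinge loss.
    n, d = x.shape
    w, b = np.zeros(d), 0.0
    for _ in range(epochs):
        # Full-batch sub-gradient step on the regularized hinge-loss objective.
        margin = y * (x @ w + b)
        viol = margin < 1                       # points violating the soft margin
        w -= lr * (lam * w - (y[viol][:, None] * x[viol]).sum(axis=0) / n)
        b += lr * y[viol].sum() / n             # d/db of the mean hinge loss is -sum(y[viol]) / n
    return w, b

# toy usage on linearly separable data
x = np.random.randn(200, 2)
y = np.where(x[:, 0] + x[:, 1] > 0, 1, -1)
w, b = linear_svm_sgd(x, y)
print(np.mean(np.sign(x @ w + b) == y))  # training accuracy, close to 1.0
```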
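
The Genetic Algorithm item above combines fitness-based selection, crossover, and mutation (in the repo it is used to train a NN). A minimal sketch on real-valued vectors with a toy fitness function; every name and constant here is illustrative:

```python
import numpy as np

def evolve(fitness, dim, pop_size=50, generations=200, mutation_rate=0.1):
    rng = np.random.default_rng(0)
    pop = rng.normal(size=(pop_size, dim))
    for _ in range(generations):
        scores = np.array([fitness(ind) for ind in pop])
        # Selection: parents sampled with probability proportional to fitness rank.
        ranks = np.empty(pop_size)
        ranks[np.argsort(scores)] = np.arange(1, pop_size + 1)
        parents = pop[rng.choice(pop_size, size=(pop_size, 2), p=ranks / ranks.sum())]
        # Crossover: uniform mix of the two parents' genes.
        mask = rng.random((pop_size, dim)) < 0.5
        pop = np.where(mask, parents[:, 0], parents[:, 1])
        # Mutation: add Gaussian noise to a random fraction of genes.
        mutate = rng.random((pop_size, dim)) < mutation_rate
        pop = pop + mutate * rng.normal(scale=0.5, size=(pop_size, dim))
    return pop[np.argmax([fitness(ind) for ind in pop])]

# toy usage: maximize -||w - 1||^2, i.e. push every gene toward 1
print(evolve(lambda w: -np.sum((w - 1) ** 2), dim=8))
```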
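
For the Hidden Markov Model, prediction by Viterbi finds the most likely hidden-state path. A minimal log-space sketch, assuming start, transition, and emission probabilities from an already-fitted model; the names and the toy numbers are illustrative:

```python
import numpy as np

def viterbi(obs, start_p, trans_p, emit_p):
    # obs: observation indices; start_p (S,), trans_p (S, S), emit_p (S, V).
    log_delta = np.log(start_p) + np.log(emit_p[:, obs[0]])
    back = []
    for o in obs[1:]:
        # For every next state, keep the best previous state.
        scores = log_delta[:, None] + np.log(trans_p)   # (prev, next)
        back.append(scores.argmax(axis=0))
        log_delta = scores.max(axis=0) + np.log(emit_p[:, o])
    # Trace the most likely state path backwards through the pointers.
    path = [int(log_delta.argmax())]
    for ptr in reversed(back):
        path.append(int(ptr[path[-1]]))
    return path[::-1]

# toy usage: 2 hidden states, 3 observation symbols
start = np.array([0.6, 0.4])
trans = np.array([[0.7, 0.3], [0.4, 0.6]])
emit = np.array([[0.5, 0.4, 0.1], [0.1, 0.3, 0.6]])
print(viterbi([0, 1, 2, 2], start, trans, emit))
```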
Feel free to use the code. Please contact me if you have any questions :)
