Statistics and Mathematics for Machine Learning, Deep Learning , Deep NLP

Overview

Stat4ML

Statistics and Mathematics for Machine Learning, Deep Learning , Deep NLP

This is the first course from our trio courses:

  1. Statistics Foundation for ML

https://github.com/Bellman281/Stat4ML/

  1. Introduction to Statistical Learning https://github.com/Bellman281/Intro_Statistical_Learning

  2. Advanced Statistical Learning for DL ( to be anounced)

Registration Form for cohort 2 of STAT4ML:

https://forms.gle/ZqLJLmv1K5nGVx3m7

Notes about the course:

Instructor : Omid Safarzadeh,

LinkedIn: https://www.linkedin.com/in/omidsafarzadeh/

IG : @deepdatascientists

Course Text Book: Statistical Inference 2nd Edition by George Casella (Author), Roger L. Berger (Author) :

https://www.amazon.com/Statistical-Inference-George-Casella-dp-0534243126/dp/0534243126/ref=mt_other?_encoding=UTF8&me=&qid=

Pre Requisitives

Recall from Calculus:

    Derivative
          Chain rule
    Integral
          Techniques of Integration
          Substitution
    Integration by parts

Matrix Algebra Review:

    Matrix operations
    Matrix Multiplication
       Properties of determinants
       Inverse Matrix
       Matrix Transpose
       Properties of transpose
    Partioned Matrices
    Eigenvalues and Eigenvectors
    Matrix decomposition
       LU decomposition
       Cholesky decomposition
       QR decomposition
       SVD
    Matrix Differentiation

Course 1 :

Slide 1 : Probability Theory Foundation

 Sample Space
 Probability Theory Foundation
    Axiomatic Foundations
    The Calculus of Probabilities
 Independence
 Conditional Probability
    Bayes Theorem
 Random Variables
 Probability Function
    Distribution Functions
    Density function

Slide 2: Moments

   Moments
       Expected Value
       Variance
       Covariance and Correlation
   Moment Generating Functions
       Normal mgf
   Matrix Notation for Moments

Slide 3: Distribution Functions

   Distributions
     Discrete Distribution
       Discrete Uniform Distribution
       Binomial Distribution
       Poisson Distribution
     Continuous Distribution
       Uniform Distribution
       Exponential Distribution
       Normal Distribution
       Lognormal Distribution
       Laplace Distribution
       Beta Distribution

Slide 4: Conditional and Multivariate Distributions

Joint and Marginal Distribution
Conditional Distributions and Independence
Bivariate Transformations
Hierarchical Models and Mixture Distribution
Bivariate Normal Distribution
Multivariate Distribution

Slide 5: Convergence Concepts

Random Samples
   Sums of Random Variable from a Random Sample
Inequalities
Convergence Concepts:
   Almost Sure Convergence
   Convergence in Probability
   Convergence in Distribution
The Delta Method

Slide 6: Maximum Likelihood Estimation

Maximum Likelihood Estimation
  Motivation and the Main Ideas
  Properties of the Maximum Likelihood Estimator

Slide 7: Bayesian and posterior distribution Estimation

   Computing the posterior
   Maximum likelihood estimation (MLE)
Maximum a posteriori (MAP) estimation
   Posterior mean
   MAP properties
Bayesian linear regression
Owner
Omid Safarzadeh
Deep Learning Expert, Kaggler
Omid Safarzadeh
Text Classification in Turkish Texts with Bert

You can watch the details of the project on my youtube channel Project Interface Project Second Interface Goal= Correctly guessing the classification

42 Dec 31, 2022
Implemented shortest-circuit disambiguation, maximum probability disambiguation, HMM-based lexical annotation and BiLSTM+CRF-based named entity recognition

Implemented shortest-circuit disambiguation, maximum probability disambiguation, HMM-based lexical annotation and BiLSTM+CRF-based named entity recognition

0 Feb 13, 2022
Conditional Transformer Language Model for Controllable Generation

CTRL - A Conditional Transformer Language Model for Controllable Generation Authors: Nitish Shirish Keskar, Bryan McCann, Lav Varshney, Caiming Xiong,

Salesforce 1.7k Dec 28, 2022
Open solution to the Toxic Comment Classification Challenge

Starter code: Kaggle Toxic Comment Classification Challenge More competitions 🎇 Check collection of public projects 🎁 , where you can find multiple

minerva.ml 153 Jun 22, 2022
pyMorfologik MorfologikpyMorfologik - Python binding for Morfologik.

Python binding for Morfologik Morfologik is Polish morphological analyzer. For more information see http://github.com/morfologik/morfologik-stemming/

Damian Mirecki 18 Dec 29, 2021
GooAQ 🥑 : Google Answers to Google Questions!

This repository contains the code/data accompanying our recent work on long-form question answering.

AI2 112 Nov 06, 2022
LightSeq: A High-Performance Inference Library for Sequence Processing and Generation

LightSeq is a high performance inference library for sequence processing and generation implemented in CUDA. It enables highly efficient computation of modern NLP models such as BERT, GPT2, Transform

Bytedance Inc. 2.5k Jan 03, 2023
Python3 to Crystal Translation using Python AST Walker

py2cr.py A code translator using AST from Python to Crystal. This is basically a NodeVisitor with Crystal output. See AST documentation (https://docs.

66 Jul 25, 2022
NeoDays-based tileset for the roguelike CDDA (Cataclysm Dark Days Ahead)

NeoDaysPlus Reduced contrast, expanded, and continuously developed version of the CDDA tileset NeoDays that's being completed with new sprites for mis

0 Nov 12, 2022
AI and Machine Learning workflows on Anthos Bare Metal.

Hybrid and Sovereign AI on Anthos Bare Metal Table of Contents Overview Terraform as IaC Substrate ABM Cluster on GCE using Terraform TensorFlow ResNe

Google Cloud Platform 8 Nov 26, 2022
Khandakar Muhtasim Ferdous Ruhan 1 Dec 30, 2021
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Espresso Espresso is an open-source, modular, extensible end-to-end neural automatic speech recognition (ASR) toolkit based on the deep learning libra

Yiming Wang 919 Jan 03, 2023
LOT: A Benchmark for Evaluating Chinese Long Text Understanding and Generation

LOT: A Benchmark for Evaluating Chinese Long Text Understanding and Generation Tasks | Datasets | LongLM | Baselines | Paper Introduction LOT is a ben

46 Dec 28, 2022
A repo for materials relating to the tutorial of CS-332 NLP

CS-332-NLP A repo for materials relating to the tutorial of CS-332 NLP Contents Tutorial 1: Introduction Corpus Regular expression Tokenization Tutori

Alok singh 9 Feb 15, 2022
A high-level Python library for Quantum Natural Language Processing

lambeq About lambeq is a toolkit for quantum natural language processing (QNLP). Documentation: https://cqcl.github.io/lambeq/ Getting started Prerequ

Cambridge Quantum 315 Jan 01, 2023
Easy to start. Use deep nerual network to predict the sentiment of movie review.

Easy to start. Use deep nerual network to predict the sentiment of movie review. Various methods, word2vec, tf-idf and df to generate text vectors. Various models including lstm and cov1d. Achieve f1

1 Nov 19, 2021
Code examples for my Write Better Python Code series on YouTube.

Write Better Python Code This repository contains the code examples used in my Write Better Python Code series published on YouTube: https:/

858 Dec 29, 2022
Machine Psychology: Python Generated Art

Machine Psychology: Python Generated Art A limited collection of 64 algorithmically generated artwork. Each unique piece is then given a title by the

Pixegami Team 67 Dec 13, 2022
A text file containing 479k English words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion

List Of English Words A text file containing over 466k English words. While searching for a list of english words (for an auto-complete tutorial) I fo

dwyl 8.5k Jan 03, 2023
Python utility library for compositing PDF documents with reportlab.

pdfdoc-py Python utility library for compositing PDF documents with reportlab. Installation The pdfdoc-py package can be installed directly from the s

Michael Gale 1 Jan 06, 2022