example

data, models, algorithms

regression

classification

tagging

search & rank

recommenders

sequence learning

reinforcement learning

markov decision processes

data, models, algorithms

regression

classification

tagging

search & rank

recommenders

sequence learning

reinforcement learning

markov decision processes

data manipulation

preprocessing

linear algebra

calculus

auto differentiation

probability

documentation

preprocessing

linear algebra

calculus

auto differentiation

probability

documentation

linear regression

LR from scratch

LR implementation

softmax regression

fashion-mnist

softmax from scratch

softmax implementation

LR from scratch

LR implementation

softmax regression

fashion-mnist

softmax from scratch

softmax implementation

intro

MLP from scratch

model selection, underfit, overfit

weight decay

dropout

fwd/back propagation

computational graphs

numerical stability

environmental concerns

kaggle - house prices

MLP from scratch

model selection, underfit, overfit

weight decay

dropout

fwd/back propagation

computational graphs

numerical stability

environmental concerns

kaggle - house prices

layers & blocks

parameters

deferred initialization

custom layers

file i/o

gpus

parameters

deferred initialization

custom layers

file i/o

gpus

dense layers to convolutions

image convolutions

padding, stride

channels

pooling

LeNet

image convolutions

padding, stride

channels

pooling

LeNet

alexnet

vgg

network-in-network (NiN)

googlenet

batch normalization

resnet

densenet

vgg

network-in-network (NiN)

googlenet

batch normalization

resnet

densenet

sequence models

text preprocessing

language models

rnns

rnns from scratch

implementation

backprop through time

text preprocessing

language models

rnns

rnns from scratch

implementation

backprop through time

gated recurrent unit (grus)

lstms

deep rnns

bidirectional rnns

machine translation

encoder-decoders

sequence-to-sequence

beam search

lstms

deep rnns

bidirectional rnns

machine translation

encoder-decoders

sequence-to-sequence

beam search

intro

sequence-to-sequence

transformers

sequence-to-sequence

transformers

intro

convexity

gradient descent

stochastic gradient descent

minibatch SGD

momentum

adagrad

rmsprop

adadelta

adam

learning rate scheduling

convexity

gradient descent

stochastic gradient descent

minibatch SGD

momentum

adagrad

rmsprop

adadelta

adam

learning rate scheduling

compilers & interpreters

async computation

auto parallelism

hardware

multiple GPUs

parameter servers

async computation

auto parallelism

hardware

multiple GPUs

parameter servers

image augmentation

fine tuning

object detection / bounding boxes

anchor boxes

multiscale OD

pikachu dataset

single-shot multibox detect

region-based CNNs (R-CNNs)

semantic segmentation

transposed convolution

fully convolutional nets (FCNs)

neural style transfer

CIFAR-10 image class on Kaggle

Imagenet/Dogs on Kaggle

fine tuning

object detection / bounding boxes

anchor boxes

multiscale OD

pikachu dataset

single-shot multibox detect

region-based CNNs (R-CNNs)

semantic segmentation

transposed convolution

fully convolutional nets (FCNs)

neural style transfer

CIFAR-10 image class on Kaggle

Imagenet/Dogs on Kaggle

word2vec

word2vec - approx training

word2vec dataset

word2vec implementation

subword embedding (fasttext)

word embed - global vectors (GloVe)

synonyms & analogies

text sentiment classification

text sentiment using RNNs

text sentiment using textCNN

word2vec - approx training

word2vec dataset

word2vec implementation

subword embedding (fasttext)

word embed - global vectors (GloVe)

synonyms & analogies

text sentiment classification

text sentiment using RNNs

text sentiment using textCNN

overview

MovieLens

matrix factorization

AutoRec with autoencoders

personalized rankings

collaborative filtering

sequence-aware recommenders

feature-rich recommenders

factorization machines

deep factorization machines

MovieLens

matrix factorization

AutoRec with autoencoders

personalized rankings

collaborative filtering

sequence-aware recommenders

feature-rich recommenders

factorization machines

deep factorization machines

intro

deep convolutional GANs

deep convolutional GANs

geometry

linear algebra

eigendecomposition

single-variable calculus

multi-variable calculus

integral calculus

random variables

max likelihood

naive bayes

statistics

info theory

linear algebra

eigendecomposition

single-variable calculus

multi-variable calculus

integral calculus

random variables

max likelihood

naive bayes

statistics

info theory

jupyter

amazon sagemaker

aws ec2

google colab

servers & gpus

amazon sagemaker

aws ec2

google colab

servers & gpus