Deep Learning
28 videos • 764 views • by DataMListic
1. Why Batch Normalization (batchnorm) Works
2. Capsule Networks Explained | Why Using Pooling is a Bad Idea
3. Why Deep Neural Networks (DNNs) Underperform Tree-Based Models on Tabular Data
4. AMSGrad - Why Adam FAILS to Converge
5. Why Neural Networks Can Learn Any Function | The Universal Approximation Theorem
6. Why Residual Connections (ResNet) Work
7. Why Neural Networks (NN) Are Deep | The Number of Linear Regions of Deep Neural Networks
8. Why ReLU Is Better Than Other Activation Functions | Tanh Saturating Gradients
9. Why The Reset Gate is Necessary in GRUs
10. Why Recurrent Neural Networks (RNN) Suffer from Vanishing Gradients - Part 2
11. Why We Need Activation Functions In Neural Networks
12. Why Convolutional Neural Networks Are Not Permutation Invariant
13. Why Recurrent Neural Networks Suffer from Vanishing Gradients - Part 1
14. Multi-Head Attention (MHA), Multi-Query Attention (MQA), Grouped Query Attention (GQA) Explained
15. How to Fine-tune Large Language Models Like ChatGPT with Low-Rank Adaptation (LoRA)
16. Gated Recurrent Unit (GRU) Equations Explained
17. Long Short-Term Memory (LSTM) Equations Explained
18. LLM Prompt Engineering with Random Sampling: Temperature, Top-k, Top-p
19. Two Towers vs Siamese Networks vs Triplet Loss - Compute Comparable Embeddings
20. LLM Tokenizers Explained: BPE Encoding, WordPiece and SentencePiece
21. The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits - Paper Explained
22. Chain-of-Verification (COVE) Reduces Hallucination in Large Language Models - Paper Explained
23. RLHF: Training Language Models to Follow Instructions with Human Feedback - Paper Explained
24. BART Explained: Denoising Sequence-to-Sequence Pre-training
25. Sliding Window Attention (Longformer) Explained
26. BLEU Score Explained
27. ROUGE Score Explained
28. Vector Database Search - Hierarchical Navigable Small Worlds (HNSW) Explained