Channel Avatar

Yannic Kilcher @UCZHmQk67mSJgfCCTn7xBfew@youtube.com

255K subscribers - no pronouns set

I make videos about machine learning research papers, progra


57:00
xLSTM: Extended Long Short-Term Memory
29:22
[ML News] OpenAI is in hot waters (GPT-4o, Ilya Leaving, Scarlett Johansson legal action)
33:26
ORPO: Monolithic Preference Optimization without Reference Model (Paper Explained)
39:14
[ML News] Chips, Robots, and Models
37:01
TransformerFAM: Feedback attention is working memory
17:47
[ML News] Devin exposed | NeurIPS track for high school students
37:17
Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
31:19
[ML News] Llama 3 changes the game
18:01
Hugging Face got hacked
09:55
[ML News] Microsoft to spend 100 BILLION DOLLARS on supercomputer (& more industry news)
27:32
[ML News] Jamba, CMD-R+, and other new models (yes, I know this is like a week behind 🙃)
56:16
Flow Matching for Generative Modeling (Paper Explained)
44:05
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping (Searchformer)
27:00
[ML News] Grok-1 open-sourced | Nvidia GTC | OpenAI leaks model names | AI Act
26:50
[ML News] Devin AI Software Engineer | GPT-4.5-Turbo LEAKED | US Gov't Report: Total Extinction
53:15
[ML News] Elon sues OpenAI | Mistral Large | More Gemini Drama
15:12
No, Anthropic's Claude 3 is NOT sentient
42:34
[ML News] Groq, Gemma, Sora, Gemini, and Air Canada's chatbot troubles
17:36
Gemini has a Diversity Problem
50:03
V-JEPA: Revisiting Feature Prediction for Learning Visual Representations from Video (Explained)
01:23:59
What a day in AI! (Sora, Gemini 1.5, V-JEPA, and lots of news)
54:24
Lumiere: A Space-Time Diffusion Model for Video Generation (Paper Explained)
35:27
AlphaGeometry: Solving olympiad geometry without human demonstrations (Paper Explained)
34:32
Mixtral of Experts (Paper Explained)
03:40
Until the Litter End
31:45
LLaMA Pro: Progressive LLaMA with Block Expansion (Paper Explained)
08:17
I created an AI-powered Social Network
57:52
NeurIPS 2023 Poster Session 4 (Thursday Morning)
08:26
Art @ NeurIPS 2023
40:40
Mamba: Linear-Time Sequence Modeling with Selective State Spaces (Paper Explained)
33:22
Another Hit Piece on Open-Source AI
33:53
NeurIPS 2023 Poster Session 3 (Wednesday Evening)
44:17
NeurIPS 2023 Poster Session 2 (Wednesday Morning)
08:20
NeurIPS 2023 Vendor Hall
19:03
NeurIPS 2023 Poster Session 1 (Tuesday Evening)
15:47
Did Google fake their Gemini Video?
37:06
Text Embeddings Reveal (Almost) As Much As Text
47:38
Scalable Extraction of Training Data from (Production) Language Models (Paper Explained)
45:44
What is Q-Learning (back to basics)
18:10
Greg & Sam are BACK! (+ Q-Star is AGI) (Also Memes)
09:59
Is Sam Altman coming back? (OpenAI drama continues)
20:08
OpenAI just fired CEO Sam Altman
21:51
I built the most expensive CPU ever! (Every instruction is a prompt)
11:49
OpenAssistant is Completed
32:27
Efficient Streaming Language Models with Attention Sinks (Paper Explained)
46:45
Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution (Paper Explained)
28:26
Retentive Network: A Successor to Transformer for Large Language Models (Paper Explained)
53:07
Reinforced Self-Training (ReST) for Language Modeling (Paper Explained)
44:11
[ML News] LLaMA2 Released | LLMs for Robots | Multimodality on the Rise
29:10
How Cyber Criminals Are Using ChatGPT (w/ Sergey Shykevich)
07:07
Recipe AI suggests FATAL CHLORINE GAS Recipe
53:32
DeepFloyd IF - Pixel-Based Text-to-Image Diffusion (w/ Authors)
31:05
[ML News] GPT-4 solves MIT Exam with 100% ACCURACY | OpenLLaMA 13B released
35:45
Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust (Explained)
01:02:17
RWKV: Reinventing RNNs for the Transformer Era (Paper Explained)
29:29
Tree of Thoughts: Deliberate Problem Solving with Large Language Models (Full Paper Review)
16:13
OpenAI suggests AI licenses (US Senate hearing on AI regulation w/ Sam Altman)
39:07
[ML News] Geoff Hinton leaves Google | Google has NO MOAT | OpenAI down half a billion
24:34
Scaling Transformer to 1M tokens and beyond with RMT (Paper Explained)
21:06
OpenAssistant RELEASED! The world's best open-source Chat AI!