Tunadorable | Poke

Tunadorable @UCeQhm8DwHBg_YEYY0KGM1GQ@youtube.com

14K subscribers

Oi! I am become change, confuser of subscribers Free 🇵🇸

Training LLMs on the entire connected web instead of individual sequences

Can AI change the way humanity communicates?

Flash-attention backward pass | Triton GPU Kernels 101 Lesson #10

Flash-attention forward pass | Triton GPU Kernels 101 Lesson #9

Compressing attention heads for BETTER performance?!?!

LayerNorm | Triton GPU Kernels 101 Lesson #8

Matmul | Triton GPU Kernels 101 Lesson #6

Dropout | Triton GPU Kernels 101 Lesson #7

Fused Softmax | Triton GPU Kernels 101 Lesson #5

Vector addition | Triton GPU Kernels 101 Lesson #4

How to use a cloud GPU | Triton GPU Kernels 101 Lesson #3

GPU Architecture Basics | Triton GPU Kernels 101 Lesson #2

Triton GPU Kernels 101: Syllabus day (Lesson #1)

Conversational Swarm Intelligence with Dr. Louis Rosenberg

Building models specifically for multi-GPU parallelization

GPU Kernels but with ✨pretty diagrams✨ to explain them

Has diffusion FINALLY been dethroned for image generation?!?!

How to make neural networks better at learning new things

Have we been doing LLM inference wrong the whole time?!?!

This month's HOTTEST🥵 new AI papers - February 2025

Train global AI models with Federated Learning using Flower (Tutorial / Interview)

How large groups can use AI to organize without leadership

MAJOR inference efficiency gain for diffusion models

Pre-train with patches for huge compute savings

Synthetic data that finally helps instead of hurts!!

Aligning AGI by converging its internal sense of self and other

Initializing large models with weights from smaller ones

Fine, I'll talk about the Meta papers y'all keep sending me

How diffusion modeling can revolutionize evolutionary algorithms

Let this method tune hyper-parameters for you!

Skimming hella new AI paper abstracts - January 2025

Mitigating frequency bias and anisotropy in LLM pre-training

2025 Yearly Theme and Productivity Scheme

Getting LLMs to look inward

Creating new tokens out of internal representations

Dark Representations: the error source SAEs cannot fit

Training GPTs on cellular automata

Training a GPT on massive amounts of fMRI data

Amateurs CAN do LLM research - but also, critiques are valuable

Michael Levin's newest paper - a commentary

Curating December's best new AI papers from ArXiv

Channel Update: what to expect over the next month+

Coding a single neuron w/ backprop in numpy | Deep-ML Code Challenges #25

Live-solving LeetCode #1679 - Max number of k-sum pairs

Live-solving LeetCode #334 - Increasing triplet subsequence

Live-solving LeetCode #1456 - Maximum number of vowels in a substring of given length

Live-solving LeetCode #283 - Move zeroes

Live-solving LeetCode #392 - Is subsequence

Live-solving LeetCode #11 - Container with most water

Live-solving LeetCode #1768 - Merge strings alternately

Live-solving LeetCode #1431 - Kids with the greatest number of candies

Live-solving LeetCode #1071 - Greatest common divisor of strings

Live-solving LeetCode #151 - Reverse words in a string

Live-solving LeetCode #605 - Can Place Flowers

Live-solving LeetCode #345 - Reverse Vowels of a String

Normalizing GPT on the unit-hypersphere (WITH CODE)

Theoretical physics of next token prediction in LLMs

Skimming hella AI paper abstracts - Nov 5, 2024

Every attention head explained