Channel Avatar

Rohan-Paul-AI @UC0_a8SNpTFkmVv5SLMs1CIA@youtube.com

12K subscribers - no pronouns :c

Follow me on 🐦 TWITTER: twitter.com/rohanpaul_ai - to rema


07:32
Training Large Language Models for Reasoning through Reverse Curriculum RL - Audio Podcast
06:18
Self-Taught Evaluators - Audio Podcast
06:17
MedPromptExtract (Medical Data Extraction Tool) - Audio Podcast
08:27
MASAI: Modular Architecture for Software-engineering AI agents - Audio Podcast
07:04
Imitating Language via Scalable Inverse Reinforcement Learning - Audio Podcast
06:30
GraphInstruct: Empowering LLMs with Graph Understanding and Reasoning Capability- Audio Podcast
07:52
Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through - Audio Podcast
06:40
Adaptive Self-Supervised Learning Strategies For On-Device LLM Personalization - Audio Podcast
05:49
The Impact of Initialization on LoRA Finetuning Dynamics - Audio Podcast
05:07
LoRAMoE: Alleviate World Knowledge Forgetting in LLMs via MoE-Style Plugin - Audio Podcast
04:11
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning - Audio Podcast
07:25
LoRA+ Efficient Low Rank Adaptationof Large Models - Audio Podcast
05:49
ReFT: Representation Finetuning for Language Models - Audio Podcast
06:22
Leave No Context Behind-Efficient Infinite Context Transformers - Audio Podcast
06:24
Michelangelo: Long Context Evaluations Beyond Haystacks via Latent Structure Queries - Audio Podcast
05:47
TextGrad: Automatic "Differentiation" via Text - Audio Podcast
06:16
Paper - "REFT: Reasoning with Reinforced Fine-Tuning - Audio Podcast
05:32
Paper - "Adaptable Logical Control for Large Language Models" - Audio Podcast
07:41
Paper - Breaking reCAPTCHAv2 - Audio Podcast
07:32
Paper - "ARES: Alternating Reinforcement Learning and Supervised Fine-Tuning" - Audio Podcast
07:30
Paper - "Agents in Software Engineering: Survey, Landscape, and Vision" - Audio Podcast
06:18
SCHRODINGER’S MEMORY: LARGE LANGUAGE MODELS - Audio Podcast
06:38
LLMs Still Cant Plan Can LRMs? - OpenAIs o1 on PlanBench - Audio Podcast
08:14
AI Paper Writing in the Margins - Audio Podcast
07:42
Sam Altman's newly published personal blog about the AI future. Audio Podcast by NotebookLM
08:11
Secret behind SambaNova superfast LLM Inferencing Speed - Audio Podcast
05:35
First open-source multimodal math dataset boosts MLLM performance - Podcast
09:18
New Harvard Business School study shows that AI girlfriends reduce loneliness - Audio Podcast
05:47
Paper Podcast - LLM Pruning and Distillation by NVIDIA
06:29
Training LLM to Self-Correct via Reinforcement Learning - Audio Podcast with Google NotebookLM
07:05
Training Language Models to Self-Correct via Reinforcement Learning - Audio Podcast
07:01
Iteration of Thought (IoT): Leveraging Inner Dialogue for Autonomous LLM Reasoning- Audio Podcast
15:22
Natural Language Processing Basics - Audio Podcast
06:51
CPL: CRITICAL PLANNING STEP LEARNING BOOSTS LLM Generalization in Reasoning Tasks - Audio Podcast
07:04
Paper "Improving LLM Reasoning with Multi-Agent Tree-of-Thought Validator Agent - Audio Podcast
06:02
AI Paper - LoRA Learns Less and Forgets Less ✨- Audio Podcast
05:07
Very Powerful but very simple Prompting technique - Simply ask the LLM to re-read the question
05:18
Paper - Training Chain-of-Thought via Latent-Variable Inference - Audio Podcast
05:33
Paper - Jailbreaking Large Language Models with Symbolic Mathematics - Audio Podcast
05:18
Paper - "FINE-TUNING LARGE LANGUAGE MODELS FOR DOMAIN ADAPTATION" - Audio Podcast
06:04
Paper - "To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning" Podcast
05:59
Paper - "The Expressive Power of Transformers with Chain of Thought"- Audio Podcast
07:08
Kolmogorov–Arnold Transformer (KAT) - Replaces MLP layers with Kolmogorov-Arnold Network layers
06:11
Paper - "The ADEMAMIX Optimizer" from @Apple - Audio Podcast
07:47
Original Reflection-Tuning Paper from 2023 - Audio Podcast
05:55
Paper - "Masked Mixers For Language Generation And Retrieval" - Audio Podcast
09:24
Paper - "Can LLMs Generate Novel Research Ideas?" - Audio Podcast
07:12
πŸ“š Paper - "Hidden in Plain Sight: Exploring Chat History Tampering in Interactive Language Models"
06:08
πŸ“š Paper Podcast - "Chain of Thought Empowers Transformers to Solve Inherently Serial Problems"
06:38
PAPER - SciAgents: Automating scientific discovery through multi-agent intelligent graph reasoning
06:15
AI Paper - "Think before you speak: Training Language Models With Pause Tokens"- Audio Podcast
08:56
Apple last week gave us "Sigmoid Self-Attention" - Paper Podcast
07:16
Pron vs Prompt: Can Large Language Models already Challenge a World-Class Fiction Author - Podcast
05:08
AI PAPER - PingPong: A Benchmark for Role-Playing Language Models with User Emulation
12:26
CodeRabbit - Leverage AI to Cut Code Review Time & Bugs in Half
04:53
Landmark Paper - Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Training
07:06
PAPER - Planning In Natural Language Improves LLM Search For Code Generation- Audio Podcast
07:20
Landmark paper from GoogleDeepMind - Scaling LLM Test-Time Compute Optimally can be More Effective"
06:35
Ilya Sutskever's Paper "Let’s Verify Step by Step" - help you understand o1 Model πŸ“ from OpenAI
06:27
πŸ“š Paper "Can Large Language Models Unlock Novel Scientific Research Ideas?"- Audio Podcast