Reinforcement Learning
4 videos • 28 views • by Tunadorable
1
What is Direct Preference Optimization?
Tunadorable
2
Making AI/ML Robust to Unpredictable Events
Tunadorable
3
Cultural Accumulation in Reinforcement Learning
Tunadorable
4
Trade-off between world modeling (predicting) vs agent modeling (acting)
Tunadorable