Center for AI Safety | Poke

Center for AI Safety @UCY_K5gXsXHtuiP8mj3BiWxA@youtube.com

1.6K subscribers - no pronouns :c

More from this channel (soon)

Videos Playlists

Recently Uploaded Popular Oldest

Lecture 8 | AI Safety, Ethics, & Society: Governance

Lecture 7 | AI Safety, Ethics, & Society: Collective Action Problems

Lecture 6 | AI Safety, Ethics, & Society: Beneficial AI and Machine Ethics

Lecture 5 | AI Safety, Ethics, & Society: Complex Systems

Lecture 4 | AI Safety, Ethics, & Society: Safety Engineering

Lecture 3 | AI Safety, Ethics, & Society: Single Agent Safety

Lecture 2 | AI Safety, Ethics, & Society: AI Fundamentals

Lecture 1 | AI Safety, Ethics, & Society: Introduction and Overview of AI Risks

Representation Engineering

An Overview of Catastrophic Risks

WAIC 2023: AI Risks and Safety Forum

Peter Railton - A World of Natural and Artificial Agents in a Shared Environment

Shelly Kagan - The Moral Claims of AI

Peter Salib - AI Rights as a Safety Technology

Ethan Perez - Discovering Language Model Behaviors with Model Written Evaluations

AI and Evolution

David Krueger: Existential Safety, Alignment, and Specification Problems

ML for Cyberdefense

Safety-Capabilities Balance

X-Risk Overview

Possible Existential Hazards

ML for Improved Decision-Making

Review and Conclusion

Detecting Emergent Behavior

Anomaly Detection

Interpretable Uncertainty

Black Swan Robustness

Adversarial Robustness

Deep Learning Review

Risk Decomposition

Accident Models