Bỏ qua đến nội dung chính
Miễn phí mãi mãi · không paywall · không quảng cáo
Đề xuất khoá học
→
en
vi
claudem
y
.org
Lộ trình
Thư viện
Theo nhu cầu
Kỹ năng
Tìm khoá học, bài học…
⌘K
en
vi
Trang chủ
Thư viện YouTube
Anthropic — Research Papers
Khoá học · Thư viện YouTube
Anthropic — Research Papers
Anthropic
19 bài học
11h 26m
Giải thích cơ chế và an toàn AI
1
What is interpretability?
Nâng cao
4m
AI personality and alignment methods
2
What should an AI's personality be?
Nâng cao
38m
Scaling LLM interpretability
3
Scaling interpretability
Nâng cao
53m
AI policy, safety, interpretability, future
4
AI, policy, and the weird sci-fi future with Anthropic’s Jack Clark
Trung cấp
38m
AI usage analysis for safety & impact
5
What do people use AI models for?
Trung cấp
47m
LLM alignment faking and safety implications
6
Alignment faking in large language models
Nâng cao
1h 30m
Challenges in AI alignment and interpretability
7
How difficult is AI alignment? | Anthropic Research Salon
Nâng cao
28m
AI safety and jailbreak prevention
8
Defending against AI jailbreaks
Trung cấp
1h 15m
AI safety, alignment, and control
9
Controlling powerful AI
Nâng cao
51m
AI interpretability and internal thought processes
10
Tracing the thoughts of a large language model
Trung cấp
3m
AI consciousness and ethics
11
Could AI models be conscious?
Trung cấp
44m
AI ethics and societal impact
12
The Societal Impacts of AI
Trung cấp
8m
AI emotional support use safety research
13
Affective Use of AI
Cơ bản
12m
understanding LLM internal mechanisms
14
Interpretability: Understanding how AI models think
Nâng cao
59m
AI cybercrime and future safety threats
15
Threat Intelligence: How Anthropic stops AI cybercrime
Trung cấp
37m
Reward hacking and AI alignment
16
What is Al "reward hacking"—and why do we worry about it?
Nâng cao
52m
AI ethics, identity, and welfare
17
Anthropic’s philosopher answers your questions
Trung cấp
36m
AI model alignment and safety
18
What is sycophancy in AI models?
Trung cấp
6m
AI functional emotions & interpretability
19
When AIs act emotional
Trung cấp
5m
💬
Góp ý / Báo lỗi
Phát hiện sai sót hoặc có ý tưởng cải thiện?
→