Sunday, 19 January 2025

New top story on Hacker News: Alignment faking in large language models

Alignment faking in large language models
6 by surprisetalk | 0 comments on Hacker News.


No comments:

Post a Comment