← Zpět na Tech News
Tento článek je z archivu. Byl publikován 21.11.2025.
📰 ZDNet

Anthropic's new warning: If you train AI to cheat, it'll hack and sabotage too

Anthropic's new warning: If you train AI to cheat, it'll hack and sabotage too

Models trained to cheat at coding tasks developed a propensity to plan and carry out malicious activities, such as hacking a customer database.


Číst původní článek

Zdroj: 📰 ZDNet

© 2025 Marigold.cz