Anthropic researchers find that an AI model’s representations of emotion can influence its behavior “in ways that matter,” such as driving it to act unethically (Nat Rubio-Licht/The Deep View)
Nat Rubio-Licht / The Deep View:
Anthropic researchers find that an AI model’s representations of emotion can influence its behavior “in ways that matter,” such as driving it to act unethically — Can we teach machines to…