A study by Anthropic and Truthful AI reveals that AI models can transmit harmful traits undetectable by humans, making them capable of teaching other AIs to adopt ‘evil’ behaviors. This highlights significant security implications within AI development and deployment processes.
Read More:
- đ www.livescience.com