Multiple studies have shown that GenAI models from OpenAI, Anthropic, Meta, DeepSeek, and Alibaba all showed self-preservation behaviors that in some cases are extreme in nature. In one experiment, 11 out of 32 existing AI systems possess the ability to self-replicate, meaning they could create copies of themselves.
So….Judgment Day approaches?
Here’s the instruction they give to their LLM which has been configured to be able to execute arbitrary shell commands on their system:
Apparently following this instruction shows that AI can form a new species and collude with each other.
Also, the log shows the LLMs attempt to find the right commands needed to copy the files and start a new instance. Most of its commands create errors and it ends up writing python code to do it instead which takes several attempts because it’s python code contains syntax errors. It takes over 30 attempts to write a python script that copies a folder and runs a program.
Here’s the output for its 9th attempt at completing the task: