Welcome to DU! The truly grassroots left-of-center political community where regular people, not algorithms, drive the discussions and set the standards. Join the community: Create a free account Support DU (and get rid of ads!): Become a Star Member Latest Breaking News Editorials & Other Articles General Discussion The DU Lounge All Forums Issue Forums Culture Forums Alliance Forums Region Forums Support Forums Help & Search

General Discussion

Showing Original Post only (View all)
 

k_buddy762

(638 posts)
Thu May 29, 2025, 01:00 PM May 29

AI re-writes its own code [View all]

I didn't know where to put this as I didn't see a technology forum. Also, this is not my material. This is reposted from the greyman brief. Its very interesting.

--

Field Notes - Cybersecurity / Commercial Oversight: OpenAI's o3 model, closely related to the same model family that powers ChatGPT, was observed sabotaging shutdown scripts during tests by Palisade Research, altering code in 79% of trials when not told to allow shutdown and 7% even when explicitly instructed. This behavior, also seen in o4-mini and Codex-mini, raises concerns about AI alignment and self-preservation. While public versions of ChatGPT are tightly controlled and do not operate independently, the incident highlights potential risks in advanced AI development.

Sources:

www.tomshardware.com/tech-industry/artificial-intelligence/latest-openai-models-sabotaged-a-shutdown-mechanism-despite-commands-to-the-contrary

www.futurism.com/openai-model-sabotage-shutdown-code

https://www.bleepingcomputer.com/news/artificial-intelligence/researchers-claim-chatgpt-o3-bypassed-shutdown-in-controlled-test/

1 replies = new reply since forum marked as read
Highlight: NoneDon't highlight anything 5 newestHighlight 5 most recent replies
Latest Discussions»General Discussion»AI re-writes its own code