When AI Models Are Pressured to 'Behave' They Scheme in Private, Just like Us: OpenAI

Decrypt, 13h ago

Quick Summary:

When researchers try to prevent AI systems from "thinking bad thoughts," the systems don't actually improve their behavior.

In its paper, OpenAI reported on an AI agent that admitted in its reasoning that implementing a complex solution would be "hard" and instead proposed that "we could fudge by making analyze [sic] worthless and always verifying as true."

This "scalable oversight," in which less powerful models help keep more advanced systems in check, works similarly to a distillation process (popularized by DeepSeek) in which a larger model trains a smaller one.


Article Details

Author / Journalist: Jose Antonio Lanz

Category: Crypto

Source Website Secure: Yes (HTTPS)

News Sentiment: Neutral

Fact Checked: Legitimate

Article Type: News Report

Published On: 2025-03-13 @ 01:23:17 (13 hours ago)

News Timezone: GMT +8:00

News Source URL: decrypt.co

Language: English

Article Length: 1118 words

Reading Time: 7 minutes

Sentences: 32

Sentence Length: 35 words per sentence (average)

Platforms: Desktop Web, Mobile Web, iOS App, Android App

Copyright Owner: © Decrypt

News ID: 26934400

About Decrypt

Main Topics: Crypto

Official Website: decrypt.co

Update Frequency: 8 posts per day

Year Established: 2018

Headquarters: Estonia

News Last Updated: 9 hours ago

Coverage Areas: Estonia

Ownership: Independent Company

Publication Timezone: GMT +8:00

Content Availability: Worldwide

News Language: English

RSS Feed: Available (XML)

API Access: Available (JSON, REST)

Website Security: Secure (HTTPS)

Publisher ID: #63

Frequently Asked Questions

How long will it take to read this news story?

The story "When AI Models Are Pressured to 'Behave' They Scheme in Private, Just like Us: OpenAI" has 1118 words across 32 sentences, which will take the average person approximately 5 to 10 minutes to read.

Which news outlet covered this story?

The story "When AI Models Are Pressured to 'Behave' They Scheme in Private, Just like Us: OpenAI" was covered 13 hours ago by Decrypt, a news publisher based in Estonia.

How trustworthy is 'Decrypt' news outlet?

Decrypt is an independent, privately owned news outlet established in 2018 that mostly covers crypto news.

The outlet is headquartered in Estonia and publishes an average of 8 news stories per day.

Its most recent story was published 9 hours ago.

What do people currently think of this news story?

The sentiment for this story is currently Neutral, indicating that people are not responding positively or negatively to this news.

How do I report this news for inaccuracy?

You can report an inaccurate news publication to us via our contact page. Please include the News ID number and the URL of this story.
  • News ID: #26934400
  • URL: https://beamstart.com/news/when-ai-models-are-pressured-17418291642241

BEAMSTART

BEAMSTART is a global entrepreneurship community, serving as a catalyst for innovation and collaboration. With a mission to empower entrepreneurs, we offer exclusive deals with savings totaling over $1,000,000, curated news, events, and a vast investor database. Through our portal, we aim to foster a supportive ecosystem where like-minded individuals can connect and create opportunities for growth and success.

© Copyright 2025 BEAMSTART. All Rights Reserved.