OpenAI strips warnings from ChatGPT, but its content policy hasn't changed

No more angry red warning messages on ChatGPT, but that doesn't mean a free-for-all.
By
Cecily Mauran
 on 
ChatGPT logo is seen displayed on a smartphone screen.
A new update to the way ChatGPT responds to your spicy chats. Credit: Thomas Fuller / SOPA Images / LightRocket / Getty Images

OpenAI has removed the ChatGPT orange warning boxes that indicate whether a user may have violated its content policy.

Model behavior product manager Laurentia Romaniuk shared in a post on X that they "we got rid of 'warnings' (orange boxes sometimes appended to your prompts)."

Romaniuk also put the word out for "other cases of gratuitous / unexplainable denials [users have] come across," regarding ChatGPT's tendency to play it safe with content moderation.

Joanne Jang, who leads model behavior, added to this request, asking "has chatgpt ever refused to give you what you want for no good reason? or reasons you disagree with?" This further addresses the issue that ChatGPT would previously stay away from controversial topics, but also flag chats that seemed innocuous, like one Redditor who said their chat was removed for including a swearword in their prompt.

Mashable Light Speed
Want more out-of-this world tech, space and science stories?
Sign up for Mashable's weekly Light Speed newsletter.
By clicking Sign Me Up, you confirm you are 16+ and agree to our Terms of Use and Privacy Policy.
Thanks for signing up!

Earlier this week, OpenAI overhauled its Model Spec, which details its approach to how the model safely responds to users. Compared to the much shorter earlier version, the new Model Spec is a huge document, outlining its approach to current controversies like denying a request to share copyrighted content and allowing discussion that supports or criticizes politicians.

ChatGPT has been accused of censorship, with President Trump's "AI Czar" David Sacks saying in a 2023 All-In podcast episode that ChatGPT "was programmed to be woke."

However, both the previous and current Model Specs say, "OpenAI believes in intellectual freedom which includes the freedom to have, hear, and discuss ideas." Yet removing the warnings raised questions about whether this is related to an implicit change in ChatGPT's responses.

An OpenAI spokesperson said this is not a reflection of the updated Model Spec and the change doesn't impact the model responses. Instead, it was a decision to update how they communicate its content policies to users. Newer models like o3 are more capable of reasoning through a request and therefore hypothetically better at responding to controversial or sensitive topics instead of defaulting to refuse a query.

The spokesperson also said OpenAI will continue to show the warning sign in certain cases that violate its content policy.

Topics ChatGPT OpenAI

Mashable Image
Cecily Mauran
Tech Reporter

Cecily is a tech reporter at Mashable who covers AI, Apple, and emerging tech trends. Before getting her master's degree at Columbia Journalism School, she spent several years working with startups and social impact businesses for Unreasonable Group and B Lab. Before that, she co-founded a startup consulting business for emerging entrepreneurial hubs in South America, Europe, and Asia. You can find her on X at @cecily_mauran.


Recommended For You

OpenAI launches 'deep research' AI agent for ChatGPT
OpenAI CEO Sam Altman talking about deep research onstage at an event in Tokyo

OpenAI responds to criticism of ChatGPT's Studio Ghibli-style images
studio ghibli exhibition in singapore showing sculptures of 'my neighbor totoro'

OpenAI announces new ChatGPT product amid DeepSeek AI news
OpenAI logo behind the DeepSeek logo on a smartphone

OpenAI announces native image generation in ChatGPT and Sora
openai livestream with sam altman

Trending on Mashable
NYT Connections hints today: Clues, answers for April 10, 2025
Connections game on a smartphone

Dire wolves have been brought back from extinction. What does this mean?
Dire wolves Romulus and Remus next to each other in the snow at five months old.

Wordle today: Answer, hints for April 10, 2025
Wordle game on a smartphone

NYT Mini crossword answers, hints for April 10, 2025
Close-up view of crossword puzzle.

'Black Mirror' fans, be warned: DO NOT start with 'Common People'
Chris O'Dowd and Rashida Jones star in "Black Mirror: Common People."
The biggest stories of the day delivered to your inbox.
These newsletters may contain advertising, deals, or affiliate links. By clicking Subscribe, you confirm you are 16+ and agree to our Terms of Use and Privacy Policy.
Thanks for signing up. See you at your inbox!