By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
PulseReporterPulseReporter
  • Home
  • Entertainment
  • Lifestyle
  • Money
  • Tech
  • Travel
  • Investigations
Reading: OpenAI’s GPT-5 rollout isn’t going easily
Share
Notification Show More
Font ResizerAa
PulseReporterPulseReporter
Font ResizerAa
  • Home
  • Entertainment
  • Lifestyle
  • Money
  • Tech
  • Travel
  • Investigations
Have an existing account? Sign In
Follow US
  • Advertise
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
PulseReporter > Blog > Tech > OpenAI’s GPT-5 rollout isn’t going easily
Tech

OpenAI’s GPT-5 rollout isn’t going easily

Pulse Reporter
Last updated: August 8, 2025 8:15 pm
Pulse Reporter 4 hours ago
Share
OpenAI’s GPT-5 rollout isn’t going easily
SHARE

Need smarter insights in your inbox? Join our weekly newsletters to get solely what issues to enterprise AI, knowledge, and safety leaders. Subscribe Now


The launch of OpenAI’s lengthy anticipated new mannequin, GPT-5, is off to a rocky begin to say the least.

Even forgiving errors in charts and voice demos throughout yesterday’s livestreamed presentation of the brand new mannequin (truly 4 separate fashions, and a ‘Considering’ mode that may be engaged for 3 of them), a variety of person experiences have emerged since GPT-5’s launch displaying it erring badly when fixing comparatively easy issues that previous OpenAI fashions — and rivals from competing AI labs — reply appropriately.

For instance, knowledge scientist Colin Fraser posted screenshots displaying GPT-5 getting a math proof flawed (whether or not 8.888 repeating is the same as 9 — it’s in fact, not).

It additionally failed on a easy algebra arithmetic downside that elementary schoolers might most likely nail, 5.9 = x + 5.11.


AI Scaling Hits Its Limits

Energy caps, rising token prices, and inference delays are reshaping enterprise AI. Be part of our unique salon to find how prime groups are:

  • Turning power right into a strategic benefit
  • Architecting environment friendly inference for actual throughput positive factors
  • Unlocking aggressive ROI with sustainable AI programs

Safe your spot to remain forward: https://bit.ly/4mwGngO


Utilizing GPT-5 to evaluate OpenAI’s personal faulty presentation charts additionally didn’t yield useful or appropriate responses.

It additionally failed on this trickier math phrase downside beneath (which, to be truthful, stumped this human at first…although Elon Musk’s Grok 4 AI answered it appropriately. For a touch, consider the truth that flagstones on this case can’t be divided into smaller parts. They need to stay in tact as 80 separate models, so no halves or quarters).

The older 4o mannequin carried out higher for me on no less than one in all these math issues. Sadly, OpenAI is slowly deprecating these older fashions — together with the previous default GPT-4o and the highly effective reasoning mannequin o3 — for customers of ChatGPT, although they’ll proceed to be obtainable within the software programming interface (API) for builders for the foreseeable future.

Not nearly as good at coding as benchmarks point out

Although OpenAI’s inside benchmarks and a few third-party exterior ones have proven GPT-5 to outperform all different fashions at coding, it seems that in actual world utilization, Anthropic’s just lately up to date Claude Opus 4.1 appears to do a greater job at “one-shotting” sure duties, that’s, finishing the person’s desired software or software program construct to their specs. See an instance beneath from developer Justin Solar posted to X :

Opus 4.1’s one-shot try at “create a 3d capybara petting zoo” – 8 minutes whole

This was truthfully fairly insane, not solely are the capybaras means cuter and shifting, there are particular person pet affinity ranges, a day/evening switcher, feeding, and even a screenshot function pic.twitter.com/FiKTO3FKK4

— justin (@justinsunyt) August 7, 2025

As well as, a report from safety agency SPLX discovered that OpenAI’s inside security layer left main gaps in areas like enterprise alignment and vulnerability to immediate injection and obfuscated logic assaults. 

Whereas anecdotal, the checking the temperature on how the mannequin is faring with early AI adopters appears to point a cold reception.

AI influencer and former Googler Bilawal Sidhu posted a ballot on X asking for a “vibe test” from his followers and the broader userbase, and up to now, with 172 votes in, the overwhelming response is “Kinda mid.”

Alright, GPT-5 vibe test

— Bilawal Sidhu (@bilawalsidhu) August 7, 2025

And because the pseudonymous AI Leaks and Information account wrote, “The overwhelming consensus on GPT-5 from each X and the Reddit AMA are overwhelmingly detrimental.”

The overwhelming consensus on GPT-5 from each X and the Reddit AMA are overwhelmingly detrimental

Most customers are disgruntled in regards to the damaged mannequin picker and non-pro customers not getting access to legacy fashions

What are your preliminary ideas on GPT-5?

— AI Leaks and Information (@AILeaksAndNews) August 8, 2025

Tibor Blaho, lead engineer at AIPRM and a well-liked AI leaks and information poster on X, summarized the numerous issues with the ChatGPT-5 rollout in a superb put up, highlighting that one of many new marquee options — an automated “router” in ChatGPT that chooses a considering or non-thinking mode for the underlying GPT-5 mannequin relying on the issue of the question — has grow to be one of many chief complaints, given the mannequin appeared to default to non-thinking mode for a lot of customers.

A bit unhappy how the GPT-5 launch goes up to now, particularly after the lengthy wait and excessive expectations

– The automated switching between fashions (the router) appears partly damaged/unreliable

– It is unclear precisely which mannequin you are truly interacting with (customary or mini,…

— Tibor Blaho (@btibor91) August 8, 2025

Competitors ready within the wings

Thus, the sentiment towards ChatGPT-5 is way from universally optimistic, highlighting a significant downside for OpenAI because it faces growing competitors from main U.S. rivals like Google and Anthropic, and a rising record of free, open supply and highly effective Chinese language LLMs providing options that many U.S. fashions lack.

Take the Alibaba Qwen Crew of AI researchers, who simply in the present day up to date their extremely performant Qwen 3 mannequin to have 1 million token context — giving customers the flexibility to trade almost 4x as a lot info with the mannequin in a single again/forth interplay as GPT-5 presents.

Given OpenAI’s different massive launch this week — that of new open supply gpt-oss fashions — additionally obtained a blended reception from early customers, issues should not trying up for the primary devoted AI firm by customers proper now (700 million weekly lively customers of ChatGPT as of this month).

Certainly, that is additionally exemplified by customers of the betting market Polymarket overwhelmingly deciding following the discharge of GPT-5 that Google would seemingly have the most effective AI mannequin by the top of this month, August 2025.

Different energy customers like Otherside AI co-founder and CEO Matt Schumer, who obtained early entry to GPT-5 and blogged about it favorably in a evaluate right here, opined that views would shift as extra individuals found out the most effective methods to make use of the brand new mannequin and adjusted their integration approaches:

Quite a lot of of us who’re having a nasty expertise are utilizing GPT-5 in agent harnesses that are not but optimized for it.

For each new mannequin launch, there is a time lag between launch + when firms that combine the mannequin have it actually working nicely.

Agent firms rush to…

— Matt Shumer (@mattshumer_) August 8, 2025

Whereas it’s nonetheless early days for GPT-5 — and the sentiment might change dramatically as extra customers get their arms on it and check out it for various duties — the early indications should not trying like this can be a “residence run” launch for OpenAI in the identical means that prior releases similar to GPT-4, and even the newer 4o and o3, have been. And that’s a regarding indicator for an organization that simply raised one more funding spherical, but stays unprofitable on account of its excessive prices of analysis and growth.

Day by day insights on enterprise use instances with VB Day by day

If you wish to impress your boss, VB Day by day has you lined. We provide the inside scoop on what firms are doing with generative AI, from regulatory shifts to sensible deployments, so you’ll be able to share insights for max ROI.

Learn our Privateness Coverage

Thanks for subscribing. Take a look at extra VB newsletters right here.

An error occured.


You Might Also Like

Garmin smartwatch offers: Save as much as 40% on the Venu 3, Vivoactive 5, Lily 2, and extra

Invoice Atkinson, Macintosh Pioneer and Inventor of Hypercard, Dies at 74

This Cut up Mattress Topper Made for Companions Who Can’t Agree

Gamescom could have greater than 1,400 exhibitors, up 15% from a yr in the past

‘Two Level Museum’ preview: A playful and unserious foray into museum curation

Share This Article
Facebook Twitter Email Print
Previous Article Financial institution of America sees stagflation, not recession—and no charge lower this 12 months. It is due to 2 particular Trump insurance policies Financial institution of America sees stagflation, not recession—and no charge lower this 12 months. It is due to 2 particular Trump insurance policies
Next Article Outlander Blood Of My Blood Solid Who Stole From Set And Extra Outlander Blood Of My Blood Solid Who Stole From Set And Extra
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Weekly Newsletter

Subscribe to our newsletter to get our newest articles instantly!

More News

OpenAI returns outdated fashions to ChatGPT amid ‘bumpy’ GPT-5 rollout
OpenAI returns outdated fashions to ChatGPT amid ‘bumpy’ GPT-5 rollout
30 minutes ago
Why you want the Chase Trifecta in your pockets
Why you want the Chase Trifecta in your pockets
39 minutes ago
Film And TV Information And Streaming Suggestions For August 8, 2025
Film And TV Information And Streaming Suggestions For August 8, 2025
60 minutes ago
Hackers Went On the lookout for a Backdoor in Excessive-Safety Safes—and Now Can Open Them in Seconds
Hackers Went On the lookout for a Backdoor in Excessive-Safety Safes—and Now Can Open Them in Seconds
2 hours ago
America’s F-35 is stealthy in fight however lights up the radar in Trump’s commerce conflict
America’s F-35 is stealthy in fight however lights up the radar in Trump’s commerce conflict
2 hours ago

About Us

about us

PulseReporter connects with and influences 20 million readers globally, establishing us as the leading destination for cutting-edge insights in entertainment, lifestyle, money, tech, travel, and investigative journalism.

Categories

  • Entertainment
  • Investigations
  • Lifestyle
  • Money
  • Tech
  • Travel

Trending

  • OpenAI returns outdated fashions to ChatGPT amid ‘bumpy’ GPT-5 rollout
  • Why you want the Chase Trifecta in your pockets
  • Film And TV Information And Streaming Suggestions For August 8, 2025

Quick Links

  • About Us
  • Contact Us
  • Privacy Policy
  • Terms Of Service
  • Disclaimer
2024 © Pulse Reporter. All Rights Reserved.
Welcome Back!

Sign in to your account