OpenAI responds to DeepSeek competition with detailed reasoning traces for o3-mini

Pulse Reporter
Last updated: February 8, 2025 2:25 am


OpenAI is now showing more details of the reasoning process of o3-mini, its latest reasoning model. The change was announced on OpenAI’s X account and comes as the AI lab is under increased pressure from DeepSeek-R1, a rival open model that fully displays its reasoning tokens.

Models like o3 and R1 undergo a lengthy “chain of thought” (CoT) process in which they generate extra tokens to break down the problem, reason about and test different answers and reach a final solution. Previously, OpenAI’s reasoning models hid their chain of thought and only produced a high-level overview of reasoning steps. This made it difficult for users and developers to understand the model’s reasoning logic and change their instructions and prompts to steer it in the right direction.

OpenAI considered chain of thought a competitive advantage and hid it to prevent rivals from copying it to train their models. But with R1 and other open models showing their full reasoning trace, the lack of transparency becomes a disadvantage for OpenAI.

The new version of o3-mini shows a more detailed version of CoT. Although we still don’t see the raw tokens, it provides much more clarity on the reasoning process.

Why it matters for applications

In our previous experiments on o1 and R1, we found that o1 was slightly better at solving data analysis and reasoning problems. However, one of the key limitations was that there was no way to figure out why the model made mistakes, and it often made mistakes when faced with messy real-world data obtained from the web. On the other hand, R1’s chain of thought enabled us to troubleshoot the problems and change our prompts to improve reasoning.

For example, in one of our experiments, both models failed to provide the correct answer. But thanks to R1’s detailed chain of thought, we were able to find out that the problem was not with the model itself but with the retrieval stage that gathered information from the web. In other experiments, R1’s chain of thought was able to provide us with hints when it failed to parse the information we provided it, while o1 only gave us a very rough overview of how it was formulating its response.

We tested the new o3-mini model on a variant of a previous experiment we ran with o1. We provided the model with a text file containing prices of various stocks from January 2024 through January 2025. The file was noisy and unformatted, a mixture of plain text and HTML elements. We then asked the model to calculate the value of a portfolio that invested $140 in the Magnificent 7 stocks on the first day of each month from January 2024 to January 2025, distributed evenly across all stocks (we used the term “Mag 7” in the prompt to make it a bit more challenging).

o3-mini’s CoT was really helpful this time. First, the model reasoned about what the Mag 7 was, filtered the data to only keep the relevant stocks (to make the problem challenging, we added a few non-Mag 7 stocks to the data), calculated the monthly amount to invest in each stock, and made the final calculations to provide the correct answer (the portfolio would be worth around $2,200 at the latest time registered in the data we provided to the model).
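The calculation the model had to carry out can be sketched in a few lines of Python. This is only an illustration of the steps described above, not the code or data used in the experiment: the function name `portfolio_value`, the flat placeholder prices and the extra `NFLX` ticker are all hypothetical, and `GOOGL` is assumed as the Alphabet share class. The investment schedule is $140 a month split evenly across seven stocks ($20 each) for 13 months, or $1,820 in total.

```python
from collections import defaultdict

# The seven "Magnificent 7" tickers (GOOGL is an assumed share class).
MAG7 = ("AAPL", "MSFT", "GOOGL", "AMZN", "NVDA", "META", "TSLA")


def portfolio_value(monthly_prices, latest_prices, monthly_budget=140.0):
    """Value of an evenly split monthly investment at the latest prices.

    monthly_prices: one dict per month mapping ticker -> price on the
                    first day of that month (may contain other tickers).
    latest_prices:  dict mapping ticker -> most recent price.
    """
    per_stock = monthly_budget / len(MAG7)  # $20 per stock per month
    shares = defaultdict(float)
    for prices in monthly_prices:
        # Looping over MAG7 ignores any non-Mag 7 tickers in the data.
        for ticker in MAG7:
            shares[ticker] += per_stock / prices[ticker]
    return sum(shares[t] * latest_prices[t] for t in MAG7)


# Placeholder data: every stock costs $100 in all 13 months
# (January 2024 through January 2025), plus one distractor ticker.
flat = {ticker: 100.0 for ticker in MAG7}
flat["NFLX"] = 500.0  # hypothetical non-Mag 7 stock, should be ignored
months = [dict(flat) for _ in range(13)]

value = portfolio_value(months, flat)
print(f"${value:,.2f}")  # with flat prices this equals the amount invested
```

With flat prices the result simply equals the $1,820 invested; the roughly $2,200 reported in the article reflects the real price movements in the file given to the model.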

It will take a lot more testing to see the limits of the new chain of thought, since OpenAI is still hiding a lot of details. But in our vibe checks, it seems that the new format is much more useful.

What it means for OpenAI

When DeepSeek-R1 was released, it had three clear advantages over OpenAI’s reasoning models: It was open, cheap and transparent.

Since then, OpenAI has managed to shorten the gap. While o1 costs $60 per million output tokens, o3-mini costs just $4.40, while outperforming o1 on many reasoning benchmarks. R1 costs between $7 and $8 per million tokens on U.S. providers. (DeepSeek offers R1 at $2.19 per million tokens on its own servers, but many organizations will not be able to use it because it is hosted in China.)
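To put those prices in perspective, here is a rough sketch of what a workload would cost at the per-million-token rates quoted above. The 250,000-token workload and the `output_cost` helper are hypothetical, and treating the $2.19 R1 figure as an output-token price is an assumption, since the article only says "per million tokens".

```python
# Prices in USD per million tokens, as quoted in the article.
PRICE_PER_MILLION = {
    "o1": 60.00,
    "o3-mini": 4.40,
    "R1 (DeepSeek servers)": 2.19,  # assumed to apply to output tokens
}


def output_cost(model, output_tokens):
    """Cost in USD of generating `output_tokens` tokens with `model`."""
    return PRICE_PER_MILLION[model] * output_tokens / 1_000_000


tokens = 250_000  # hypothetical workload size
for model in PRICE_PER_MILLION:
    print(f"{model}: ${output_cost(model, tokens):.2f}")

# How many times cheaper o3-mini is than o1 at these rates.
ratio = PRICE_PER_MILLION["o1"] / PRICE_PER_MILLION["o3-mini"]
print(f"o1 / o3-mini price ratio: {ratio:.1f}x")
```

At these rates o3-mini output is between 13 and 14 times cheaper than o1 output.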

With the new change to the CoT output, OpenAI has managed to somewhat work around the transparency problem.

It remains to be seen what OpenAI will do about open sourcing its models. Since its release, R1 has already been adapted, forked and hosted by many different labs and companies, potentially making it the preferred reasoning model for enterprises. OpenAI CEO Sam Altman recently admitted that he was “on the wrong side of history” in the open source debate. We’ll have to see how this realization will show itself in OpenAI’s future releases.
