By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
PulseReporterPulseReporter
  • Home
  • Entertainment
  • Lifestyle
  • Money
  • Tech
  • Travel
  • Investigations
Reading: DeepSeek-R1-Lite-Preview AI reasoning mannequin beats OpenAI o1
Share
Notification Show More
Font ResizerAa
PulseReporterPulseReporter
Font ResizerAa
  • Home
  • Entertainment
  • Lifestyle
  • Money
  • Tech
  • Travel
  • Investigations
Have an existing account? Sign In
Follow US
  • Advertise
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
PulseReporter > Blog > Tech > DeepSeek-R1-Lite-Preview AI reasoning mannequin beats OpenAI o1
Tech

DeepSeek-R1-Lite-Preview AI reasoning mannequin beats OpenAI o1

Last updated: November 21, 2024 12:37 pm
6 months ago
Share
DeepSeek-R1-Lite-Preview AI reasoning mannequin beats OpenAI o1
SHARE

Be part of our every day and weekly newsletters for the newest updates and unique content material on industry-leading AI protection. Be taught Extra


DeepSeek, an AI offshoot of Chinese language quantitative hedge fund Excessive-Flyer Capital Administration centered on releasing high-performance open-source tech, has unveiled the R1-Lite-Preview, its newest reasoning-focused massive language mannequin (LLM), out there for now solely by DeepSeek Chat, its web-based AI chatbot.

Identified for its progressive contributions to the open-source AI ecosystem, DeepSeek’s new launch goals to convey high-level reasoning capabilities to the general public whereas sustaining its dedication to accessible and clear AI.

And the R1-Lite-Preview, regardless of solely being out there by the chat software for now, is already turning heads by providing efficiency nearing and in some circumstances exceeding OpenAI’s vaunted o1-preview mannequin.

Like that mannequin launched in Sept. 2024, DeepSeek-R1-Lite-Preview displays “chain-of-thought” reasoning, exhibiting the consumer the totally different chains or trains of “thought” it goes down to answer their queries and inputs, documenting the method by explaining what it’s doing and why.

Whereas among the chains/trains of ideas might seem nonsensical and even inaccurate to people, DeepSeek-R1-Lite-Preview seems on the entire to be strikingly correct, even answering “trick” questions which have tripped up different, older, but highly effective AI fashions reminiscent of GPT-4o and Claude’s Anthropic household, together with “what number of letter Rs are within the phrase Strawberry?” and “which is bigger, 9.11 or 9.9?” See screenshots beneath of my checks of those prompts on DeepSeek Chat:

DeepSeek-R1-Lite-Preview AI reasoning mannequin beats OpenAI o1

A brand new strategy to AI reasoning

DeepSeek-R1-Lite-Preview is designed to excel in duties requiring logical inference, mathematical reasoning, and real-time problem-solving.

In line with DeepSeek, the mannequin exceeds OpenAI o1-preview-level efficiency on established benchmarks reminiscent of AIME (American Invitational Arithmetic Examination) and MATH.

DeepSeek-R1-Lite-Preview benchmark outcomes posted on X.

Its reasoning capabilities are enhanced by its clear thought course of, permitting customers to observe alongside because the mannequin tackles complicated challenges step-by-step.

DeepSeek has additionally revealed scaling information, showcasing regular accuracy enhancements when the mannequin is given extra time or “thought tokens” to unravel issues. Efficiency graphs spotlight its proficiency in reaching larger scores on benchmarks reminiscent of AIME as thought depth will increase.

Benchmarks and Actual-World Purposes

DeepSeek-R1-Lite-Preview has carried out competitively on key benchmarks.

The corporate’s revealed outcomes spotlight its capability to deal with a variety of duties, from complicated arithmetic to logic-based situations, incomes efficiency scores that rival top-tier fashions in reasoning benchmarks like GPQA and Codeforces.

The transparency of its reasoning course of additional units it aside. Customers can observe the mannequin’s logical steps in actual time, including a component of accountability and belief that many proprietary AI programs lack.

Nonetheless, DeepSeek has not but launched the complete code for impartial third-party evaluation or benchmarking, nor has it but made DeepSeek-R1-Lite-Preview out there by an API that might enable the identical type of impartial checks.

As well as, the corporate has not but revealed a weblog submit nor a technical paper explaining how DeepSeek-R1-Lite-Preview was educated or architected, leaving many query marks about its underlying origins.

Accessibility and Open-Supply Plans

The R1-Lite-Preview is now accessible by DeepSeek Chat at chat.deepseek.com. Whereas free for public use, the mannequin’s superior “Deep Suppose” mode has a every day restrict of fifty messages, providing ample alternative for customers to expertise its capabilities.

Wanting forward, DeepSeek plans to launch open-source variations of its R1 collection fashions and associated APIs, in line with the corporate’s posts on X.

This transfer aligns with the corporate’s historical past of supporting the open-source AI neighborhood.

Its earlier launch, DeepSeek-V2.5, earned reward for combining basic language processing and superior coding capabilities, making it one of the crucial highly effective open-source AI fashions on the time.

Constructing on a Legacy

DeepSeek is continuous its custom of pushing boundaries in open-source AI. Earlier fashions like DeepSeek-V2.5 and DeepSeek Coder demonstrated spectacular capabilities throughout language and coding duties, with benchmarks putting it as a frontrunner within the subject.

The discharge of R1-Lite-Preview provides a brand new dimension, specializing in clear reasoning and scalability.

As companies and researchers discover functions for reasoning-intensive AI, DeepSeek’s dedication to openness ensures that its fashions stay an important useful resource for improvement and innovation.

By combining excessive efficiency, clear operations, and open-source accessibility, DeepSeek isn’t just advancing AI but additionally reshaping how it’s shared and used.

The R1-Lite-Preview is accessible now for public testing. Open-source fashions and APIs are anticipated to observe, additional solidifying DeepSeek’s place as a frontrunner in accessible, superior AI applied sciences.

VB Every day

Keep within the know! Get the newest information in your inbox every day

By subscribing, you comply with VentureBeat’s Phrases of Service.

Thanks for subscribing. Take a look at extra VB newsletters right here.

An error occured.


You Might Also Like

Narwal’s Freo X Extremely, the perfect mopping robotic obtainable, is on sale for a brand new low value

Greatest Apple deal: Save 10% on Apple equipment when buying and selling in a tool in-store

Alibaba’s ‘ZeroSearch’ lets AI be taught to google itself — slashing coaching prices by 88 %

Amazon Luna indicators multi-year cope with EA to deliver large video games to cloud gaming

ChatGPT unveils main redesign with new ‘Canvas’ interface for writers and coders

Share This Article
Facebook Twitter Email Print
Previous Article Black Friday airfare deal: Enterprise-class flights to France and Italy beginning at ,200 Black Friday airfare deal: Enterprise-class flights to France and Italy beginning at $2,200
Next Article Miley Cyrus Talks Relationship With Maxx Morando Miley Cyrus Talks Relationship With Maxx Morando
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Weekly Newsletter

Subscribe to our newsletter to get our newest articles instantly!

More News

Slash MTTP, block exploits: Ring deployment now important
Slash MTTP, block exploits: Ring deployment now important
14 minutes ago
Tracee Ellis Ross On Being Single And Baby-Free
Tracee Ellis Ross On Being Single And Baby-Free
50 minutes ago
House Depot Promo Codes & Coupons: 50% Off | Could 2025
House Depot Promo Codes & Coupons: 50% Off | Could 2025
1 hour ago
Swiss operating model On grew to become  billion richer within the final week. It’s coming for Nike and Adidas subsequent
Swiss operating model On grew to become $3 billion richer within the final week. It’s coming for Nike and Adidas subsequent
1 hour ago
Trump Admin Immigrant Competitors Present
Trump Admin Immigrant Competitors Present
2 hours ago

About Us

about us

PulseReporter connects with and influences 20 million readers globally, establishing us as the leading destination for cutting-edge insights in entertainment, lifestyle, money, tech, travel, and investigative journalism.

Categories

  • Entertainment
  • Investigations
  • Lifestyle
  • Money
  • Tech
  • Travel

Trending

  • Slash MTTP, block exploits: Ring deployment now important
  • Tracee Ellis Ross On Being Single And Baby-Free
  • House Depot Promo Codes & Coupons: 50% Off | Could 2025

Quick Links

  • About Us
  • Contact Us
  • Privacy Policy
  • Terms Of Service
  • Disclaimer
2024 © Pulse Reporter. All Rights Reserved.
Welcome Back!

Sign in to your account