Contextual AI’s new AI model crushes GPT-4o in accuracy: here’s why it matters

Pulse Reporter
Last updated: March 5, 2025 8:00 am


Contextual AI unveiled its grounded language model (GLM) today, claiming it delivers the highest factual accuracy in the industry by outperforming leading AI systems from Google, Anthropic and OpenAI on a key benchmark for truthfulness.

The startup, founded by the pioneers of retrieval-augmented generation (RAG) technology, reported that its GLM achieved an 88% factuality score on the FACTS benchmark, compared to 84.6% for Google’s Gemini 2.0 Flash, 79.4% for Anthropic’s Claude 3.5 Sonnet and 78.8% for OpenAI’s GPT-4o.
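To make the size of the reported lead concrete, the short sketch below computes the percentage-point gaps between the GLM and each competitor. The scores are the ones quoted above; the variable and function names are illustrative only.

```python
# Factuality scores on the FACTS benchmark as reported in the article (percent).
scores = {
    "Contextual AI GLM": 88.0,
    "Gemini 2.0 Flash": 84.6,
    "Claude 3.5 Sonnet": 79.4,
    "GPT-4o": 78.8,
}


def margins_over(baseline, table):
    """Return percentage-point gaps between `baseline` and every other entry."""
    base = table[baseline]
    return {
        name: round(base - value, 1)
        for name, value in table.items()
        if name != baseline
    }


glm_margins = margins_over("Contextual AI GLM", scores)
for name, gap in glm_margins.items():
    print(f"GLM leads {name} by {gap} percentage points")
```

By these figures the gap to the closest competitor is 3.4 points, and the gap to GPT-4o is 9.2 points.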

While large language models have transformed enterprise software, factual inaccuracies, often called hallucinations, remain a critical challenge for business adoption. Contextual AI aims to solve this by creating a model specifically optimized for enterprise RAG applications where accuracy is paramount.

“We knew that part of the solution would be a technique called RAG, retrieval-augmented generation,” said Douwe Kiela, CEO and cofounder of Contextual AI, in an exclusive interview with VentureBeat. “And we knew that because RAG is originally my idea. What this company is about is really about doing RAG the right way, to kind of the next level of doing RAG.”

The company’s focus differs significantly from general-purpose models like ChatGPT or Claude, which are designed to handle everything from creative writing to technical documentation. Contextual AI instead targets high-stakes enterprise environments where factual precision outweighs creative flexibility.

“If you have a RAG problem and you’re in an enterprise setting in a highly regulated industry, you have no tolerance whatsoever for hallucination,” explained Kiela. “The same general-purpose language model that is useful for the marketing department is not what you want in an enterprise setting where you are much more sensitive to mistakes.”

A benchmark comparison showing Contextual AI’s new grounded language model (GLM) outperforming competitors from Google, Anthropic and OpenAI on factual accuracy tests. The company claims its specialized approach reduces AI hallucinations in enterprise settings. (Credit: Contextual AI)

How Contextual AI makes ‘groundedness’ the new gold standard for enterprise language models

The concept of “groundedness,” meaning that AI responses stick strictly to information explicitly provided in the context, has emerged as a critical requirement for enterprise AI systems. In regulated industries like finance, healthcare and telecommunications, companies need AI that either delivers accurate information or explicitly acknowledges when it doesn’t know something.

Kiela offered an example of how this strict groundedness works: “If you give a recipe or a formula to a standard language model, and somewhere in it, you say, ‘but this is only true for most cases,’ most language models are still just going to give you the recipe assuming it’s true. But our language model says, ‘Actually, it only says that this is true for most cases.’ It’s capturing this additional bit of nuance.”

The ability to say “I don’t know” is a crucial one for enterprise settings. “Which is really a very powerful feature, if you think about it in an enterprise setting,” Kiela added.
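The idea of answering only from the provided context, and abstaining otherwise, can be illustrated with a toy check. The sketch below is not Contextual AI’s method. It assumes a crude lexical proxy (the share of an answer’s content words that appear in the context), and the names `support_ratio`, `answer_or_abstain` and the 0.8 threshold are hypothetical.

```python
import re

# A small, illustrative stopword list; a real system would not rely on this.
STOPWORDS = {"the", "a", "an", "is", "are", "of", "to", "in", "and", "for", "it", "this", "that"}


def content_words(text):
    """Lowercase the text and keep alphanumeric tokens that are not stopwords."""
    return {w for w in re.findall(r"[a-z0-9]+", text.lower()) if w not in STOPWORDS}


def support_ratio(claim, context):
    """Fraction of the claim's content words that also appear in the context."""
    claim_words = content_words(claim)
    if not claim_words:
        return 0.0
    return len(claim_words & content_words(context)) / len(claim_words)


def answer_or_abstain(candidate, context, threshold=0.8):
    """Return the candidate only if it is lexically supported; otherwise abstain."""
    if support_ratio(candidate, context) >= threshold:
        return candidate
    return "I don't know based on the provided context."


context = "The formula applies to most cases. Refunds are processed within 14 days."
supported = answer_or_abstain("Refunds are processed within 14 days.", context)
unsupported = answer_or_abstain("Refunds are processed within 3 days by courier.", context)
print(supported)
print(unsupported)
```

Word overlap is only a stand-in here: it cannot catch a dropped qualifier such as “only true for most cases,” which is the kind of nuance Kiela describes and which would require a model-based check.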

Contextual AI’s RAG 2.0: A more integrated way to process company information

Contextual AI’s platform is built on what it calls “RAG 2.0,” an approach that moves beyond simply connecting off-the-shelf components.

“A typical RAG system uses a frozen off-the-shelf model for embeddings, a vector database for retrieval, and a black-box language model for generation, stitched together through prompting or an orchestration framework,” according to a company statement. “This leads to a ‘Frankenstein’s monster’ of generative AI: the individual components technically work, but the whole is far from optimal.”
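The “typical RAG system” described in the statement can be sketched in a few lines. This is a minimal illustration under stated assumptions, not any vendor’s implementation: `embed` is a bag-of-words stand-in for a frozen embedding model, `VectorStore` is a brute-force stand-in for a vector database, and `generate` is a stub in place of a black-box language model. All names are hypothetical.

```python
import math
import re
from collections import Counter


def embed(text):
    """Stand-in for a frozen embedding model: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


class VectorStore:
    """Stand-in for a vector database: brute-force search over stored vectors."""

    def __init__(self, documents):
        self.items = [(doc, embed(doc)) for doc in documents]

    def search(self, query, k=2):
        q = embed(query)
        ranked = sorted(self.items, key=lambda item: cosine(q, item[1]), reverse=True)
        return [doc for doc, _ in ranked[:k]]


def generate(prompt):
    """Stand-in for a black-box language model; it only reports the prompt size."""
    return f"[model output for a prompt of {len(prompt)} characters]"


def answer(question, store):
    """Stitch the three parts together with a prompt template."""
    passages = store.search(question)
    prompt = "Context:\n" + "\n".join(passages) + f"\n\nQuestion: {question}\nAnswer:"
    return passages, generate(prompt)


store = VectorStore([
    "Refunds are processed within 14 days of the request.",
    "The office is closed on public holidays.",
    "Shipping takes three to five business days.",
])
passages, output = answer("How long do refunds take?", store)
print(passages[0])
print(output)
```

Each stage here is built and tuned separately, and nothing in the retriever learns from what the generator needs. That separation is what the statement calls the “Frankenstein’s monster” problem.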

Instead, Contextual AI jointly optimizes all components of the system. “We have this mixture-of-retrievers component, which is really a way to do intelligent retrieval,” Kiela explained. “It looks at the question, and then it thinks, essentially, like most of the latest generation of models, it thinks, [and] first it plans a strategy for doing a retrieval.”

This entire system works in coordination with what Kiela calls “the best re-ranker in the world,” which helps prioritize the most relevant information before sending it to the grounded language model.
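The article gives no implementation details for the mixture-of-retrievers or the re-ranker, so the following is only a rough sketch of the general plan, retrieve, then re-rank pattern. Every function here (`plan`, `keyword_retriever`, `numeric_retriever`, `rerank`) is a hypothetical toy, and the cue-word planning rule and Jaccard scoring are assumptions chosen for brevity.

```python
import re


def tokens(text):
    """Lowercased alphanumeric tokens as a set."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def keyword_retriever(query, corpus):
    """Return documents sharing at least one token with the query."""
    q = tokens(query)
    return [doc for doc in corpus if q & tokens(doc)]


def numeric_retriever(query, corpus):
    """Return documents that contain a digit, a crude proxy for figures and records."""
    return [doc for doc in corpus if re.search(r"\d", doc)]


def plan(query):
    """Hypothetical planning step: choose retrievers from surface cues in the query."""
    chosen = [keyword_retriever]
    if re.search(r"\b(how many|how much|how long|total|number)\b", query.lower()):
        chosen.append(numeric_retriever)
    return chosen


def rerank(query, candidates, top_n=2):
    """Toy re-ranker: order candidates by Jaccard overlap with the query."""
    q = tokens(query)

    def score(doc):
        d = tokens(doc)
        return len(q & d) / len(q | d)

    return sorted(candidates, key=score, reverse=True)[:top_n]


def retrieve(query, corpus):
    """Run the planned retrievers, merge their results, then re-rank."""
    candidates = []
    for retriever in plan(query):
        for doc in retriever(query, corpus):
            if doc not in candidates:  # de-duplicate across retrievers
                candidates.append(doc)
    return rerank(query, candidates)


corpus = [
    "Refunds are processed within 14 days.",
    "Refunds require the original receipt and a refund form.",
    "The warehouse holds 1200 units.",
    "Our mission is customer happiness.",
]
top = retrieve("How long until refunds are processed?", corpus)
print(top)
```

Only the re-ranked top passages would be passed on to the language model. In a jointly optimized system the planner, retrievers and re-ranker would be trained together rather than hand-written as they are here.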

Beyond plain text: Contextual AI now reads charts and connects to databases

While the newly announced GLM focuses on text generation, Contextual AI’s platform has recently added support for multimodal content including charts, diagrams and structured data from popular platforms like BigQuery, Snowflake, Redshift and Postgres.

“The most challenging problems in enterprises are at the intersection of unstructured and structured data,” Kiela noted. “What I’m mostly excited about is really this intersection of structured and unstructured data. Most of the really exciting problems in large enterprises are smack bang at the intersection of structured and unstructured, where you have some database records, some transactions, maybe some policy documents, maybe a bunch of other things.”

The platform already supports a variety of complex visualizations, including circuit diagrams in the semiconductor industry, according to Kiela.

Contextual AI’s future plans: Creating more reliable tools for everyday business

Contextual AI plans to release its specialized re-ranker component shortly after the GLM launch, followed by expanded document-understanding capabilities. The company also has experimental features for more agentic capabilities in development.

Founded in 2023 by Kiela and Amanpreet Singh, who previously worked at Meta’s Fundamental AI Research (FAIR) team and Hugging Face, Contextual AI has secured customers including HSBC, Qualcomm and the Economist. The company positions itself as helping enterprises finally realize concrete returns on their AI investments.

“This is really an opportunity for companies who are maybe under pressure to start delivering ROI from AI to start [looking at] more specialized solutions that actually solve their problems,” Kiela said. “And part of that really is having a grounded language model that’s maybe a bit more boring than a standard language model, but it’s really good at making sure that it’s grounded in the context and that you can really trust it to do its job.”


