Last month, Anthropic, the AI startup valued at $61.5 billion, set up a gaming livestream on Twitch. Gaming livestreams are nothing new on Twitch, but this one is a little different: Claude, Anthropic’s AI model, is trying to beat Pokémon Red.
We are now one month in, and the livestream is still going. However, Claude has not progressed all that much. And, at this rate, Anthropic’s AI agent may never be the very best, like no one ever was.
According to Anthropic, when it first launched the “Claude Plays Pokémon” project, earlier versions of its AI agent Claude failed at some very basic tasks. In June 2024, for example, Claude 3.5 would try to run away from almost every battle, the company said.
A few months and a few versions of Claude later, Anthropic said there was a stark change. In February 2025, Anthropic gave Claude 3.7 Sonnet a whirl at playing Pokémon.
“Within hours, Claude defeated Brock. Days later, it trounced Misty,” Anthropic said. “Progress that older models had little hope of achieving.”
Anthropic said that Claude 3.7 Sonnet could plan ahead, remember objectives, and learn from its mistakes, unlike earlier versions of the AI agent. It also built a knowledge base, saw the screen, and simulated button presses.
However, the progress Claude 3.7 Sonnet initially made in the game seems to have stalled.
For example, livestream viewers watched as Claude 3.7 took 78 hours to get through Mt. Moon in the game. On Reddit, gamers estimated that it would usually take a child just a few hours to advance through the same stage.
Claude can be seen going in circles, stumbling around the same paths, and often knocking into walls as it tries to get around the game.
The livestream is engaging, especially as a text box lays out Claude’s “thinking” while the AI agent tries to figure out what moves to make next.
According to Anthropic engineers in an interview with Ars Technica, Claude has an easier time with parts of the game that involve text, such as Pokémon battles. However, it struggles with the more visual parts of the game, such as moving from town to town on the map.
Claude 3.7 Sonnet has gone much further in the game than earlier Claude models, so there’s been progress. However, for those warning that AI will soon be able to take over the world, we’re nowhere close to that being a reality yet. Claude still has 151 Pokémon to catch.