Anthropic launches Claude 2 amid continuing AI hullabaloo

Share This Post

The new model demonstrates measurable improvements across numerous categories, including near-instant query response times and the ability to parse inputs up to 100K tokens in size.

Anthropic, an artificial intelligence (AI) and “public benefit” company, launched Claude 2 on July 11, marking another milestone in a year full of seemingly nonstop progress from the burgeoning generative AI sector. 

According to a company blog post, Claude 2 shows improvements across nearly every measurable category. Perhaps most noteworthy among the differences between it and its predecessor is how the researchers discuss their work.

There’s no mention of traditional machine learning benchmarking or computational scores against similar models in the blog post announcing Claude 2. Instead, Anthropic tested both Claude and Claude 2 head-to-head on numerous tests meant to represent real-world knowledge, skills and problem-solving tests.

Claude 2 beat its predecessor across the board on knowledge, coding and other exams and, according to Anthropic, even scores well against human averages:

“When compared to college students applying to graduate school, Claude 2 scores above the 90th percentile on the GRE reading and writing exams, and similarly to the median applicant on quantitative reasoning.”

It is worth noting that many experts believe comparisons between human and AI test takers are inefficacious due to the nature of human cognitive reasoning and the likelihood that a large language model’s training data set contains test information. Essentially, tests designed for humans may not actually “test” an AI’s ability to reason or provide a proper demonstration of actual knowledge or skill.

Along with the launch of Claude 2, Anthropic debuted a beta version of a web-based “Talk to Claude” interface providing general access to the chatbot for users in the United States and the United Kingdom.

Related: How to land a high-paying job as an AI prompt engineer

Cointelegraph conducted brief testing of the new version and, anecdotally speaking, the improvements were immediately noticeable. Claude 2 responded to Cointelegraph prompts near instantly with clear, concise answers.

Chat with Claude 2. Source: Anthropic

According to Anthropic, the new model’s prompt limit is 100,000 tokens, or about the equivalent of 75,000 words. The site’s user interface indicates that users can upload PDF, TXT, CSV and similar documents for parsing; however, this functionality did not work in Cointelegraph’s limited testing prior to publishing this article.

Collect this article as an NFT to preserve this moment in history and show your support for independent journalism in the crypto space.

Read Entire Article
spot_img
- Advertisement -spot_img

Related Posts

8 Months of Inactivity, Then Millions Withdrawn: What’s Going on With the US Government’s Seized Crypto?

Blockchain monitoring firm Arkham Intelligence has reported that the US government recently initiated a significant transaction, withdrawing $54 million from the decentralized finance (defi) platform

Ripple CEO optimistic about crypto post-election, regardless of outcome

Ripple Labs CEO Brad Garlinghouse believes the US will become more crypto-friendly regardless of which political party wins the upcoming election, CNBC reported on Oct 24 Garlinghouse said during DC

Goatseus Maximus (GOAT) Enters Crypto’s Top-100: Time To Buy Or Sell?

Goatseus Maximus (GOAT) has surged into the top 100 cryptocurrencies by market capitalization, currently holding the #81 position The memecoin has experienced a remarkable 27% increase in the last 24

US prosecutors recommend leniency for former FTX executive Nishad Singh following ‘substantial assistance’

US prosecutors have requested that the court favorably consider former FTX executive Nishad Singh’s “substantial assistance” during their investigation into the failed crypto

Pennsylvania House Passes ‘Bitcoin Rights’ Bill With Bipartisan Support

The Pennsylvania House of Representatives has made a significant move in the cryptocurrency regulation landscape in the US by passing the ‘Bitcoin Rights’ bill with “overwhelming”

The Slow Death of Private Blockchain Tech—R3 Reportedly Explores Sale Despite Big Bank Support

According to a recent report, R3, the private blockchain initiative backed by several major companies—including Intel, Bank of America, and Wells Fargo—has been looking into various strategic