Menu

Categories:

Hot right now:

Follow on:

Coinsurges provides coverage of fintech, blockchain, and Bitcoin, delivering the most recent news and analyses on the future of money. Stay up-to-date with live prices, charts, and trading options for the top exchanges. Keep track of the day's top cryptocurrency gainers and losers, as well as which coins have experienced gains and losses in the past 24 hours.
Trust Coinsurges as your go-to source for all news and updates in the industry.

Menu

Categories:

Hot right now:

Follow on:

Coinsurges provides coverage of fintech, blockchain, and Bitcoin, delivering the most recent news and analyses on the future of money. Stay up-to-date with live prices, charts, and trading options for the top exchanges. Keep track of the day's top cryptocurrency gainers and losers, as well as which coins have experienced gains and losses in the past 24 hours.
Trust Coinsurges as your go-to source for all news and updates in the industry.

Claude 3.5 sets new AI benchmarks, beating GPT-4o in coding and reasoning

Share This Post

Anthropic has launched Claude 3.5 Sonnet, the latest addition to its AI model lineup, claiming it surpasses previous models and competitors like OpenAI’s GPT-4 Omni. Available for free on Claude.ai and the Claude iOS app, the model is also accessible via the Anthropic API, Amazon Bedrock, and Google Cloud’s Vertex AI. Claude 3.5 Sonnet is priced at $3 per million input tokens and $15 per million output tokens, with a 200,000-token context window.

Claude 3.5 Sonnet benchmarks (Anthropic)
Claude 3.5 Sonnet benchmarks (Anthropic)

Claude 3.5 Sonnet sets new benchmarks in graduate-level reasoning (GPQA), undergraduate-level knowledge (MMLU), and coding proficiency (HumanEval). It demonstrates significant improvements in understanding nuance, humor, and complex instructions and excels at generating high-quality content with a natural tone. The model operates at twice the speed of Claude 3 Opus, making it suitable for complex tasks like context-sensitive customer support and multi-step workflows.

“In an internal agentic coding evaluation, Claude 3.5 Sonnet solved 64% of problems, outperforming Claude 3 Opus, which solved 38%.”

The model can independently write, edit, and execute code, making it effective for updating legacy applications and migrating codebases. It also excels in visual reasoning tasks, such as interpreting charts and graphs, and can accurately transcribe text from imperfect images, benefiting sectors like retail, logistics, and financial services.

Anthropic has also introduced Artifacts, a new feature on Claude.ai that allows users to generate and edit content like code snippets, text documents, or website designs in real time. This feature marks Claude’s evolution from a conversational AI to a collaborative work environment, with plans to support team collaboration and centralized knowledge management in the future.

Anthropic emphasizes its commitment to safety and privacy, stating that Claude 3.5 Sonnet has undergone rigorous testing to reduce misuse. The model has been evaluated by external experts, including the UK’s Artificial Intelligence Safety Institute (UK AISI), and has integrated feedback from child safety experts to update its classifiers and fine-tune its models. Anthropic assures that it does not train its generative models on user-submitted data without explicit permission.

Looking ahead, Anthropic plans to release Claude 3.5 Haiku and Claude 3.5 Opus later this year, along with new features like Memory, which will enable Claude to remember user preferences and interaction history.

The post Claude 3.5 sets new AI benchmarks, beating GPT-4o in coding and reasoning appeared first on CryptoSlate.

Read Entire Article
spot_img
- Advertisement -spot_img

Related Posts

PEPE Price Breaks Ascending Triangle To Target Another 20% Crash

The PEPE price has taken a sudden bearish turn after breaking out of an Ascending Triangle pattern In light of this breakout, a crypto analyst has predicted that PEPE could face a massive 20% price

‘Cut Interest Rates, Jerome’: Trump Lashes Out as Stocks Tumble

Amid today’s ongoing market volatility, Trump stated that Federal Reserve Chair Jerome Powell must “stop playing politics” and swiftly cut the federal funds rate Wall Street in Retreat, Trump

Bitcoin decoupling from tech stocks indicates new geopolitical use as economic hedge – StanChart

Bitcoin (BTC) outperformed most major tech stocks on April 3 and April 4 as markets reeled from steep losses across the so-called “Magnificent Seven” (MAG7) Standard Chartered head of

$50 Million Bounty Offered by Tron’s Justin Sun to Recover Stolen Trueusd Reserves

Tron founder Justin Sun has announced a $50 million bounty for information to help recover $456 million in misappropriated Trueusd stablecoin reserves Sun Blames Licensed Intermediaries Tron founder

Ethereum Whales Buy the Dip – Over 130K ETH Added In A Single Day

Ethereum is trading below the $1,900 level, facing ongoing selling pressure as the broader crypto market continues to weaken After a sharp rejection from the $2,500 mark in late February, bulls have

Grayscale moves closer to Solana ETF with SEC filing

Grayscale has taken the next step in its efforts to launch a spot Solana exchange-traded fund (ETF) On April 4, the digital asset manager filed a Form S-1 with the US Securities and Exchange