Tuesday, March 5, 2024

Anthropic says its new Claude 3 AI chatbot scores better than GPT-4 on key benchmarks

The battle between AI chatbots is more than a two-horse race. Anthropic, a company founded by several former OpenAI employees, claims that its new Claude 3 language model outperforms ChatGPT and Google’s Gemini on several key industry benchmarks. The company wrote in its blog that it even achieved “near-human” performance on some tasks.

Claude 3 has launched three new chatbots, including Haiku, Sonnet and Opus. Sonnet powers the Claude.ai chatbot, which is free to use by logging in via email. Meanwhile, Opus, the largest and most powerful LL.M., will be available for subscription through the “Claude Pro” service for $20 per month. It’s also multimodal, so unlike past versions, it can handle both text and image input.

The company says that all Claude 3 models “can support live customer chat, autocomplete and data extraction tasks where responses must be immediate and immediate.” In addition to promising “near-instant results,” they can also handle longer, multi-step instructions with greater accuracy.

Anthropic says its new Claude 3 AI chatbot scores better than GPT-4 on key benchmarksAnthropic says its new Claude 3 AI chatbot scores better than GPT-4 on key benchmarks
Anthropic selection

Opus demonstrates better graduate-level reasoning than GPT-4, scoring 14.7% higher than GPT-4 on this test. It also beat OpenAI’s chatbot in tasks involving math, coding, reasoning and knowledge.

They also surpass past Cloud models. “For the vast majority of workloads, Sonnet is 2x faster than Claude 2 and Claude 2.1, and has a higher level of intelligence. It excels at tasks that require fast response, such as knowledge retrieval or sales automation. Opus is as fast as Claude 2 is similar to 2.1, but with a higher level of intelligence,” according to Anthropic.

Meanwhile, Haiku, the smallest version of the Claude 3, is “the fastest and most cost-effective model on the market.” To this end, it is able to read dense research papers containing charts and graphs in just three seconds.

The company also notes that Claude 3 “can handle a variety of visual formats, including photos, charts, diagrams, and technical diagrams,” assisting companies working with PDFs, flowcharts, or presentation slides. With a more nuanced understanding of the request, it’s also less likely to reject innocuous content while still recognizing “real harm.”

Anthropic says Claude AI follows 10 secret foundational pillars of fairness. Claude 3 was trained on non-public internal and public-facing materials using hardware from Amazon Web Services (AWS) and Google Cloud (Amazon recently invested $4 billion in Anthropic).

Claude 3 Opus and Claude 3 Sonnet are now available through Anthropic’s API, and Haiku will be available soon. Sonnet is also accessible through Amazon Bedrock and available for private preview on Google Cloud’s Vertex AI Model Garden.

This article contains affiliate links; if you click on such links and make a purchase, we may earn a commission.

Source link



from Tech Empire Solutions https://techempiresolutions.com/anthropic-says-its-new-claude-3-ai-chatbot-scores-better-than-gpt-4-on-key-benchmarks/
via https://techempiresolutions.com/

No comments:

Post a Comment

Chuzo Login

How to Login to Chuzo Are you having trouble logging into Chuzo? Let’s explore this guide to trouble shoot your problems. Make Sure...