Anthropic sets a new standard for AI coding and agents with Claude Opus 4.5

Anthropic has released Claude Opus 4.5, its most capable model for coding and agents with state-of-the-art performance on software engineering benchmarks, dramatically improved token efficiency, and reduced pricing to $5/$25 per million tokens.

Anthropic is positioning Claude Opus 4.5 as the world's best model for coding, agents, and computer use. The launch also brings a price cut to $5 per million input tokens and $25 per million output tokens, putting Opus-level capability within reach of more developers and enterprises.

Claude Opus 4.5 achieves state-of-the-art performance on SWE-bench Verified, a key software engineering benchmark, and delivers substantial improvements in everyday tasks like research, spreadsheet work, and slide creation. Internal testing revealed the model scored higher than any human candidate on Anthropic's notoriously difficult performance engineering take-home exam within the prescribed two-hour limit.

What sets Opus 4.5 apart is its ability to handle ambiguity and reason about complex tradeoffs with minimal guidance. Early testers consistently reported that the model simply "gets it," successfully tackling tasks that were near-impossible for previous models. The model also shows creative problem-solving: in one benchmark scenario, it found a workaround to help an airline customer by upgrading their cabin class before modifying flights, a solution the test designers hadn't anticipated.

Anthropic is introducing a new "effort" parameter that lets developers control the tradeoff between speed and thoroughness. At medium effort, Opus 4.5 matches Sonnet 4.5's best score on SWE-bench Verified while using 76% fewer output tokens. At maximum effort, it surpasses Sonnet 4.5 by over 4 percentage points while still using 48% fewer tokens.
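To put those percentages in concrete terms, here is a rough back-of-the-envelope sketch in Python. It assumes the quoted $5/$25 pricing means $5 per million input tokens and $25 per million output tokens. The request sizes (20,000 input tokens, a 10,000-token baseline output) are hypothetical, and the function names are illustrative, not part of any Anthropic SDK. It only shows what the reported token reductions would mean for an Opus 4.5 bill; it does not compare costs across models, since Sonnet 4.5 pricing is not covered here.

```python
# Assumed list prices in USD per million tokens (from the $5/$25 figure).
INPUT_USD_PER_MTOK = 5.00
OUTPUT_USD_PER_MTOK = 25.00


def request_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Cost of one request at the assumed Opus 4.5 prices."""
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


def reduced_output_tokens(baseline_output_tokens: int, reduction: float) -> int:
    """Output tokens left after a fractional reduction (0.76 means 76% fewer)."""
    return round(baseline_output_tokens * (1.0 - reduction))


# Hypothetical workload: 20,000 input tokens, 10,000 baseline output tokens.
input_tokens = 20_000
baseline_output = 10_000

medium_output = reduced_output_tokens(baseline_output, 0.76)  # medium effort
max_output = reduced_output_tokens(baseline_output, 0.48)     # maximum effort

medium_cost = request_cost_usd(input_tokens, medium_output)
max_cost = request_cost_usd(input_tokens, max_output)

print(f"medium effort: {medium_output} output tokens, ${medium_cost:.2f}")
print(f"max effort:    {max_output} output tokens, ${max_cost:.2f}")
```

Under these assumptions the output side dominates the bill, which is why a drop in output tokens matters more to cost than the same drop in input tokens would.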

The model also represents Anthropic's most robustly aligned release to date, with improved resistance to prompt injection attacks compared to competing frontier models. Opus 4.5 is available now via the Claude API, apps, and major cloud platforms, with expanded access to features like Claude for Chrome and Claude for Excel rolling out to Max, Team, and Enterprise users.
