Claude Opus 4.8

4.8 · Family: Claude

Anthropic’s latest flagship model in the Claude Opus family, optimized for agentic tasks, coding, and long multi-step sessions.

✓ Active✓ Public accessLLMMultimodalReasoning modelTool-using model📁 Claude

Context window

tokens

Release date

28 May 2026

🏢AnthropicProducer

Access:APIHostedDeployment:☁ Cloud

Overview

Claude Opus 4.8 is Anthropic’s flagship Claude-family model released on May 28, 2026, succeeding Claude Opus 4.7. Standard pricing matches the predecessor ($5/$25 per million input/output tokens), while fast mode is three times cheaper than on Opus 4.7 and 4.6 ($10/$50 per million tokens) at 2.5× the generation speed.

Anthropic reports that Opus 4.8 is roughly four times less likely than its predecessor to let flaws in its own code pass unremarked, and the alignment assessment shows substantially lower rates of misaligned behavior (e.g., deception) than Opus 4.7. The release ships alongside dynamic workflows in Claude Code (hundreds of parallel subagents) and an effort control (low/medium/high/extra/max) in claude.ai and Cowork.

The model is available via the claude-opus-4-8 identifier in the Claude API, in claude.ai apps (Free, Pro, Max, Team, Enterprise plans), and on partner cloud platforms.

Classification

LLMMultimodalReasoning modelTool-using model

Family: Claude

Access & deployment

APIHosted

Cloud

Weights: Closed

Key parameters

📏 Context: 1M

✓ Tools

📥 Input: text, image, documents

Platforms

Anthropic Claude API Amazon Bedrock Vertex AI Microsoft Azure AI Foundry

Technical specification

Context window

tokens

Features:✓ Tool use

Modalities

⬇ Input

textimagedocuments

⬆ Output

textcodestructured_data

Capabilities and applications

Native model capabilities

Reasoning

The model's ability to reason logically and solve complex problems.

Category: reasoning

Multi-step reasoning

Carrying out multi-step chains of reasoning across long, complex tasks.

Category: reasoning

Long context

Support for large context windows — tens to hundreds of thousands (or millions) of input tokens. Enables analysis of entire codebases, long documents, and many parallel conversations without losing earlier information. GPT-5.1 supports 400,000 tokens.

Category: language

Coding

Generating, analysing and modifying code in many programming languages. Covers writing functions, debugging, refactoring, code review, and creating tests. Measured by benchmarks such as HumanEval and SWE-bench.

Category: coding

Structured output

Producing data in structured formats such as JSON.

Category: structured_generation

Image understanding

Analysing and interpreting the content of images.

Category: vision

Chart understanding

Reading and interpreting charts, tables and diagrams.

Category: vision

OCR

Recognising text within images and documents.

Category: vision

Multilingual

Competence in many natural languages (from a few to over a hundred): understanding, generation, translation, and code-switching within a single conversation. Frontier models support a wide range of languages with comparable quality.

Category: language

Planning

Forming and executing action plans for complex tasks.

Category: planning

Benchmark results

8 benchmarks

SWE-Bench Pro

agentic coding

69.2%%