Home AI Article

Claude Opus 4.8 Shipped With Honesty Improvements and Mid-Conversation System Messages

TL;DR

Anthropic released Claude Opus 4.8 with reduced hallucination rates, mid-conversation system prompt injection, and lower prompt cache minimums while maintaining previous pricing.

Key Points

  • 4x lower incorrect-rate than predecessor by abstaining on uncertain questions rather than confident false claims
  • Mid-conversation system messages now supported, enabling dynamic prompt steering without restating full context and preserving prompt cache hits
  • Prompt cache minimum reduced from previous threshold to 1,024 tokens, lowering costs for agentic loops
  • Pricing unchanged: $5/million input, $25/million output tokens; knowledge cutoff remains January 2026

Why It Matters

For developers building LLM applications, reduced hallucination rates improve reliability in production systems. Mid-conversation system messages enable more sophisticated agentic patterns without cache invalidation, directly reducing costs and latency in long-running conversations. The honesty improvements matter for code generation workflows where flagging uncertainty prevents silent bugs.
Official announcement and system card

Source: simonwillison.net