Back to List
Anthropic Addresses Claude Code Quality Degradation Reports and Implements Fixes for Sonnet and Opus Models
Industry NewsAnthropicClaudeAI Engineering

Anthropic Addresses Claude Code Quality Degradation Reports and Implements Fixes for Sonnet and Opus Models

Anthropic has released a postmortem addressing recent user reports regarding the degradation of Claude's performance across specific tools, including Claude Code, the Claude Agent SDK, and Claude Cowork. The investigation identified three distinct technical issues occurring between March and April 2026: an intentional but poorly received reduction in reasoning effort to manage latency, a session-clearing bug that caused repetitive behavior and memory loss, and a system prompt change aimed at reducing verbosity that inadvertently harmed coding quality. While the API remained unaffected, these issues impacted Sonnet 4.6, Opus 4.6, and Opus 4.7. Anthropic has since reverted the problematic changes and fixed the bugs as of April 20 (v2.1.116), emphasizing their commitment to maintaining model intelligence over speed.

Hacker News

Key Takeaways

  • Three Distinct Issues Identified: The perceived degradation was caused by a change in reasoning effort, a session-clearing bug, and a system prompt instruction to reduce verbosity.
  • Specific Tools Affected: Issues were limited to Claude Code, the Claude Agent SDK, and Claude Cowork; the core API and inference layer were not impacted.
  • Models Impacted: The performance dips affected Sonnet 4.6, Opus 4.6, and Opus 4.7 across different timeframes.
  • Full Resolution: All identified issues were resolved as of April 20 with the release of version 2.1.116.

In-Depth Analysis

Reasoning Effort and Latency Trade-offs

On March 4, Anthropic attempted to address UI latency issues where the interface appeared frozen by changing the default reasoning effort from "high" to "medium." While this was intended to improve the user experience by reducing wait times, it resulted in a noticeable drop in intelligence for Sonnet 4.6 and Opus 4.6. Following user feedback indicating a preference for higher intelligence over speed, Anthropic reverted this change on April 7. The company acknowledged that prioritizing lower latency at the expense of reasoning quality was the "wrong tradeoff."

Technical Bugs and Prompting Side Effects

Two additional technical factors contributed to the degradation. On March 26, a feature designed to clear old thinking from idle sessions to improve resumption speed introduced a bug. This bug caused the system to clear thinking every turn, making the models appear forgetful and repetitive. Furthermore, an April 16 update to the system prompt intended to reduce verbosity negatively impacted coding quality when combined with other prompt adjustments. This specific issue affected the latest models, including Opus 4.7. Both the bug and the prompt changes were corrected and reverted by April 20.

Investigation Challenges and Aggregate Effects

Anthropic noted that because these three changes occurred on different schedules and affected different segments of traffic, the resulting feedback appeared as broad and inconsistent degradation. The investigation began in early March but was complicated by the difficulty of distinguishing these specific technical failures from the normal variation in user feedback. The company has reaffirmed that they never intentionally degrade models and are implementing changes to prevent similar regressions in the future.

Industry Impact

This incident highlights the delicate balance AI providers must maintain between model "intelligence" (reasoning effort) and operational performance (latency). For the AI industry, it serves as a case study in how minor optimizations—such as reducing verbosity or clearing session cache—can have significant, unintended consequences on the quality of complex tasks like coding. Anthropic's transparent postmortem underscores the importance of user feedback loops in identifying non-obvious regressions that automated testing might miss, particularly when those regressions are tied to UI-specific implementations rather than the underlying API.

Frequently Asked Questions

Question: Was the Claude API affected by these quality issues?

No. Anthropic confirmed that the API and inference layer remained unaffected throughout this period; the issues were isolated to Claude Code, the Claude Agent SDK, and Claude Cowork.

Question: Which Claude models were impacted by the performance degradation?

The issues affected Sonnet 4.6, Opus 4.6, and Opus 4.7, depending on the specific technical change and the timeframe.

Question: How has Anthropic resolved these issues?

As of April 20 (v2.1.116), Anthropic has reverted the reasoning effort to "high," fixed the session-clearing bug, and removed the system prompt instructions that were harming coding quality.

Related News

Meituan LongCat Open-Sources General 365: A Rigorous New Benchmark for AI Reasoning Performance
Industry News

Meituan LongCat Open-Sources General 365: A Rigorous New Benchmark for AI Reasoning Performance

Meituan's LongCat team has officially released General 365, a new open-source benchmark designed to evaluate the reasoning capabilities of large language models (LLMs). The benchmark's debut has sent ripples through the AI community by revealing a significant performance gap in current technology. In a comprehensive test of 26 mainstream models, even the industry-leading Gemini 3 Pro managed an accuracy rate of only 62.8%. More strikingly, the vast majority of the models tested failed to reach the 60% threshold, which is typically considered a passing grade. This release by Meituan Technical Team establishes a new, more challenging standard for AI reasoning, suggesting that current models still face substantial hurdles in complex cognitive tasks.

Meituan BI Evolution: Building a Next-Generation Metric Platform and Analysis Engine for Enhanced Data Consistency
Industry News

Meituan BI Evolution: Building a Next-Generation Metric Platform and Analysis Engine for Enhanced Data Consistency

Meituan's data platform team has pioneered a new generation of Business Intelligence (BI) architecture centered on a unified Metric Platform. This strategic shift addresses critical challenges inherent in traditional BI systems, such as inconsistent data definitions (data caliber confusion) and poor query performance resulting from personalized dataset-driven models. By developing two core technical capabilities—Automatic Semantics and Enhanced Computing—Meituan has successfully streamlined its data analysis processes. This architecture ensures that business metrics remain consistent across the organization while significantly optimizing the efficiency of complex data queries. The practice represents a significant advancement in Meituan's technical infrastructure, moving toward a more centralized and performant data-driven decision-making environment.

50 Rising AI Startups in Asia: Tech in Asia Identifies the Region's Next Major Tech Leaders
Industry News

50 Rising AI Startups in Asia: Tech in Asia Identifies the Region's Next Major Tech Leaders

Tech in Asia has released a curated selection of 50 rising artificial intelligence startups across the Asian continent, marking them as high-potential ventures poised to become the "next big thing" in the global technology sector. This identification underscores a significant surge in AI innovation within the region, highlighting a diverse group of companies that are currently on an upward trajectory. The report suggests that these specific startups possess the necessary momentum and technological foundations to challenge existing market structures and lead the next wave of digital transformation. By focusing on these emerging players, the analysis points toward a maturing Asian AI ecosystem that is increasingly capable of producing world-class technology leaders.