Asserting Anthropic’s latest mannequin, Claude Opus 4.5, in Microsoft Foundry. Opus 4.5 is now accessible in public preview in Microsoft Foundry, GitHub Copilot paid plans, and Microsoft Copilot Studio.
We’re at an actual inflection level within the AI panorama, a threshold the place fashions transfer from helpful assistants to real collaborators. Fashions that perceive the target, consider constraints, and execute complicated multi-tool workflows. Fashions that not solely help processes, however assist restructure them for reliability, scale, and operational effectivity.
Anthropic’s latest mannequin, Claude Opus 4.5, embodies that shift. Right this moment, we’re excited to share that Opus 4.5 is now accessible in public preview in Microsoft Foundry, GitHub Copilot paid plans, and Microsoft Copilot Studio.
Constructing on the Microsoft Ignite announcement of our expanded partnership with Anthropic, Microsoft Foundry delivers its dedication to giving Azure prospects fast entry to the widest number of superior and frontier AI fashions of any cloud. Foundry empowers builders to speed up innovation with an built-in, interoperable, and safe AI platform that permits seamless deployment, integration, and scaling for AI apps and brokers.
We’re excited to make use of Anthropic Claude fashions from Microsoft Foundry. Having Claude’s superior reasoning alongside GPT fashions in a single platform provides us flexibility to construct scalable, enterprise-grade workflows that transfer far past prototypes.
—Michele Catasta, President, Replit
Opus 4.5 for actual work
Opus 4.5 units a brand new bar for coding, agentic workflows, and enterprise productiveness: outperforming Sonnet 4.5 and Opus 4.1, at a extra accessible worth level. Its versatility throughout software program engineering, complicated reasoning, device use, and imaginative and prescient unlocks new alternatives for organizations to modernize techniques, automate vital workstreams, and ship ROI quicker.
By prioritizing speedy integration of the newest fashions, Foundry permits Azure prospects to remain forward of the curve and maximize the affect of their agentic AI techniques; all whereas sustaining centralized governance, safety, and observability at scale.
1. Constructed for manufacturing engineering and agentic capabilities
Based on Anthropic, Opus 4.5 delivers state-of-the-art efficiency on business normal software program engineering benchmarks, together with new highs on SWE-bench (80.9%). Early testers constantly describe the mannequin as having the ability to interpret ambiguous necessities, purpose over architectural tradeoffs, and establish fixes for points that span a number of techniques.
Opus 4.5 accelerates engineering velocity by finishing multi-day growth work in hours with:
- Improved multilingual coding efficiency
- Extra environment friendly code technology
- Stronger take a look at protection
- Cleaner architectural and refactoring selections
| Functionality / Benchmark | Claude Opus 4.5 | Claude Sonnet 4.5 | Claude Opus 4.1 | Gemini 3 Professional |
| Agentic coding (SWE-bench Verified) | 80.90% | 77.20% | 74.50% | 76.20% |
| Agentic terminal coding (Terminal-bench 2.0) | 59.30% | 50.00% | 46.50% | 54.20% |
| Agentic device use — Retail (t2-bench) | 88.90% | 86.20% | 86.80% | 85.30% |
| Agentic device use — Telecom (t2-bench) | 98.20% | 98.00% | 71.50% | 98.00% |
| Scaled device use (MCP Atlas) | 62.30% | 43.80% | 40.90% | _ |
| Pc use (OSWorld) | 66.30% | 61.40% | 44.40% | _ |
| Novel downside fixing (ARC-AGI-2 Verified) | 37.60% | 13.60% | _ | 31.10% |
| Graduate-level reasoning (GPQA Diamond) | 87.00% | 83.40% | 81.00% | 91.90% |
| Visible reasoning (MMMU validation) | 80.70% | 77.80% | 77.10% | _ |
| Multilingual Q&A (MMLU) | 90.80% | 89.10% | 89.50% | 91.80% |
Claude Opus 4.5 benchmark outcomes from Anthropic
Opus 4.5 can be one of many strongest tool-using fashions accessible immediately, able to powering brokers that work seamlessly throughout tons of of instruments. Builders achieve entry to a number of necessary upgrades:
- Programmatic Device Calling: Execute instruments immediately in Python for extra environment friendly, deterministic workflows.
- Device Search: Dynamically uncover instruments from massive libraries with out utilizing up area within the context window.
- Device Use Examples: Extra correct device calling for complicated device schemas.
Collectively, these capabilities allow refined brokers throughout cybersecurity, full-stack software program engineering, monetary modeling, and different workflows requiring a number of device interactions. Opus 4.5 exhibits sturdy, real-world intelligence making use of these instruments creatively inside constraints. In testing, the mannequin efficiently navigated complicated coverage environments, corresponding to airline change guidelines, chaining upgrades, downgrades, cancellations, and rebookings to optimize outcomes. This type of adaptive, constraint-aware problem-solving displays a significant step ahead in what agentic AI techniques can accomplish.
Manus deeply makes use of Anthropic’s Claude fashions due to their sturdy capabilities in coding and long-horizon activity planning, along with their prowess to deal with agentic duties. We’re very excited to be utilizing them now on Microsoft Foundry!
—Tao Zhang, Co-founder & Chief Product Officer, Manus AI
2. Improved developer expertise on Foundry
Opus 4.5 paired with new developer capabilities provided on Foundry is designed to assist groups construct more practical and environment friendly agentic techniques:
- Effort Parameter (Beta): Management how a lot computational effort Claude allocates throughout considering, device calls, and responses to stability efficiency with latency and value to your particular use instances.
- Compaction Management: Deal with long-running agentic duties extra successfully with new SDK helpers that handle context effectively over prolonged interactions.
These enhancements present higher predictability and operational management for enterprise workloads.
3. Enhanced workplace productiveness and laptop use
Opus 4.5 additionally doubles down as Anthropic’s finest imaginative and prescient mannequin, unlocking workflows that depend upon complicated visible interpretation and multi-step navigation. Pc use efficiency has improved considerably, enabling extra dependable automation of desktop duties.
For information staff, the mannequin delivers a step-change enchancment in powering brokers that create spreadsheets, displays, and paperwork. It produces work with consistency, skilled polish, and real area consciousness making it a match for finance, authorized, and different precision-critical verticals. The mannequin higher leverages reminiscence to take care of context and consistency throughout recordsdata all through sprawling skilled tasks.
4. Security and safety
Based on Anthropic, Opus 4.5 additionally delivers significant enhancements in security and safety. The mannequin exhibits a decreased charge of misaligned responses, stronger robustness in opposition to prompt-injection assaults, and extra dependable conduct throughout complicated duties.
These enhancements align with Microsoft’s dedication to offering enterprise prospects with fashions that meet excessive bars for security, governance, and operational integrity
Use instances
Opus 4.5 serves the next use instances
- Software program growth: Deploy brokers that deal with complicated, multi-system growth duties with minimal supervision.
- Monetary evaluation: Join insights throughout regulatory filings, market studies, and inner information for classy predictive modeling and proactive compliance monitoring.
- Cybersecurity: Correlate logs, vulnerability databases, and menace intelligence for professional-grade menace detection and automatic incident response.
- Enterprise operations: Handle refined workflows requiring coordination throughout a number of instruments, techniques, and data sources.
Pricing and availability
Opus 4.5 delivers frontier efficiency and units a brand new normal for a wide range of use instances at one third the value of earlier Opus-class fashions.
Mannequin | Provide sort | Deployment sort | Areas | Worth (1M tokens) | Availability |
Claude Opus 4.5 | Serverless Pay-go | World Customary | East US2, Sweden Central | Enter- $5 Output- $25 | November 24, 2025 (public preview) |
Get began immediately
Claude Opus 4.5 is accessible now in Microsoft Foundry and coming quickly in Visible Studio Code by way of the Foundry extension. Go to the Foundry portal to start constructing with Opus 4.5.
