Qwen fashions are actually out there in Amazon Bedrock

At the moment we’re including Qwen fashions from Alibaba in Amazon Bedrock. With this launch, Amazon Bedrock continues to broaden mannequin alternative by including entry to Qwen3 open weight basis fashions (FMs) in a full managed, serverless approach. This launch consists of 4 fashions: Qwen3-Coder-480B-A35B-Instruct, Qwen3-Coder-30B-A3B-Instruct, Qwen3-235B-A22B-Instruct-2507, and Qwen3-32B (Dense). Collectively, these fashions characteristic each mixture-of-experts (MoE) and dense architectures, offering versatile choices for various utility necessities.

Amazon Bedrock gives entry to industry-leading FMs via a unified API with out requiring infrastructure administration. You’ll be able to entry fashions from a number of mannequin suppliers, combine fashions into your purposes, and scale utilization primarily based on workload necessities. With Amazon Bedrock, buyer knowledge is rarely used to coach the underlying fashions. With the addition of Qwen3 fashions, Amazon Bedrock presents much more choices to be used instances like:

Code era and repository evaluation with prolonged context understanding
Constructing agentic workflows that orchestrate a number of instruments and APIs for enterprise automation
Balancing AI prices and efficiency utilizing hybrid considering modes for adaptive reasoning

Qwen3 fashions in Amazon Bedrock
These 4 Qwen3 fashions are actually out there in Amazon Bedrock, every optimized for various efficiency and value necessities:

Qwen3-Coder-480B-A35B-Instruct – It is a mixture-of-experts (MoE) mannequin with 480B whole parameters and 35B lively parameters. It’s optimized for coding and agentic duties and achieves robust ends in benchmarks resembling agentic coding, browser use, and power use. These capabilities make it appropriate for repository-scale code evaluation and multistep workflow automation.
Qwen3-Coder-30B-A3B-Instruct – It is a MoE mannequin with 30B whole parameters and 3B lively parameters. Particularly optimized for coding duties and instruction-following situations, this mannequin demonstrates robust efficiency in code era, evaluation, and debugging throughout a number of programming languages.
Qwen3-235B-A22B-Instruct-2507 – That is an instruction-tuned MoE mannequin with 235B whole parameters and 22B lively parameters. It delivers aggressive efficiency throughout coding, math, and common reasoning duties, balancing functionality with effectivity.
Qwen3-32B (Dense) – It is a dense mannequin with 32B parameters. It’s appropriate for real-time or resource-constrained environments resembling cell units and edge computing deployments the place constant efficiency is vital.

Architectural and practical options in Qwen3
The Qwen3 fashions introduce a number of architectural and practical options:

MoE in contrast with dense architectures – MoE fashions resembling Qwen3-Coder-480B-A35B, Qwen3-Coder-30B-A3B-Instruct, and Qwen3-235B-A22B-Instruct-2507, activate solely a part of the parameters for every request, offering excessive efficiency with environment friendly inference. The dense Qwen3-32B prompts all parameters, providing extra constant and predictable efficiency.

Agentic capabilities – Qwen3 fashions can deal with multi-step reasoning and structured planning in a single mannequin invocation. They’ll generate outputs that decision exterior instruments or APIs when built-in into an agent framework. The fashions additionally preserve prolonged context throughout lengthy periods. As well as, they assist instrument calling to permit standardized communication with exterior environments.

Hybrid considering modes – Qwen3 introduces a hybrid strategy to problem-solving, which helps two modes: considering and non-thinking. The considering mode applies step-by-step reasoning earlier than delivering the ultimate reply. That is best for complicated issues that require deeper thought. Whereas the non-thinking mode gives quick and near-instant responses for much less complicated duties the place velocity is extra essential than depth. This helps builders handle efficiency and value trade-offs extra successfully.

Lengthy-context dealing with – The Qwen3-Coder fashions assist prolonged context home windows, with as much as 256K tokens natively and as much as 1 million tokens with extrapolation strategies. This enables the mannequin to course of complete repositories, giant technical paperwork, or lengthy conversational histories inside a single activity.

When to make use of every mannequin
The 4 Qwen3 fashions serve distinct use instances. Qwen3-Coder-480B-A35B-Instruct is designed for complicated software program engineering situations. It’s suited to superior code era, long-context processing resembling repository-level evaluation, and integration with exterior instruments. Qwen3-Coder-30B-A3B-Instruct is especially efficient for duties resembling code completion, refactoring, and answering programming-related queries. If you happen to want versatile efficiency throughout a number of domains, Qwen3-235B-A22B-Instruct-2507 presents a steadiness, delivering robust general-purpose reasoning and instruction-following capabilities whereas leveraging the effectivity benefits of its MoE structure. Qwen3-32B (Dense) is suitable for situations the place constant efficiency, low latency, and value optimization are essential.

Getting began with Qwen fashions in Amazon Bedrock
To start utilizing Qwen fashions, within the Amazon Bedrock console, I can use the Chat/Textual content Playground part of the navigation pane to rapidly check the brand new Qwen fashions with a couple of prompts.

To combine Qwen3 fashions into my purposes, I can use any AWS SDKs. The AWS SDKs embody entry to the Amazon Bedrock InvokeModel and Converse API. I can even use these mannequin with any agentic framework that helps Amazon Bedrock and deploy the brokers utilizing Amazon Bedrock AgentCore. For instance, right here’s the Python code of a easy agent with instrument entry constructed utilizing Strands Brokers:

from strands import Agent
from strands_tools import calculator

agent = Agent(
    mannequin="qwen.qwen3-coder-480b-a35b-v1:0",
    instruments=[calculator]
)

agent("Inform me the sq. root of 42 ^ 9")

with open("operate.py", 'r') as f:
    my_function_code = f.learn()

agent(f"Assist me optimize this Python operate for higher efficiency:nn{my_function_code}")

Now out there
Qwen fashions can be found at the moment within the following AWS Areas:

Qwen3-Coder-480B-A35B-Instruct is out there within the US West (Oregon), Asia Pacific (Mumbai, Tokyo), and Europe (London, Stockholm) Areas.
Qwen3-Coder-30B-A3B-Instruct, Qwen3-235B-A22B-Instruct-2507, and Qwen3-32B can be found within the US East (N. Virginia), US West (Oregon), Asia Pacific (Mumbai, Tokyo), Europe (Eire, London, Milan, Stockholm), and South America (São Paulo) Areas.

Examine the full Area record for future updates. You can begin testing and constructing instantly with out infrastructure setup or capability planning. To be taught extra, go to the Qwen in Amazon Bedrock product web page and the Amazon Bedrock pricing web page.

Attempt Qwen fashions on the Amazon Bedrock console now, and provide suggestions via AWS re:Put up for Amazon Bedrock or your typical AWS Help channels.

— Danilo

Qwen fashions are actually out there in Amazon Bedrock

Related Articles

Mars rover makes use of wiggly wheels impressed by lizard

This Week’s Superior Tech Tales From Across the Internet (By means of June 20)

AURA Foresight Reaches World XPRIZE Wildfire Finals in Alaska

LEAVE A REPLY Cancel reply

Latest Articles

Mars rover makes use of wiggly wheels impressed by lizard

This Week’s Superior Tech Tales From Across the Internet (By means of June 20)

AURA Foresight Reaches World XPRIZE Wildfire Finals in Alaska

Photo voltaic Beat Coal in US Electrical energy Combine for the First Time in Might

Robots-Weblog | RoboCup 2050: Werden Roboter einmal Fußball-Weltmeister?

ABOUT US