Qwen3.5-35B-A3B API

Qwen3.5-35B-A3B is a native multi-modal large language model launched by Tongyi Qianwen team, with a total parameter amount of 35B and an activation parameter amount of only 3B. The model uses an efficient hybrid architecture that combines a gated delta network with a sparse mixture of experts (MoE), natively supports 256K context length and is scalable to approximately 1 million tokens. The model achieves unified basic visual language capabilities through early fusion training and supports t...

Provider: Qwen

Context window: 256K

Pricing: $0.14 input / $1.08 output per 1M tokens

Capabilities: serverless, json_mode, function_calling, structured_output