Qwen3-8B is the latest large language model of the Tongyi Qianwen series, with 8.2B parameters. The model uniquely supports seamless switching between thinking mode (suitable for complex logical reasoning, mathematics, and programming) and non-thinking mode (suitable for efficient general conversations), significantly enhancing reasoning capabilities. The model excels in mathematics, code generation, and common-sense logical reasoning, and demonstrates superior human preference alignment in a...
Provider: Qwen
Context window: 128K
Pricing: $0.03 input / $0.28 output per 1M tokens
Capabilities: serverless, json_mode, function_calling, structured_output