Question 1

What is DeepSeek-R1-Distill-Qwen-32B best suited for?

Accepted Answer

DeepSeek-R1-Distill-Qwen-32B is positioned for production workloads that need DeepSeek-R1-Distill-Qwen-32B is a model obtained through knowledge distillation based on Qwen2.5-32B. The model was fine-tuned using 800,000 handpicked samples generated by DeepSeek-R1, demonstrating superior performance in multiple domains including mathematics, programming, and reasoning. Obtained the best results in multiple benchmark tests such as AIME 2024, MATH-500, GPQA Diamond, etc....

Question 2

How much context does DeepSeek-R1-Distill-Qwen-32B support?

Accepted Answer

This model is currently published with 128K. For long-running agent flows, FlowAPI will automatically optimize prompt packing to stay within the available window.

Question 3

How is DeepSeek-R1-Distill-Qwen-32B billed on FlowAPI?

Accepted Answer

Billing for DeepSeek-R1-Distill-Qwen-32B follows $0.200000 input / $0.200000 output. Availability and routing are managed by FlowAPI while the underlying model is supplied by deepseek-ai.

DeepSeek-R1-Distill-Qwen-32B API