trinity-large-thinking vs @cf/deepseek-ai/deepseek-r1-distill-qwen-32b

| Metric | trinity-large-thinking | @cf/deepseek-ai/deepseek-r1-distill-qwen-32b |
| --- | --- | --- |
| Input price (per 1M tokens) | $0.22 | $0.00 |
| Output price | $0.85 | $0.00 |
| Context window | 262K | 128K |
| Throughput | 147 tok/s | 122 tok/s |
| Availability | 99.1% | 97.7% |
| Cost / task | $0.001 | $0.000 |

Estimated monthly cost by workload

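| Workload | trinity-large-thinking | @cf/deepseek-ai/deepseek-r1-distill-qwen-32b |
| --- | --- | --- |
| Chat assistant | $168.00 | $0.00 |
| RAG / long context | $340.50 | $0.00 |
| Agent / tool use | $464.40 | $0.00 |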
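The monthly figures above follow from simple per-token arithmetic. The sketch below is one way to reproduce them, not the page's actual method. The token volumes in `WORKLOADS` are assumptions: the source does not state them, and they were chosen only because they yield the listed trinity-large-thinking costs. The sketch also assumes the output price is quoted per 1M tokens, like the input price.

```python
# Prices in USD per 1M tokens, taken from the comparison table.
# ASSUMPTION: the output price uses the same per-1M-token unit as the input price.
PRICES = {
    "trinity-large-thinking": {"input": 0.22, "output": 0.85},
    "@cf/deepseek-ai/deepseek-r1-distill-qwen-32b": {"input": 0.00, "output": 0.00},
}

# ASSUMPTION: hypothetical monthly volumes in millions of tokens (input, output).
# They are not given in the source; they merely reproduce the listed costs.
WORKLOADS = {
    "Chat assistant": (300, 120),
    "RAG / long context": (1200, 90),
    "Agent / tool use": (720, 360),
}


def monthly_cost(model: str, input_mtok: float, output_mtok: float) -> float:
    """Return the monthly cost in USD for the given token volumes (in millions)."""
    price = PRICES[model]
    total = input_mtok * price["input"] + output_mtok * price["output"]
    return round(total, 2)


def cost_table() -> dict:
    """Map each workload to a {model: monthly cost} dictionary."""
    return {
        workload: {model: monthly_cost(model, *volumes) for model in PRICES}
        for workload, volumes in WORKLOADS.items()
    }


if __name__ == "__main__":
    for workload, costs in cost_table().items():
        for model, cost in costs.items():
            print(f"{workload:20s} {model:45s} ${cost:,.2f}")
```

Swapping in your own monthly volumes is the more useful exercise: the ratio of input to output tokens changes the total considerably, because the output price is nearly four times the input price.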

Efficiency score: trinity-large-thinking

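trinity-large-thinking leads on speed and reliability, with higher throughput (147 vs 122 tok/s), higher availability (99.1% vs 97.7%) and a larger context window (262K vs 128K), and it takes the overall efficiency score. @cf/deepseek-ai/deepseek-r1-distill-qwen-32b is the cheaper of the two, listed at $0.00 for both input and output. The right pick depends on your exact mix of input, output and latency needs.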

Figures are illustrative demo data, not financial advice.

Frequently asked questions

Is trinity-large-thinking or @cf/deepseek-ai/deepseek-r1-distill-qwen-32b cheaper?

@cf/deepseek-ai/deepseek-r1-distill-qwen-32b is cheaper. Its input price is $0.00 vs $0.22 per 1M tokens and its output price is $0.00 vs $0.85, so it is the more cost-effective of the two for any blend of input and output tokens.
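To make "blended" concrete, a blended price can be taken as the average of the input and output prices, weighted by how a workload's tokens split between the two. The function below is a minimal sketch of that idea. The name `blended_price` and the 75% input share are illustrative assumptions, not values from the comparison.

```python
def blended_price(input_price: float, output_price: float, input_share: float) -> float:
    """Weighted average price per 1M tokens.

    input_share is the fraction of all tokens that are input tokens (0.0 to 1.0).
    """
    if not 0.0 <= input_share <= 1.0:
        raise ValueError("input_share must be between 0 and 1")
    return input_share * input_price + (1.0 - input_share) * output_price


if __name__ == "__main__":
    # ASSUMPTION: a hypothetical workload where 75% of tokens are input tokens.
    share = 0.75
    trinity = blended_price(0.22, 0.85, share)
    deepseek = blended_price(0.00, 0.00, share)
    print(f"trinity-large-thinking: ${trinity:.4f} per 1M tokens")
    print(f"@cf/deepseek-ai/deepseek-r1-distill-qwen-32b: ${deepseek:.4f} per 1M tokens")
```

Because both listed prices for @cf/deepseek-ai/deepseek-r1-distill-qwen-32b are lower, its blended price is lower at every input share; the weighting only matters when each model is cheaper on one side.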

Which should I choose, trinity-large-thinking or @cf/deepseek-ai/deepseek-r1-distill-qwen-32b?

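trinity-large-thinking offers the stronger overall balance for most workloads, with higher throughput and availability. If price is the deciding factor, @cf/deepseek-ai/deepseek-r1-distill-qwen-32b is the cheaper option. The right pick depends on your exact mix of input, output and latency needs.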