DeepSeek's thinking mode is unlike anything from other providers. When enabled, the model outputs reasoning_content tokens — its chain-of-thought reasoning — alongside the final content. These tokens are visible, billable at output rates, and must be managed deliberately across conversation turns. This is fundamentally different from Claude's invisible extended thinking stream, where reasoning is hidden unless you explicitly access it.

The visibility of reasoning_content is both a superpower and a constraint. It's a superpower because you can debug the model's reasoning directly, understand where it went wrong, and use the reasoning as a quality signal. It's a constraint because you must decide whether to pass reasoning back to the API in subsequent turns — get this wrong and you'll get 400 errors or degraded multi-turn performance.

Note:

Key difference from Claude: DeepSeek's reasoning_content is always accessible. Claude's thinking stream requires special API handling. DeepSeek disables temperature and top_p in thinking mode — reasoning behavior is controlled solely by reasoning_effort.

DeepSeek Thinking Mode: Reasoning Tokens & Effort Control

What You'll Find Here

Thinking Mode Guide

Reasoning Effort Control

Multi-Turn Reasoning

Related Articles

Retro Anime SREF Codes for Midjourney

Gemini 1M Token Strategies: Context Placement & Retrieval

Gemini Video Processing: Summarization & Scene Analysis

On this page