Last verified
TEXT REASONING256K CONTEXT196B TOTAL11B ACTIVETOOLS

Step 3.5 Flash API Pricing

StepFun lists Step 3.5 Flash as a 196B-total / 11B-active sparse MoE language-reasoning model with 256K context. The official CNY price converts to $0.103/M input, $0.309/M output, and $0.0206/M cached input.

Input - per 1M tokens
$0.10/M
Source StepFun CNY cache miss
Output - per 1M tokens
$0.31/M
Context 256K
Cached input - per 1M tokens
$0.02/M
Prompt cache hit row 80% off
Effective - agentic blend
$0.06/M
92/8 split - 82% cache
§ 01 / TERMINAL

Run the numbers.

Calculator pre-loaded with StepFun's official Step 3.5 Flash CNY rates converted to USD. This is the text-reasoning Flash tier in the current StepFun docs.

$ /mo
Workload split
Prompt cache hit rate
Tokens you can process
Words equivalent (English)
Effective rate
Open full calculator (all models · share URL · CSV) →
§ 02 / SCENARIOS

Real-world presets.

§ 03 / TOKENIZER

Paste text. See tokens. See cost.

Estimate · stepfun-tokenizer-estimate · ≈3.85 chars/token Auto-counts as you type

This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Characters 271
Words 45
Tokens (estimated) 70 tokens
Cost as input · uncached $0.000007 USD
Cost as output · uncached $0.000022 USD
Cost as cached input $0.000001 USD
§ 04 / SHELF

Up against the shelf.

All models →
Model Input /M Output /M Effective blended Context Best for
Step 3.7 Flash $0.20 cache $0.04 $1.19 $0.16 CNY converted 256K StepFun flagship multimodal agents
Step 3.5 Flash Current $0.10 cache $0.02 $0.31 $0.06 CNY converted 256K Cheap StepFun text reasoning
MiMo-V2.5 $0.14 cache $0.00 $0.28 $0.05 verified Xiaomi row 1M Very low-cost 1M-context MiMo row
MiMo-V2.5-Pro $0.43 cache $0.00 $0.87 $0.14 verified Xiaomi row 1M MiMo Pro reasoning row
GLM-4.7 $0.60 cache $0.11 $2.20 $0.36 verified sibling 200K Coding agent backend
DeepSeek V4 Pro $0.43 cache $0.00 $0.87 $0.14 verified sibling 1M Low-cost reasoning workloads

Frequently asked.

Practical questions about Step 3.5 Flash pricing, cache hits, context size, and low-cost StepFun reasoning workloads.

Q · 01 What is Step 3.5 Flash priced at? +
StepFun's pricing docs list step-3-5-flash at 0.7 CNY/M input, 0.14 CNY/M cached input, and 2.1 CNY/M output. Converted at 1 CNY = 0.147135 USD, that is $0.103/M input, $0.0206/M cached input, and $0.309/M output.
Q · 02 How is the effective price calculated? +
The headline effective tile uses the site standard agentic blend: 92% input, 8% output, and 82% input cache hits. For Step 3.5 Flash, that lands at about $0.06/M blended tokens.
Q · 03 What context window does Step 3.5 Flash support? +
StepFun's model docs list 256K context for step-3-5-flash. The model page also lists a sparse MoE architecture with 196B total / 11B active.
Q · 04 Does this page model media/audio/image pricing? +
No. StepFun also has vision, image, and audio model rows. This page covers the text-token pricing for the listed reasoning model, not image generation or speech pricing.
Q · 05 When was this price last checked? +
This page was verified against platform.stepfun.com on 2026-07-01.