Back to Home

Supported Models

Tokensmart supports nearly every major LLM on the market. See the models page or pricing page for the complete list and real-time rates.

Major providers

OpenAI

  • gpt-5.5 (flagship)
  • gpt-5.4, gpt-5.4-mini
  • gpt-image-2 (image generation, per-call billing)

Anthropic

  • claude-fable-5 (Mythos-class, newest and strongest)
  • claude-opus-4-8 (Opus flagship)
  • claude-opus-4-7, claude-opus-4-6
  • claude-sonnet-4-6 (best value)
  • claude-haiku-4-5-20251001 (fastest)

Google

  • gemini-3-pro-preview (flagship)
  • gemini-3.5-flash
  • gemini-2.5-pro, gemini-2.5-pro-thinking

DeepSeek

  • deepseek-v4-pro (flagship: code / math / complex reasoning)
  • deepseek-v4-flash (economy: faster, cheaper, high-concurrency)

Chinese models

  • qwen3.5-plus, qwen3.5-flash
  • qwen3-max, qwen3-max-preview
  • qwen3-vl-plus (vision)

Image generation (per-image billing)

  • gpt-image-2 — $0.02 per image

⚠️ Image models use dedicated endpoints — do NOT send them to /v1/chat/completions. See Image Generation API, Image Edits API, and Image Compose API.

Switching models

Just change the model field — no SDK swap, no new API key:

# Today: GPT
resp = client.chat.completions.create(model="gpt-5.5", messages=[...])

# Tomorrow: try Claude
resp = client.chat.completions.create(model="claude-sonnet-4-6", messages=[...])

New model launches

We track every provider's releases. New models typically land on Tokensmart within 24-48 hours of launch. Watch the changelog for updates.