Popular Models
This page provides detailed information about the most popular AI models supported by Pydantic AI, including pricing, specifications, and available inference providers. Each model card lists the shorthand (ID) that can be passed to the Agent class to use that model and provider combination:
# Via the Gateway
from pydantic_ai import Agent
agent = Agent('gateway/anthropic:claude-opus-4-5')

# Directly via the provider
from pydantic_ai import Agent
agent = Agent('anthropic:claude-opus-4-5')
All popular models can be used via the Gateway.
For an overview of all supported providers, see Models and Providers.
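The IDs on this page follow a visible pattern: an optional `gateway/` prefix, a provider name, a colon, and the provider's own model name. The sketch below splits an ID into those parts. `ModelRef` and `split_model_id` are hypothetical helpers written for illustration; they are not part of Pydantic AI, and the pattern is inferred only from the IDs listed on this page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelRef:
    """Parts of a model ID as they appear on this page (hypothetical type)."""
    via_gateway: bool
    provider: str
    model_name: str


def split_model_id(model_id: str) -> ModelRef:
    """Split 'gateway/provider:model' or 'provider:model' into its parts.

    Assumption: the first colon separates provider from model name, so
    model names that contain colons themselves (as the Bedrock IDs do)
    stay intact.
    """
    prefix = 'gateway/'
    via_gateway = model_id.startswith(prefix)
    rest = model_id[len(prefix):] if via_gateway else model_id
    provider, sep, model_name = rest.partition(':')
    if not sep or not provider or not model_name:
        raise ValueError(f'expected "provider:model", got {model_id!r}')
    return ModelRef(via_gateway, provider, model_name)
```

For example, `split_model_id('gateway/anthropic:claude-opus-4-5')` yields a `ModelRef` with `via_gateway=True`, provider `anthropic`, and model name `claude-opus-4-5`.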
About Modalities
Unless otherwise specified, "Modalities" refers to the types of data a model can accept as input. The output modality is Text for all models unless noted otherwise.
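As a rough illustration of how the Modalities field can be used, the sketch below records the input modalities listed on the model cards on this page and filters by one of them. The `INPUT_MODALITIES` table and the `models_accepting` helper are hypothetical names for this example, not a Pydantic AI API; the values are transcribed from the cards and would need updating if the cards change.

```python
# Input modalities as listed on the model cards on this page.
INPUT_MODALITIES: dict[str, set[str]] = {
    'anthropic:claude-opus-4-5': {'Text'},
    'anthropic:claude-sonnet-4-5': {'Text'},
    'anthropic:claude-haiku-4-5': {'Text'},
    'google-gla:gemini-3-pro-preview': {'Text', 'Vision'},
    'google-gla:gemini-3-flash-preview': {'Text', 'Audio'},
    'google-gla:gemini-2.5-flash-lite': {'Text', 'Audio'},
    'openai:gpt-5.2-pro': {'Text'},
    'openai:gpt-5.2': {'Text'},
    'grok:grok-4': {'Text', 'Vision'},
    'grok:grok-4-1-fast': {'Text', 'Vision'},
}


def models_accepting(modality: str) -> list[str]:
    """Return the IDs whose listed input modalities include `modality`."""
    return sorted(
        model_id
        for model_id, modalities in INPUT_MODALITIES.items()
        if modality in modalities
    )


vision_models = models_accepting('Vision')
audio_models = models_accepting('Audio')
```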
Anthropic Claude
Claude Opus 4.5
ID: anthropic:claude-opus-4-5
Anthropic's most intelligent model for complex reasoning and research.
Specifications:
- Context: 200K tokens
- Modalities: Text
- Release: November 2025
Performance:
- Cost
- Speed
- Intelligence
Alternative Providers:
- Gateway: 'gateway/anthropic:claude-opus-4-5'
- AWS Bedrock: 'bedrock:us.anthropic.claude-opus-4-5-20251101-v1:0'
- OpenRouter: 'openrouter:anthropic/claude-opus-4.5'
Pricing: $5.00 / $25.00 per 1M tokens (in/out)
Claude Sonnet 4.5
ID: anthropic:claude-sonnet-4-5
Best balance of speed, cost, and capability. Ideal for agents and coding.
Specifications:
- Context: 1M tokens
- Modalities: Text
- Release: September 2025
Performance:
- Cost
- Speed
- Intelligence
Alternative Providers:
- Gateway: 'gateway/anthropic:claude-sonnet-4-5'
- AWS Bedrock: 'bedrock:us.anthropic.claude-sonnet-4-5-20250929-v1:0'
- OpenRouter: 'openrouter:anthropic/claude-sonnet-4.5'
Pricing: $3.00-6.00 / $15.00-22.50 per 1M tokens (in/out)
Claude Haiku 4.5
ID: anthropic:claude-haiku-4-5
Fastest and most cost-effective. Ideal for high-volume tasks.
Specifications:
- Context: 200K tokens
- Modalities: Text
- Release: October 2025
Performance:
- Cost
- Speed
- Intelligence
Alternative Providers:
- Gateway: 'gateway/anthropic:claude-haiku-4-5'
- AWS Bedrock: 'bedrock:us.anthropic.claude-haiku-4-5-20251001-v1:0'
- OpenRouter: 'openrouter:anthropic/claude-haiku-4.5'
Pricing: $1.00 / $5.00 per 1M tokens (in/out)
Google Gemini
Gemini 3 Pro
ID: google-gla:gemini-3-pro-preview
Google's most capable model for complex multimodal tasks.
Specifications:
- Context: 1M tokens
- Modalities: Text, Vision
- Release: Preview 2025
Performance:
- Cost
- Speed
- Intelligence
Alternative Providers:
- Gateway: 'gateway/google-gla:gemini-3-pro-preview'
- Vertex AI: 'google-vertex:gemini-3-pro-preview'
- OpenRouter: 'openrouter:google/gemini-3-pro-preview'
Pricing: $2.00-4.00 / $12.00-18.00 per 1M tokens (in/out)
Gemini 3 Flash
ID: google-gla:gemini-3-flash-preview
Fast and efficient with excellent performance-to-cost ratio.
Specifications:
- Context: 1M tokens
- Modalities: Text, Audio
- Release: Preview 2025
Performance:
- Cost
- Speed
- Intelligence
Alternative Providers:
- Gateway: 'gateway/google-gla:gemini-3-flash-preview'
- Vertex AI: 'google-vertex:gemini-3-flash-preview'
- OpenRouter: 'openrouter:google/gemini-3-flash-preview'
Pricing: $0.50 / $3.00 per 1M tokens (in/out)
Gemini 2.5 Flash Lite
ID: google-gla:gemini-2.5-flash-lite
Most cost-effective with ultra-low latency for high-volume apps.
Specifications:
- Context: 1M tokens
- Modalities: Text, Audio
- Release: 2025
Performance:
- Cost
- Speed
- Intelligence
Alternative Providers:
- Gateway: 'gateway/google-gla:gemini-2.5-flash-lite'
- Vertex AI: 'google-vertex:gemini-2.5-flash-lite'
- OpenRouter: 'openrouter:google/gemini-2.5-flash-lite'
Pricing: $0.10 / $0.40 per 1M tokens (in/out)
OpenAI GPT
GPT-5.2 Pro
ID: openai:gpt-5.2-pro
Most intelligent reasoning model for complex tasks.
Specifications:
- Context: 400K tokens
- Modalities: Text
- Release: December 2025
Performance:
- Cost
- Speed
- Intelligence
Alternative Providers:
- Gateway: 'gateway/openai:gpt-5.2-pro'
- Azure: 'azure:gpt-5.2-pro'
- OpenRouter: 'openrouter:openai/gpt-5.2-pro'
Pricing: $21.00 / $168.00 per 1M tokens (in/out)
GPT-5.2
ID: openai:gpt-5.2
Flagship general-purpose model with excellent balance.
Specifications:
- Context: 400K tokens
- Modalities: Text
- Release: December 2025
Performance:
- Cost
- Speed
- Intelligence
Alternative Providers:
- Gateway: 'gateway/openai:gpt-5.2'
- Azure: 'azure:gpt-5.2'
- OpenRouter: 'openrouter:openai/gpt-5.2'
Pricing: $1.75 / $14.00 per 1M tokens (in/out)
xAI Grok
Grok 4
ID: grok:grok-4
Flagship reasoning model with excellent math and reasoning.
Specifications:
- Context: 256K tokens
- Modalities: Text, Vision
- Release: July 2025
Performance:
- Cost
- Speed
- Intelligence
Alternative Providers:
- Gateway: 'gateway/grok:grok-4'
- OpenRouter: 'openrouter:x-ai/grok-4'
Pricing: $3.00 / $15.00 per 1M tokens (in/out)
Grok 4.1 Fast
ID: grok:grok-4-1-fast
Optimized for agentic tool calling with massive 2M context.
Specifications:
- Context: 2M tokens
- Modalities: Text, Vision
- Release: 2025
Performance:
- Cost
- Speed
- Intelligence
Alternative Providers:
- Gateway: 'gateway/grok:grok-4-1-fast'
- OpenRouter: 'openrouter:x-ai/grok-4-1-fast'
Pricing: $0.20 / $0.50 per 1M tokens (in/out)
Performance Ratings: Cost, Speed, and Intelligence are each rated on a scale from lowest to highest. Cost is based on $/1M tokens, Speed on tokens/sec, and Intelligence on benchmarks.
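To make the per-1M-token prices concrete, the sketch below estimates the cost of a single request from its token counts. `estimate_cost_usd` is a hypothetical helper, not part of Pydantic AI or genai-prices. It assumes one flat input price and one flat output price, so it does not model the price ranges shown for some models, nor caching or batch discounts.

```python
def estimate_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_price_per_m: float,
    output_price_per_m: float,
) -> float:
    """Estimate request cost in USD from flat per-1M-token prices."""
    total = input_tokens * input_price_per_m + output_tokens * output_price_per_m
    return total / 1_000_000


# A request with 20,000 input tokens and 2,000 output tokens,
# priced with the figures listed on this page.
opus_cost = estimate_cost_usd(20_000, 2_000, 5.00, 25.00)   # Claude Opus 4.5
haiku_cost = estimate_cost_usd(20_000, 2_000, 1.00, 5.00)   # Claude Haiku 4.5

print(f'Opus 4.5:  ${opus_cost:.2f}')
print(f'Haiku 4.5: ${haiku_cost:.2f}')
```

With these inputs the Opus 4.5 request comes to about $0.15 and the Haiku 4.5 request to about $0.03, the same 5x ratio as the listed prices.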
Sources: OpenRouter Rankings, Artificial Analysis, pydantic/genai-prices