Models

meinGPT brings together leading language models (LLMs) from multiple providers under a single interface. For each chat, you choose the model that fits the task — based on the work to be done, the speed you need, and your data protection requirements.

Indicators in the model picker

The model picker shows several indicators to help you decide:

EU flag — the model is hosted in the EU and is GDPR-compliant without further measures.
US flag / globe — the model runs outside the EU.
Quality and speed indicators — visible on hover, each rated 1–5.
Reasoning tag — marks models that use an explicit thinking step before answering.
Preview tag — the model is still in a preview phase (functionality and behavior may change).
Free indicator — the model is not billed and is always available (currently GPT-5 Nano).
Best model — heuristic auto-selection; defaults to EU-hosted models only.

Data protection: EU-hosted models process all data within the EU. For assistants, you must set the model explicitly — an EU model is not chosen automatically.

Recommendations for everyday use

Use case	Recommendation
Standard chat (default)	GPT-5.1, GPT-5.5, Claude Sonnet 4.6
Deep reasoning, strategy, audits	GPT-5 Thinking, o3, Gemini 2.5 Pro
Coding and refactoring	Claude Sonnet 4.6, Claude Opus 4.8, GPT-5.4
Very long documents / 1M context	GPT-5.4, GPT-5.5, Claude Opus 4.8, Gemini 2.5 Pro
High volume, high throughput, low cost	GPT-5 mini, GPT-5 Nano (free), Gemini 2.5 Flash
Web research with source citations	Perplexity Online, Perplexity Deep Research
Default for assistants with tool calls	o3, GPT-5 Thinking

When in doubt, start with a regular (non-reasoning) model. Only switch to a reasoning model if the answers aren't good enough — reasoning consumes noticeably more credits.

Reasoning models vs. standard LLMs

Reasoning models perform a visible thinking step before producing the actual answer. This delivers significantly better results on multi-step or logic-heavy tasks — at the cost of speed and credits.

Task type	Standard LLM	Reasoning model
Simple text, emails, summaries	Yes	No (too expensive)
Creative writing	Yes	No
Complex logic, mathematics, formal analysis	No	Yes
Multi-step planning	Sometimes	Yes
Coding (simple)	Yes	No
Coding (complex / architectural)	Sometimes	Yes
Compliance analysis, audits	No	Yes

OpenAI

OpenAI models mostly run via Azure OpenAI Service — for EU hosting, exclusively in EU data centers (Sweden Central). Individual US variants go directly via OpenAI. Data is not used for training.

Model	Hosting	Reasoning	Context	Good for
GPT-4o Mini	EU	—	128k	Classification, tagging, simple summaries, FAQ bots
GPT-4o	EU	—	128k	General assistant, coding, multimodal, multilingual
GPT-4.1	EU	—	1M	Knowledge work, consistent output, tool use, long documents
o4-mini	EU	✓	200k	Efficient reasoning tasks, many requests with a logic component
o3	EU	✓	200k	Structured reasoning, multi-step workflows, tool calls in assistants
o3-pro	US	✓	200k	Highly demanding logic, formal analysis, compliance, audits
GPT-5	EU	—	400k	High-quality text, code, agents, long contexts
GPT-5 Thinking	EU	✓	400k	Difficult logic, planning, mathematics, deep-dive analysis
GPT-5 mini	EU	—	400k	Many requests, simple tasks, classification, pipelines
GPT-5 Nano	EU	—	400k	Free for basic conversations and highly cost-sensitive bulk processing
GPT-5.1	EU	—	400k	Standard model for business and development, agents, planning
GPT-5.2	EU	✓ Coding	400k	Coding-heavy workflows that need reasoning (EU)
GPT-5.2 (US)	US	✓ Coding	400k	Same class as GPT-5.2, US region
GPT-5.3 Chat	US	✓ Coding	128k	Chat-oriented variant with reasoning and coding focus
GPT-5.4	EU	✓ Coding	1M	Very long contexts, heavy reasoning and coding tasks (EU)
GPT-5.4 Pro	US	✓ Coding	1M	Maximum reasoning + 1M context for the most demanding workloads
GPT-5.5	EU	✓ Coding	1M	Current EU flagship: reasoning, coding, long contexts

Anthropic (Claude)

Claude models run via Google Vertex. Anthropic pioneered the Artifacts concept — accordingly, Claude models produce especially clean artifact output (interactive documents, diagrams, code snippets).

Model	Hosting	Reasoning	Context	Good for
Claude Haiku 4.5	EU	—	200k	Fast, cheap Anthropic responses for simple tasks
Claude Sonnet 4	EU	—	200k	Advanced coding, good balance of intelligence and speed
Claude Sonnet 4.5	EU	—	200k	Production coding, complex agents, multi-tool workflows
Claude Sonnet 4.6	EU	—	200k	Efficient Sonnet all-rounder with high speed
Claude Opus 4.1	US	—	200k	Agentic tasks, large refactorings, software engineering
Claude Opus 4.5	EU	—	200k	Opus class with EU hosting for high-quality requirements
Claude Opus 4.6	EU	—	200k	Current EU Opus for the most demanding tasks
Claude Opus 4.7	EU	—	1M	Opus with 1M context for very long documents and codebases
Claude Opus 4.8	EU	—	1M	Current Opus with 1M context, sharper agentic coding and stronger reasoning

Google (Gemini)

Gemini models run via Google Vertex and stand out with the largest context windows (1M+ tokens). Ideal for long documents, entire knowledge bases, or extensive file uploads.

Model	Hosting	Reasoning	Context	Good for
Gemini 2.5 Flash	EU	✓	1M	Bulk processing, fast APIs, image analysis
Gemini 2.5 Pro	EU	✓	1M	Complex code projects, STEM, large datasets, agentic workflows
Gemini 3 Flash (Preview)	US	✓	1M	Next-generation Flash — preview
Gemini 3.5 Flash	EU	✓	1M	Latest Flash, combines reasoning at Pro level with high speed, multimodal input (text, image, audio, video)
Gemini 3.1 Pro (Preview)	US	✓	1M	Multimodal reasoning, long contexts, agentic workflows — preview

Mistral

Mistral models are hosted in European data centers and offer a strong EU-native alternative for business applications.

Model	Hosting	Reasoning	Context	Good for
Mistral Medium 3	EU	—	128k	Enterprise deployments, multimodal analysis, long documents, coding
Mistral Medium 3.5	EU	—	256k	Latest Mistral Medium generation with larger context
Magistral Medium 1.2	EU	✓	128k	EU reasoning alternative for logic and analysis tasks
CodeStral	EU	— Coding	256k	IDE code completion, fill-in-the-middle, code tests

DeepSeek

DeepSeek models run via Azure AI Foundry and offer cost-efficient reasoning and coding options.

Model	Hosting	Reasoning	Context	Good for
DeepSeek-R1	Global	✓	128k	Mathematical proofs, chain-of-thought, logic puzzles
DeepSeek-V3.2	Global	— Coding	128k	Cost-efficient batch processing, coding (Python/C++), multilingual

Other models

Model	Provider	Hosting	Reasoning	Context	Good for
Llama 3.3	Meta (hosted on Nebius)	EU	—	128k	Enterprise RAG, tool use, JSON output, on-premise scenarios
GPT-OSS 120B	OpenAI open weights (hosted on Nebius)	EU	✓	128k	Self-hosting, RAG, on-prem agents
Kimi K2.5	Moonshot (hosted on Azure AI Foundry)	Global	✓ Coding	131k	Reasoning and coding tasks with large context
Kimi K2.6	Moonshot (hosted on Azure AI Foundry Sweden)	Global	✓ Coding	256k	Successor to K2.5 with doubled context window
Perplexity Online	Perplexity	US	—	128k	Web research, source citations, fact-checking
Perplexity Deep Research	Perplexity	US	—	128k	Deeper web research with multiple search steps and summaries

Image generation

Image models (FLUX family, GPT Image, Gemini image models, Recraft) are documented separately under Image model management — including hosting region, premium status, and recommendations.

Enabling or disabling models

Which models your users can choose from in the workspace is controlled by you as an admin in the settings. Details: Workspace configuration.

Image models — image generation models and their use cases
Workspace configuration — control which models are enabled per workspace
Privacy — GDPR compliance, EU hosting, provider overview
Pricing — token prices per model

Models

On this page