WebsitePlatform Login

Models

The language models available in meinGPT, their strengths, and recommendations for usage

meinGPT brings together leading language models (LLMs) from multiple providers under a single interface. For each chat, you choose the model that fits the task β€” based on the work to be done, the speed you need, and your data protection requirements.

Indicators in the model picker

The model picker shows several indicators to help you decide:

  • EU flag β€” the model is hosted in the EU and is GDPR-compliant without further measures.
  • US flag / globe β€” the model runs outside the EU.
  • Quality and speed indicators β€” visible on hover, each rated 1–5.
  • Reasoning tag β€” marks models that use an explicit thinking step before answering.
  • Preview tag β€” the model is still in a preview phase (functionality and behavior may change).
  • Free indicator β€” the model is not billed and is always available (currently GPT-5 Nano).
  • Best model β€” heuristic auto-selection; defaults to EU-hosted models only.

Data protection: EU-hosted models process all data within the EU. For assistants, you must set the model explicitly β€” an EU model is not chosen automatically.

Recommendations for everyday use

Use caseRecommendation
Standard chat (default)GPT-5.1, GPT-5.5, Claude Sonnet 4.6
Deep reasoning, strategy, auditsGPT-5 Thinking, o3, Gemini 2.5 Pro
Coding and refactoringClaude Sonnet 4.6, Claude Opus 4.8, GPT-5.4
Very long documents / 1M contextGPT-5.4, GPT-5.5, Claude Opus 4.8, Gemini 2.5 Pro
High volume, high throughput, low costGPT-5 mini, GPT-5 Nano (free), Gemini 2.5 Flash
Web research with source citationsPerplexity Online, Perplexity Deep Research
Default for assistants with tool callso3, GPT-5 Thinking

When in doubt, start with a regular (non-reasoning) model. Only switch to a reasoning model if the answers aren't good enough β€” reasoning consumes noticeably more credits.

Reasoning models vs. standard LLMs

Reasoning models perform a visible thinking step before producing the actual answer. This delivers significantly better results on multi-step or logic-heavy tasks β€” at the cost of speed and credits.

Task typeStandard LLMReasoning model
Simple text, emails, summariesYesNo (too expensive)
Creative writingYesNo
Complex logic, mathematics, formal analysisNoYes
Multi-step planningSometimesYes
Coding (simple)YesNo
Coding (complex / architectural)SometimesYes
Compliance analysis, auditsNoYes

OpenAI

OpenAI models mostly run via Azure OpenAI Service β€” for EU hosting, exclusively in EU data centers (Sweden Central). Individual US variants go directly via OpenAI. Data is not used for training.

ModelHostingReasoningContextGood for
GPT-4o MiniEUβ€”128kClassification, tagging, simple summaries, FAQ bots
GPT-4oEUβ€”128kGeneral assistant, coding, multimodal, multilingual
GPT-4.1EUβ€”1MKnowledge work, consistent output, tool use, long documents
o4-miniEUβœ“200kEfficient reasoning tasks, many requests with a logic component
o3EUβœ“200kStructured reasoning, multi-step workflows, tool calls in assistants
o3-proUSβœ“200kHighly demanding logic, formal analysis, compliance, audits
GPT-5EUβ€”400kHigh-quality text, code, agents, long contexts
GPT-5 ThinkingEUβœ“400kDifficult logic, planning, mathematics, deep-dive analysis
GPT-5 miniEUβ€”400kMany requests, simple tasks, classification, pipelines
GPT-5 NanoEUβ€”400kFree for basic conversations and highly cost-sensitive bulk processing
GPT-5.1EUβ€”400kStandard model for business and development, agents, planning
GPT-5.2EUβœ“ Coding400kCoding-heavy workflows that need reasoning (EU)
GPT-5.2 (US)USβœ“ Coding400kSame class as GPT-5.2, US region
GPT-5.3 ChatUSβœ“ Coding128kChat-oriented variant with reasoning and coding focus
GPT-5.4EUβœ“ Coding1MVery long contexts, heavy reasoning and coding tasks (EU)
GPT-5.4 ProUSβœ“ Coding1MMaximum reasoning + 1M context for the most demanding workloads
GPT-5.5EUβœ“ Coding1MCurrent EU flagship: reasoning, coding, long contexts

Anthropic (Claude)

Claude models run via Google Vertex. Anthropic pioneered the Artifacts concept β€” accordingly, Claude models produce especially clean artifact output (interactive documents, diagrams, code snippets).

ModelHostingReasoningContextGood for
Claude Haiku 4.5EUβ€”200kFast, cheap Anthropic responses for simple tasks
Claude Sonnet 4EUβ€”200kAdvanced coding, good balance of intelligence and speed
Claude Sonnet 4.5EUβ€”200kProduction coding, complex agents, multi-tool workflows
Claude Sonnet 4.6EUβ€”200kEfficient Sonnet all-rounder with high speed
Claude Opus 4.1USβ€”200kAgentic tasks, large refactorings, software engineering
Claude Opus 4.5EUβ€”200kOpus class with EU hosting for high-quality requirements
Claude Opus 4.6EUβ€”200kCurrent EU Opus for the most demanding tasks
Claude Opus 4.7EUβ€”1MOpus with 1M context for very long documents and codebases
Claude Opus 4.8EUβ€”1MCurrent Opus with 1M context, sharper agentic coding and stronger reasoning

Google (Gemini)

Gemini models run via Google Vertex and stand out with the largest context windows (1M+ tokens). Ideal for long documents, entire knowledge bases, or extensive file uploads.

ModelHostingReasoningContextGood for
Gemini 2.5 FlashEUβœ“1MBulk processing, fast APIs, image analysis
Gemini 2.5 ProEUβœ“1MComplex code projects, STEM, large datasets, agentic workflows
Gemini 3 Flash (Preview)USβœ“1MNext-generation Flash β€” preview
Gemini 3.5 FlashEUβœ“1MLatest Flash, combines reasoning at Pro level with high speed, multimodal input (text, image, audio, video)
Gemini 3.1 Pro (Preview)USβœ“1MMultimodal reasoning, long contexts, agentic workflows β€” preview

Mistral

Mistral models are hosted in European data centers and offer a strong EU-native alternative for business applications.

ModelHostingReasoningContextGood for
Mistral Medium 3EUβ€”128kEnterprise deployments, multimodal analysis, long documents, coding
Mistral Medium 3.5EUβ€”256kLatest Mistral Medium generation with larger context
Magistral Medium 1.2EUβœ“128kEU reasoning alternative for logic and analysis tasks
CodeStralEUβ€” Coding256kIDE code completion, fill-in-the-middle, code tests

DeepSeek

DeepSeek models run via Azure AI Foundry and offer cost-efficient reasoning and coding options.

ModelHostingReasoningContextGood for
DeepSeek-R1Globalβœ“128kMathematical proofs, chain-of-thought, logic puzzles
DeepSeek-V3.2Globalβ€” Coding128kCost-efficient batch processing, coding (Python/C++), multilingual

Other models

ModelProviderHostingReasoningContextGood for
Llama 3.3Meta (hosted on Nebius)EUβ€”128kEnterprise RAG, tool use, JSON output, on-premise scenarios
GPT-OSS 120BOpenAI open weights (hosted on Nebius)EUβœ“128kSelf-hosting, RAG, on-prem agents
Kimi K2.5Moonshot (hosted on Azure AI Foundry)Globalβœ“ Coding131kReasoning and coding tasks with large context
Kimi K2.6Moonshot (hosted on Azure AI Foundry Sweden)Globalβœ“ Coding256kSuccessor to K2.5 with doubled context window
Perplexity OnlinePerplexityUSβ€”128kWeb research, source citations, fact-checking
Perplexity Deep ResearchPerplexityUSβ€”128kDeeper web research with multiple search steps and summaries

Image generation

Image models (FLUX family, GPT Image, Gemini image models, Recraft) are documented separately under Image model management β€” including hosting region, premium status, and recommendations.

Enabling or disabling models

Which models your users can choose from in the workspace is controlled by you as an admin in the settings. Details: Workspace configuration.

On this page