AI Providers and Models

The Paquet Builder AI Assistant supports multiple AI providers — both cloud-based services and locally hosted models. This page helps you choose the right provider and model for your needs.

| Provider | Recommended Model | Function Calling | Best For | Notes |
| --- | --- | --- | --- | --- |
| OpenAI | gpt-5.4-mini | Excellent | Fast tasks, affordable usage | Reasoning effort option for o-series models |
| Anthropic (Claude) | claude-sonnet-4-6 | Excellent | Complex multi-step tasks | Best accuracy for tool calling |
| Anthropic (Claude) | claude-opus-4-6 | Excellent | Most demanding tasks | Premium, highest capability |
| Google (Gemini) | gemini-2.5-flash | Good | Budget-conscious usage | Free API tier available for low volume |
| Grok (xAI) | | Good | Alternative provider | |
| Mistral | | Good | European-hosted provider | EU data residency |
| DeepSeek | deepseek-chat | Good | Very affordable | Competitive pricing |

Run AI models on your own machine — no API key, no internet, no data leaves your computer.

| Software | Provider Setting | Default Port | Notes |
| --- | --- | --- | --- |
| Ollama | Ollama | 11434 | Native integration, easiest setup |
| LM Studio | LlamaCpp | 1234 | User-friendly model browser |
| llama.cpp server | LlamaCpp | 8080 | Lightweight, for advanced users |

See Using Local AI Models for step-by-step setup instructions.
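
Before selecting a local provider, it can help to confirm that the server is actually listening on its default port. The following is a minimal sketch, not part of Paquet Builder: the port numbers come from the table above, while the `localhost` host and the helper names `local_base_url` and `is_listening` are assumptions made for illustration.

```python
import socket

# Default ports from the table above. The host is an assumption:
# a server running on the same machine as Paquet Builder.
DEFAULT_PORTS = {
    "Ollama": 11434,
    "LM Studio": 1234,
    "llama.cpp server": 8080,
}


def local_base_url(software: str, host: str = "localhost") -> str:
    """Build a base URL for a local server from its default port (hypothetical helper)."""
    return f"http://{host}:{DEFAULT_PORTS[software]}"


def is_listening(port: int, host: str = "localhost", timeout: float = 0.5) -> bool:
    """Return True if something accepts TCP connections on host:port."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and unreachable hosts.
        return False


if __name__ == "__main__":
    for name, port in DEFAULT_PORTS.items():
        status = "reachable" if is_listening(port) else "not reachable"
        print(f"{name}: {local_base_url(name)} is {status}")
```

A successful TCP connection only shows that some process is bound to the port. It does not confirm that a model is loaded or that the server was started with its default settings.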

For changing settings, adding files, or querying project state, use a fast, affordable model:

  • GPT 5.4-mini (OpenAI) — Best speed-to-quality ratio
  • Gemini 2.5 Flash (Google) — Free tier, good for experimentation

Usage costs depend on the provider and model you choose. Most cloud providers offer pay-as-you-go pricing based on the number of tokens processed, where a token is a word or a fragment of a word.
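
Token-based pricing can be sketched as simple arithmetic. The example below is an illustration only: it assumes the provider bills input and output tokens at separate per-million-token rates, which is typical of cloud providers. The function name, the token counts, and the prices are hypothetical placeholders, not actual rates for any model listed on this page.

```python
def estimate_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_price_per_million: float,
    output_price_per_million: float,
) -> float:
    """Estimate the cost of a session from token counts and per-million-token prices."""
    return (
        input_tokens * input_price_per_million
        + output_tokens * output_price_per_million
    ) / 1_000_000


# Hypothetical session: 15 exchanges, each sending about 2,000 tokens
# and receiving about 500 tokens.
exchanges = 15
session_input = exchanges * 2_000   # 30,000 input tokens
session_output = exchanges * 500    # 7,500 output tokens

# Placeholder prices in USD per million tokens (NOT real provider rates).
cost = estimate_cost_usd(session_input, session_output, 1.00, 4.00)
print(f"Estimated session cost: ${cost:.2f}")
```

With these placeholder numbers the estimate comes to $0.06. To estimate a real session, substitute the current rates from your provider's pricing page.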

A typical Paquet Builder session with 10–20 exchanges costs:

  • GPT 5.4-mini: A few cents
  • Claude Sonnet 4.6: Around $0.05–0.15
  • Claude Opus 4.6: Around $0.30–0.80
  • Gemini 2.5 Flash: Free tier covers most usage

OpenAI

  • Reasoning Effort: Available for o-series and GPT-5 models. Controls how much “thinking” the model does before responding. Higher effort generally gives better results on complex tasks but is slower and more expensive.
  • The default model (when the field is empty) is gpt-4o.

Anthropic (Claude)

  • Generally provides the most reliable tool-calling behavior for multi-step operations.
  • Supports both Sonnet (faster, cheaper) and Opus (more capable) tiers.

Google (Gemini)

  • Free API tier available at aistudio.google.com with generous usage limits.
  • Some complex tool-calling scenarios may require a retry (handled automatically by Paquet Builder).

DeepSeek

  • Very competitive pricing, often 10–20x cheaper than equivalent Western providers.
  • Good function calling support for most operations.