AI Providers and Models

The Paquet Builder AI Assistant supports multiple AI providers, both cloud-based services and locally hosted models. This page helps you choose the right provider and model for your needs.

Cloud Providers

Provider	Recommended Model	Function Calling	Best For	Notes
OpenAI	`gpt-5.4-mini`	Excellent	Fast tasks, affordable usage	Reasoning effort option for o-series models
Anthropic (Claude)	`claude-sonnet-4-6`	Excellent	Complex multi-step tasks	Best accuracy for tool calling
Anthropic (Claude)	`claude-opus-4-6`	Excellent	Most demanding tasks	Premium, highest capability
Google (Gemini)	`gemini-2.5-flash`	Good	Budget-conscious usage	Free API tier available for low volume
Grok (xAI)	(any)	Good	Alternative provider
Mistral	(any)	Good	European-hosted provider	EU data residency
DeepSeek	`deepseek-chat`	Good	Very affordable	Competitive pricing

Local AI Providers

Run AI models on your own machine: no API key, no internet, no data leaves your computer.

Software	Provider Setting	Default Port	Notes
Ollama	Ollama	11434	Native integration, easiest setup
LM Studio	LlamaCpp	1234	User-friendly model browser
llama.cpp server	LlamaCpp	8080	Lightweight, for advanced users

See Using Local AI Models for step-by-step setup instructions.

Choosing the Right Model

Changing settings, adding files, querying project state: use a fast, affordable model:

GPT 5.4-mini (OpenAI): best speed-to-quality ratio
Gemini 2.5 Flash (Google): free tier, good for experimentation

Cost Considerations

Usage costs depend on the provider and model you choose. Most cloud providers offer pay-as-you-go pricing based on the number of tokens (words) processed.

A typical Paquet Builder session with 10–20 exchanges costs:

GPT 5.4-mini: A few cents
Claude Sonnet 4.6: Around $0.05–0.15
Claude Opus 4.6: Around $0.30–0.80
Gemini 2.5 Flash: Free tier covers most usage

Provider-Specific Notes

OpenAI

Reasoning Effort: Available for o-series and GPT-5 models. Controls how much “thinking” the model does before responding. Higher effort = better results for complex tasks but slower and more expensive.
The default model (when the field is empty) is gpt-4o.

Anthropic (Claude)

Generally provides the most reliable tool-calling behavior for multi-step operations.
Supports both Sonnet (faster, cheaper) and Opus (more capable) tiers.

Google (Gemini)

Free API tier available at aistudio.google.com with generous usage limits.
Some complex tool-calling scenarios may require retry (handled automatically by Paquet Builder).

DeepSeek

Very competitive pricing, often 10-20x cheaper than equivalent Western providers.
Good function calling support for most operations.

Setting Up the AI Assistant Configuration guide

Using Local AI Models Ollama, LM Studio, llama.cpp

What the AI Assistant Can Do Full capabilities reference

Using the AI Assistant Prompt examples and tips

MCP Server Overview Use AI tools like Claude Code with Paquet Builder