LLM models¶

hai supports models from OpenAI, Anthropic, DeepSeek, Google, and xAI. Local models served via llama.cpp or Ollama are supported as well.

Default model¶

The default model is the active model when a new instance of hai is launched.

Select a default model with:

/ai-default <model>

This writes to your hai.toml config which can also be changed manually:

default_ai_model = "gpt-5"

If a default model isn't set, it's automatically chosen based on API key availability.

Switching models¶

Switching is as easy as:

/ai <model>

Models can even be switched mid-conversation as long as the target model supports all the capabilities (images, tools) utilized in the conversation.

Notable models¶

Provider	Notable Models (Not Comprehensive)
OpenAI	gpt-5 (`g5`), gpt-5-mini (`g5m`), gpt-5-nano (`g5n`), gpt-4.1 (`41`), gpt-4.1-mini (`41m`), gpt-4.1-nano (`41n`), chatgpt-4o, gpt-4o (`4o`), gpt-4o-mini (`4om`)
	o4-mini (`o4m`), o3, o3-mini (`o3m`), o1, o1-mini (`o1m`)
Anthropic	sonnet-4 (`sonnet`), sonnet-4-thinking (`sonnet-thinking`), opus-4 (`opus`), opus-4-thinking (`opus-thinking`), haiku-3.5 (`haiku`)
Google	gemini-2.5-flash (`flash25`), gemini-2.5-pro (`gemini25pro`), gemini-2.0-flash (`flash20`)
DeepSeek	deepseek-reasoner (`r1`), deepseek-chat (`v3`)
xAI	grok-4
Ollama	gemma3:27b, gpt-oss:20b, llama3.2, llama3.2-vision

If a model doesn’t have a built-in shortcut, or if you want to use a specific version, you can specify it as <ai_provider>/<official_model_name>.

`llama.cpp` server¶

llama-server serves only one model per instance decided by the command-line arguments. Because the model parameter in the API request is ignored, llamacpp can be specified as the <ai_provider> without any model name.

/ai llamacpp

LLM models¶

Default model¶

Switching models¶

Notable models¶

llama.cpp server¶

`llama.cpp` server¶