Skip to main content
Agent Platform supports a wide range of AI models—platform-hosted, open-source, externally integrated, and third-party. Choose based on your use case: agentic workflows, real-time voice, or background processing.

Models for Agentic Apps and Agents

External Models

Agentic Apps support Agent and Supervisor orchestration with models from these providers:
ProviderModel Variants
OpenAI
  • gpt-5.2-2025-12-11
  • gpt-5.2
  • gpt-5.2-chat-latest
  • gpt-5.1-2025-11-13
  • gpt-5.1
  • gpt-5.1-chat-latest
  • gpt-realtime-mini-2025-10-06
  • gpt-audio-mini-2025-10-06
  • gpt-5-2025-08-07
  • gpt-5-mini-2025-08-07
  • gpt-5-nano-2025-08-07
  • gpt-audio-2025-08-28
  • gpt-realtime-2025-08-28
  • gpt-5
  • gpt-5-mini
  • gpt-5-nano
  • gpt-5-chat-latest
  • gpt-4o-audio-preview-2025-06-03
  • gpt-4o-realtime-preview-2025-06-03
  • gpt-4.5-preview-2025-02-27
  • o3-2025-04-16
  • o4-mini-2025-04-16
  • gpt-4.1-2025-04-14
  • gpt-4.1-mini-2025-04-14
  • gpt-4.1-nano-2025-04-14
  • gpt-4o-search-preview-2025-03-11
  • o3-mini-2025-01-31
  • gpt-4o-realtime-preview-2024-12-17
  • gpt-4o-mini-audio-preview-2024-12-17
  • gpt-4o-audio-preview-2024-12-17
  • o1-2024-12-17
  • gpt-4o-2024-11-20
  • gpt-4o-2024-08-06
  • gpt-4o-mini-2024-07-18
  • gpt-4o-2024-05-13
  • gpt-4-turbo-2024-04-09
  • gpt-4-1106-preview
  • gpt-4-0125-preview
  • gpt-4-turbo-preview
  • gpt-4-0613
  • gpt-4o
  • gpt-4o-mini
  • gpt-4
  • gpt-4o-realtime-preview
  • gpt-4o-mini-realtime-preview
  • gpt-4o-search-preview
  • gpt-4.1
  • gpt-4.1-mini
  • gpt-4.1-nano
  • gpt-4-turbo
  • gpt-3.5-turbo-1106
  • gpt-3.5-turbo-0125
  • gpt-3.5-turbo
  • gpt-4o-mini-transcribe
  • gpt-4o-mini-audio-preview
  • gpt-4o-audio-preview
  • gpt-audio-mini
  • gpt-audio
  • gpt-image-1-mini
  • gpt-image-1
  • gpt-image-1.5
  • gpt-realtime-mini
  • gpt-realtime
  • o4-mini
  • o3
  • o1
Azure OpenAI
  • GPT-3.5-Turbo
  • GPT-4
  • GPT-4o
  • GPT-4o-Mini
  • GPT-4.1
  • GPT-4.1-Nano
  • GPT-4.1-Mini
  • O1
  • O1-Mini
  • O3-Mini
  • GPT-5
  • GPT-5-Mini
  • GPT-5-Nano
  • GPT-5-Chat
  • GPT-5.1
  • GPT-5.1-Chat
  • GPT-5.2
  • GPT-5.2-Chat
Anthropic
  • claude-3-5-sonnet
  • claude-3-haiku
  • claude-3-sonnet
  • claude-3-opus
  • claude-3-7-sonnet-20250219
  • claude-3-5-sonnet-20241022
  • claude-sonnet-4-20250514
  • claude-sonnet-4-5-20250929
  • claude-3-5-haiku-20241022
  • claude-haiku-4-5-20251001
  • claude-opus-4-20250514
  • claude-opus-4-1-20250805
  • claude-opus-4-5-20251101
  • claude-opus-4-6
Google
  • gemini-2.5-flash-native-audio-preview-12-2025
  • gemini-2.5-flash-native-audio-preview-09-2025
  • gemini-2.5-flash-preview-09-2025
  • gemini-2.5-flash-lite-preview-09-2025
  • gemini-3-pro-preview
  • gemini-3-pro-image-preview
  • gemini-3-flash-preview
  • gemini-2.5-flash-preview-05-20
  • gemini-2.5-flash
  • gemini-2.5-pro
  • gemini-2.5-flash-lite
  • gemini-2.5-flash-image
  • gemini-2.0-flash
  • gemini-2.0-flash-lite
  • gemini-1.5-flash-latest
  • gemini-1.5-pro
  • gemini-1.0-pro

Open-Source Models

Agentic Apps support select open-source models from these providers:
ProviderModel Variants
Meta-llama
  • meta-llama/Meta-Llama-3.1-8B-Instruct
  • meta-llama/Llama-3.2-1B-Instruct
  • meta-llama/Llama-3.2-3B-Instruct
Mistralai
  • mistralai/Mistral-7B-Instruct-v0.3
  • mistralai/Mistral-Nemo-Instruct-2407
Xiaomimimo
  • XiaomiMiMo/MiMo-VL-7B-RL

Real-Time Voice Models

Real-Time Voice is available only with the following models:
ProviderModel Variants
OpenAI
  • gpt-4o-realtime-preview
  • gpt-4o-mini-realtime-preview
Google Gemini
  • gemini-live-2.5-flash-preview
  • gemini-2.0-flash-live-001
Azure OpenAI
  • GPT-Realtime
  • GPT-Realtime-Mini

Custom Model Integration

To use a custom third-party model in Agentic Apps, it must meet two requirements:

Tool Calling Support (Mandatory)

The model must natively support Tool Calling. Models without this capability cannot be used in Agentic Apps. If supported, enable Tool Calling explicitly during model configuration. Enable Tool Calling Learn how to enable model features

Compatible API Structure

The request and response structure must follow either:
  • Anthropic – Messages API
  • OpenAI – Chat Completions API
Request/Response Structures For setup steps, see Add an External Model Using API Integration.

Platform-Hosted Open-Source Models

The platform hosts 30+ curated open-source models, available as a service and optimizable before deployment.
ProviderModel Variants
Argilla
  • argilla/notus-7b-v1
  • argilla/notux-8x7b-v1
DeepSeek
  • deepseek-ai/DeepSeek-R1-Distill-Llama-8B
  • deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
  • deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
  • deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
EleutherAI
  • EleutherAI/gpt-j-6b
  • EleutherAI/gpt-neo-1.3B
  • EleutherAI/gpt-neo-125m
  • EleutherAI/gpt-neo-2.7B
  • EleutherAI/gpt-neox-20b
Facebook
  • facebook/opt-1.3b
  • facebook/opt-2.7b
  • facebook/opt-350m
  • facebook/opt-6.7b
Google
  • google/flan-t5-base
  • google/flan-t5-large
  • google/flan-t5-small
  • google/flan-t5-xl
  • google/flan-t5-xxl
  • google/gemma-2-27b-it
  • google/gemma-2-9b-it
  • google/gemma-2b
  • google/gemma-2b-it
  • google/gemma-3-12b-it
  • google/gemma-7b
  • google/gemma-7b-it
Helsinki-NLP
  • Helsinki-NLP/opus-mt-es-en
HuggingFaceH4
  • HuggingFaceH4/zephyr-7b-alpha
  • HuggingFaceH4/zephyr-7b-beta
Meta-llama
  • meta-llama/Llama-2-13b-hf
  • meta-llama/Llama-2-7b-hf
  • meta-llama/Llama-3.2-1B
  • meta-llama/Llama-3.2-1B-Instruct
  • meta-llama/Llama-3.2-3B
  • meta-llama/Llama-3.2-3B-Instruct
  • meta-llama/Llama-3.2-11B-Vision-Instruct
  • meta-llama/Llama-Guard-4-12B
  • meta-llama/Meta-Llama-3-8B
  • meta-llama/Meta-Llama-3-8B-Instruct
  • meta-llama/Meta-Llama-3.1-8B
  • meta-llama/Meta-Llama-3.1-8B-Instruct
Microsoft
  • microsoft/phi-1
  • microsoft/phi-1_5
  • microsoft/phi-2
  • microsoft/Phi-3-medium-128k-instruct
  • microsoft/Phi-3-medium-4k-instruct
  • microsoft/Phi-3-mini-128k-instruct
  • microsoft/Phi-3-mini-4k-instruct
Mistralai
  • mistralai/Mistral-7B-Instruct-v0.1
  • mistralai/Mistral-7B-Instruct-v0.2
  • mistralai/Mistral-7B-Instruct-v0.3
  • mistralai/Mistral-7B-v0.1
  • mistralai/Mistral-Nemo-Instruct-2407
  • mistralai/Mixtral-8x7B-Instruct-v0.1
  • mistralai/Mixtral-8x7B-v0.1
OpenAI
  • GPT2
OpenAI Community
  • openai-community/gpt2-large
  • openai-community/gpt2-medium
  • openai-community/gpt2-xl
Stable Diffusion
  • stabilityai/stable-diffusion-xl-base-1.0
  • stabilityai/stable-diffusion-2-1
  • stable-diffusion-v1-5/stable-diffusion-v1-5 (Available only in the text-to-image node; no Prompt Studio support)
T5
  • t5-base
  • t5-large
  • t5-small
Tiiuae
  • tiiuae/falcon-40b
  • tiiuae/falcon-40b-instruct
  • tiiuae/falcon-7b
  • tiiuae/falcon-7b-instruct
  • tiiuae/falcon-rw-1b
Xiaomimimo
  • XiaomiMiMo/MiMo-VL-7B-RL

Structured Output

Platform-hosted models can produce structured JSON responses, making outputs consistent and easy to parse. Supported optimization techniques: No optimization or vLLM only.
CT2-optimized, fine-tuned, Hugging Face imported, and locally imported models do not support structured output.
ModelvLLMNo Optimization
amazon/MistralLite
argilla/notus-7b-v1
EleutherAI/gpt-j-6b
facebook/opt-1.3b
facebook/opt-2.7b
facebook/opt-350m
facebook/opt-6.7b
google/gemma-2b
google/gemma-2b-it
google/gemma-7b
google/gemma-7b-it
HuggingFaceH4/zephyr-7b-alpha
HuggingFaceH4/zephyr-7b-beta
meta-llama/Llama-2-7b-chat-hf
meta-llama/Llama-2-7b-hf
meta-llama/Llama-3.2-1B
meta-llama/Llama-3.2-1B-Instruct
meta-llama/Llama-3.2-3B
meta-llama/Llama-3.2-3B-Instruct
meta-llama/Meta-Llama-3-8B
meta-llama/Meta-Llama-3-8B-Instruct
meta-llama/Meta-Llama-3.1-8B
meta-llama/Meta-Llama-3.1-8B-Instruct
microsoft/Phi-3-medium-128k-instruct
microsoft/Phi-3-medium-4k-instruct
microsoft/Phi-3-mini-128k-instruct
microsoft/Phi-3-mini-4k-instruct
microsoft/phi-1
microsoft/phi-1_5
microsoft/phi-2
mistralai/Mistral-7B-Instruct-v0.1
mistralai/Mistral-7B-Instruct-v0.2
mistralai/Mistral-7B-Instruct-v0.3
mistralai/Mistral-7B-v0.1
openai-community/gpt2-large
openai-community/gpt2-medium
openai-community/gpt2-xl
tiiuae/falcon-7b
tiiuae/falcon-7b-instruct
tiiuae/falcon-rw-1b

External Models for Easy Integration

With Easy Integration, connect to external model providers without any infrastructure setup—just authenticate and start using models in flows, tools, or agents.
ProviderModel Variants
Anthropic
  • claude-3-5-sonnet-20240620
  • claude-3-haiku-20240307
  • claude-3-opus-20240229
  • claude-3-sonnet-20240229
  • claude-2.1
  • claude-2.0
  • claude-3-7-sonnet-20250219
  • claude-3-5-sonnet-20241022
  • claude-3-5-haiku-20241022
  • claude-sonnet-4-20250514
  • claude-opus-4-20250514
  • claude-opus-4-1-20250805
  • claude-opus-4-5-20251101
  • claude-haiku-4-5-20251001
  • claude-sonnet-4-5-20250929
  • claude-opus-4-6
  • Claude Sonnet Vision (Available only for the image-to-text node; no Prompt Studio support)
Azure OpenAI
  • GPT-4
  • GPT-3.5-Turbo
  • GPT-4o-Mini
  • GPT-4o
  • GPT-4.1
  • GPT-4.1-mini
  • GPT-4.1-nano
  • GPT-4.5-preview
  • O1-Mini
  • O1
  • O3-Mini
  • GPT-5
  • GPT-5-Mini
  • GPT-5-Nano
  • GPT-5-Chat
  • GPT-5.1
  • GPT-5.1-Chat
  • GPT-5.2
  • GPT-5.2-Chat
Cohere
  • command-light-nightly
  • command-light
  • command
  • command-nightly
Google
  • gemini-2.5-flash-native-audio-preview-12-2025
  • gemini-2.5-flash-native-audio-preview-09-2025
  • gemini-2.5-flash-preview-09-2025
  • gemini-2.5-flash-lite-preview-09-2025
  • gemini-3-pro-preview
  • gemini-3-pro-image-preview
  • gemini-3-flash-preview
  • gemini-2.5-flash-preview-05-20
  • gemini-2.5-Pro
  • gemini-2.5-flash
  • gemini-2.5-flash-lite
  • gemini-2.5-flash-image
  • gemini-2.0-flash
  • gemini-2.0-flashlite
  • gemini-1.5-flash-latest
  • gemini-1.5-pro
  • gemini-1.0-pro
OpenAI
  • gpt-5.2-2025-12-11
  • gpt-5.2
  • gpt-5.2-chat-latest
  • gpt-5.1-2025-11-13
  • gpt-5.1
  • gpt-5.1-chat-latest
  • gpt-5-2025-08-07
  • gpt-5-nano-2025-08-07
  • gpt-5-mini-2025-08-07
  • gpt-5
  • gpt-5-nano
  • gpt-5-mini
  • gpt-5-chat-latest
  • gpt-4.5-preview-2025-02-27
  • gpt-4.1-2025-04-14
  • gpt-4.1-mini-2025-04-14
  • gpt-4.1-nano-2025-04-14
  • gpt-realtime-mini-2025-10-06
  • gpt-audio-mini-2025-10-06
  • gpt-audio-2025-08-28
  • gpt-realtime-2025-08-28
  • gpt-4o-audio-preview-2025-06-03
  • gpt-4o-realtime-preview-2025-06-03
  • o3-2025-04-16
  • o4-mini-2025-04-16
  • gpt-4o-search-preview-2025-03-11
  • o3-mini-2025-01-31
  • gpt-4o-realtime-preview-2024-12-17
  • gpt-4o-mini-audio-preview-2024-12-17
  • gpt-4o-audio-preview-2024-12-17
  • o1-2024-12-17
  • gpt-4o-2024-11-20
  • gpt-4o-2024-08-06
  • gpt-4o-mini-2024-07-18
  • gpt-4o-2024-05-13
  • gpt-4-turbo-2024-04-09
  • gpt-4-1106-preview
  • gpt-4-0125-preview
  • gpt-4-turbo-preview
  • gpt-4-0613
  • gpt-4o
  • gpt-4o-mini
  • gpt-4
  • gpt-4o-realtime-preview
  • gpt-4o-mini-realtime-preview
  • gpt-4o-search-preview
  • gpt-4.1
  • gpt-4.1-mini
  • gpt-4.1-nano
  • gpt-4-turbo
  • gpt-3.5-turbo-0125
  • gpt-3.5-turbo-1106
  • gpt-3.5-turbo
  • gpt-4o-mini-transcribe
  • gpt-4o-mini-audio-preview
  • gpt-4o-audio-preview
  • gpt-audio-mini
  • gpt-audio
  • gpt-image-1-mini
  • gpt-image-1
  • gpt-image-1.5
  • gpt-realtime-mini
  • gpt-realtime
  • o4-mini
  • o3
  • o1-mini
  • o1-preview
  • o1
  • whisper-1
  • whisper (Available only for the audio-to-text node; no Prompt Studio support)
  • dall-e-3
  • dall-e-2
  • text-embedding-3-large
  • text-embedding-3-small
  • text-embedding-ada-002