AI Models
Learn how to configure and manage AI model deployments in the Management Portal.
Overview
AI Models define the large language models (LLMs) and other AI models available to your FoundationaLLM deployment. These models power agent conversations, embeddings, and specialized capabilities like code interpretation and image generation.
Accessing AI Models
- In the Management Portal sidebar, click AI Models under the Models and Endpoints section
- The models list loads, showing all configured models
Model List
The table displays:
| Column | Description |
|---|---|
| Name | Model identifier |
| Source Type | Model provider (Azure OpenAI, Anthropic, etc.) |
| Edit | Settings icon to modify configuration |
| Delete | Trash icon to remove the model |
Searching and Managing
- Use the search box to filter by name
- Click the refresh button to reload the list
- Click column headers to sort
Model Types
| Type | Description | Use Cases |
|---|---|---|
| Chat/Completion | Text generation models | Agent conversations, responses |
| Embedding | Vector embedding models | Semantic search, similarity |
| Image Generation | Image creation models | DALL-E, image generation tools |
| Vision | Image understanding models | Image analysis, OCR |
Creating a Model
- Click Create Model at the top right of the page
- Configure the model settings
Model Configuration
TODO: Document specific model configuration fields when available in the UI, including:
| Field | Description |
|---|---|
| Model Name | Unique identifier |
| Source Type | Provider/platform (Azure OpenAI, Anthropic, etc.) |
| Deployment Name | Cloud deployment identifier |
| API Endpoint | Endpoint URL reference |
| Model Parameters | Default parameters (temperature, max tokens, etc.) |
Azure OpenAI Models
For Azure OpenAI deployments:
- Select Azure OpenAI as the source type
- Configure:
- API endpoint reference
- Deployment name
- Model version
Other Model Providers
For other providers (Anthropic, custom):
- Select the appropriate source type
- Configure provider-specific settings
- Enter authentication details
Model Configuration Details
API Endpoint Association
Models are associated with API endpoint configurations:
- Create or select an existing API endpoint
- Link the model to the endpoint
- The endpoint provides connection details
Model Parameters
Configure default model behavior:
| Parameter | Description | Typical Range |
|---|---|---|
| temperature | Response randomness | 0.0 - 2.0 |
| max_tokens | Maximum response length | 1 - model limit |
| top_p | Nucleus sampling | 0.0 - 1.0 |
Editing Models
- Locate the model in the list
- Click the Settings icon (⚙️)
- Modify settings as needed
- Click Save Changes
Deleting Models
- Click the Trash icon (🗑️) for the model
- Confirm deletion in the dialog
Warning: Deleting a model affects any agents using it. Verify dependencies before deleting.
Using Models in Agents
Models are referenced in agent configurations:
Workflow Main Model
The primary model for agent conversations:
- In agent creation/editing, find the Workflow section
- Select Workflow Main Model from the dropdown
- Only compatible models appear
Tool Models
Models assigned to specific tools:
- In tool configuration, add a Model resource
- Select the model
- Assign a role (e.g.,
main_model)
Access Control
Configure who can access and manage models:
| Permission | Description |
|---|---|
FoundationaLLM.AIModel/aiModels/read |
View models |
FoundationaLLM.AIModel/aiModels/write |
Edit models |
FoundationaLLM.AIModel/aiModels/delete |
Delete models |
Best Practices
Naming Conventions
- Use descriptive names indicating model type and purpose
- Include version information when relevant
- Example:
gpt-4o-chat-main,text-embedding-3-large
Model Selection
- Use appropriate models for each task (chat vs. embedding)
- Consider cost and performance tradeoffs
- Test models before production use
Parameter Configuration
- Set reasonable defaults
- Override at agent/tool level when needed
- Document parameter choices
Troubleshooting
Model Not Available in Dropdown
- Verify the model exists and is active
- Check your permissions
- Ensure the model type is compatible with the selection
Model Responses Failing
- Verify API endpoint configuration
- Check authentication credentials
- Review Azure OpenAI deployment status
Performance Issues
- Review token limits and quotas
- Check for rate limiting
- Consider model tier/capacity