Model Endpoints

Inferoa talks to model providers through configured endpoint profiles. The main agent model can use direct vLLM, vLLM Semantic Router, or an external compatible provider.

Setup Wizard

Run the wizard from the command line:

inferoa setup

Or from the TUI:

/setup
/model
/system

The setup flow writes endpoint URLs, selected models, context windows, and vault references to the configuration. Raw API keys are stored in the local vault, so the configuration holds only references to them.

Direct vLLM

The default configuration expects a local OpenAI-compatible vLLM endpoint:

model_setup:
  mode: direct
  provider: vllm
  base_url: http://localhost:8000/v1
  model: your-model-id
  context_window: 32768

Inferoa probes /v1/models and, for vLLM endpoints, may read optional load or metrics routes when available.
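To check by hand what such a probe would see, the following is a minimal sketch that queries the model list of an OpenAI-compatible endpoint. It is not Inferoa's own implementation. The function names (models_url, parse_model_ids, fetch_model_ids) are hypothetical, and the sketch assumes the endpoint returns the usual OpenAI-style list response with a top-level "data" array of objects carrying an "id" field.

```python
import json
import urllib.request
from typing import List


def models_url(base_url: str) -> str:
    """Build the model-list URL from a base URL that already ends in /v1."""
    return base_url.rstrip("/") + "/models"


def parse_model_ids(payload: dict) -> List[str]:
    """Extract model ids from an OpenAI-style list response.

    Assumes the shape {"object": "list", "data": [{"id": ...}, ...]}.
    Entries without an "id" field are skipped.
    """
    return [entry["id"] for entry in payload.get("data", []) if "id" in entry]


def fetch_model_ids(base_url: str, timeout: float = 5.0) -> List[str]:
    """GET <base_url>/models and return the advertised model ids."""
    with urllib.request.urlopen(models_url(base_url), timeout=timeout) as resp:
        return parse_model_ids(json.load(resp))
```

With a local vLLM server running, fetch_model_ids("http://localhost:8000/v1") should return the ids that are valid values for the model field above. The optional load and metrics routes are not covered here.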

vLLM Semantic Router

Use auto mode when vLLM Semantic Router should select the model path:

model_setup:
  mode: auto
  router: vllm-sr
  base_url: http://localhost:8000/v1
  model: auto

You can also enable auto mode for a single run through the environment:

INFEROA_MODE=auto inferoa

External Compatible Providers

External providers are useful for compatibility validation or fallback paths. Inferoa supports custom OpenAI-compatible endpoints and provider profiles for hosted APIs. Prefer the setup wizard so credentials go into the local vault.

To override the endpoint and model through environment variables:

INFEROA_BASE_URL=https://example.com/v1 \
INFEROA_MODEL=model-id \
inferoa

VLLM_BASE_URL and VLLM_MODEL are accepted as fallback environment variable names.
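The fallback behavior can be pictured with a short sketch. This is an illustration, not Inferoa's actual resolution code. The helper names (first_set, resolve_overrides) are hypothetical, and the precedence shown (INFEROA_* checked first, VLLM_* used only when the former is unset or empty) is an assumption drawn from the word "fallback" above.

```python
import os
from typing import Mapping, Optional


def first_set(env: Mapping[str, str], *names: str) -> Optional[str]:
    """Return the first non-empty value among the given variable names."""
    for name in names:
        value = env.get(name)
        if value:
            return value
    return None


def resolve_overrides(env: Mapping[str, str] = os.environ) -> dict:
    """Resolve endpoint overrides from an environment mapping."""
    # Assumed precedence: INFEROA_* first, then VLLM_* as the fallback.
    return {
        "base_url": first_set(env, "INFEROA_BASE_URL", "VLLM_BASE_URL"),
        "model": first_set(env, "INFEROA_MODEL", "VLLM_MODEL"),
    }
```

Passing a plain dictionary instead of os.environ makes the behavior easy to check: with only VLLM_BASE_URL set, base_url takes that value. With both names set, the INFEROA_BASE_URL value wins under the assumed precedence.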

Signal Flow