Model Endpoints
Inferoa talks to model providers through configured endpoint profiles. The main agent model can use direct vLLM, vLLM Semantic Router, or an external compatible provider.
Setup Wizard
inferoa setup
Or from the TUI:
/setup
/model
/system
The setup flow writes endpoint URLs, selected models, context windows, and vault references. Raw API keys are stored in the local vault.
Direct vLLM
The default configuration expects a local OpenAI-compatible vLLM endpoint:
model_setup:
mode: direct
provider: vllm
base_url: http://localhost:8000/v1
model: your-model-id
context_window: 32768
Inferoa probes /v1/models and, for vLLM endpoints, may read optional load or
metrics routes when available.
vLLM Semantic Router
Use auto mode when vLLM Semantic Router should select the model path:
model_setup:
mode: auto
router: vllm-sr
base_url: http://localhost:8000/v1
model: auto
You can also set:
INFEROA_MODE=auto inferoa
External Compatible Providers
External providers are useful for compatibility validation or fallback paths. Inferoa supports custom OpenAI-compatible endpoints and provider profiles for hosted APIs. Prefer the setup wizard so credentials go into the local vault.
For environment overrides:
INFEROA_BASE_URL=https://example.com/v1 \
INFEROA_MODEL=model-id \
inferoa
VLLM_BASE_URL and VLLM_MODEL are accepted as fallback environment variable
names.