Skip to main content

Context And RTK

Inferoa uses context settings and RTK to reduce token waste while preserving the evidence required for accurate work.

Context Settings

context:
compression_threshold: 0.8
context_window: 32768
protected_recent_loops: 3
engine:
provider: auto
startup: welcome
require_ready_before_chat: true
watch: true

compression_threshold controls when Inferoa starts compacting older context. protected_recent_loops keeps recent model and tool work visible. The context engine can use automatic detection, CodeGraph, built-in behavior, or be turned off.

RTK Settings

rtk:
enabled: true
delivery: managed
version: 0.42.3
auto_download: true

Environment overrides:

INFEROA_RTK=false inferoa
INFEROA_RTK_PATH=/path/to/rtk inferoa
INFEROA_RTK_AUTO_DOWNLOAD=false inferoa

Operational Views

/context
/tokenmaxxing

/context reports context usage and compression state. /tokenmaxxing reports RTK savings and token pressure alongside endpoint usage evidence.