Trigger HALO
Interact with the HALO system through a clean prompt-based interface. Use document grounding only when you want HALO to use a Markdown document for the current request.
Prompt → Response
A simple interface for interacting with HALO.
Document Grounding
Use a sample or custom Markdown document for this request.
LLM
Configure model and advanced chat behavior.
Answer Trace
Grounding, retrieval, and runtime details for the latest RAG response.
Trace Summary
- Trace ID —
- Model —
- Search Mode —
- Grounding Mode —
- Top K —
- Context Budget —
Retrieval
- Semantic Candidates —
- Lexical Candidates —
- Fused Candidates —
- Used Chunks —
Sources and Warnings
System Telemetry
Live hardware load and latest runtime information
System Load
Real-time hardware load information
CPU: —
—
GPU: —
—
RAM: —
Runtime Info
Information about the latest response
- Model —
- Created At —
- Total Duration —
- Load Duration —
- Prompt Eval Duration —
- Eval Count —
- Eval Duration —
RAG Options
Document for this request
Advanced RAG Settings
- Per-request document grounding
- Sample and custom Markdown support
- Preset-based retrieval configuration
- top_k and search mode controls
- Strict, balanced, and explain grounding modes
- Configurable context budget
- Optional reranking and citations
- Trace output for retrieval diagnostics
- Inline file preview and full file modal