Adaptive KV-cache selection for long-context RAG.

Simple infrastructure for long-context retrieval systems.

contact@hexinfra.com