Cache Policy: Context aware

Context Aware Cache

Map what users are asking the agent in real time, grouped by project namespace, intent, cost, latency, and drift risk.

Cluster Alpha: stable
Variance monitor: active
Cluster Density Map
Every point is a prompt clustered with HDBSCAN; shaded regions represent cluster density from Postgres.
DIMENSIONS: 1536 / KL_DIVERGENCE: 0.823 / PERPLEXITY: 30
Actionable Insight
Semantic cache opportunity

This cluster has 50 prompts with an average cosine similarity of 0.94. A cache threshold of 0.94, matching that observed similarity, would serve 47 of them without hitting the LLM.

Estimated savings: $525/mo
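The hit estimate above can be illustrated with a minimal sketch. It assumes each prompt embedding is compared against a single cached reference embedding (for example a cluster centroid) and counts as a hit when its cosine similarity meets the threshold. The dashboard does not state how it performs this comparison, so the function names `cosine_similarity` and `estimate_cache_hits` and the toy vectors are hypothetical.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def estimate_cache_hits(embeddings, reference, threshold):
    """Return (hits, coverage) for prompts whose similarity to the
    cached reference embedding is at least `threshold`."""
    if not embeddings:
        return 0, 0.0
    hits = sum(
        1 for e in embeddings if cosine_similarity(e, reference) >= threshold
    )
    return hits, hits / len(embeddings)

# Toy data (assumption): 47 prompts identical to the reference, 3 unrelated.
reference = [1.0, 0.0]
prompts = [[1.0, 0.0]] * 47 + [[0.0, 1.0]] * 3
hits, coverage = estimate_cache_hits(prompts, reference, threshold=0.94)
print(hits, f"{coverage:.1%}")  # prints: 47 94.0%
```

With 47 hits out of 50 prompts, the coverage works out to the 94.0% shown in the Efficiency Audit card.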
Efficiency Audit
Cache coverage across 50 plotted prompt datapoints.
94.0%
Variance Monitor
Drift detected in the highest-latency HDBSCAN cluster.
cos_sim_drift: +0.042
affected_cluster: id_882
window: 2026-04-28T02:00Z
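One plausible reading of `cos_sim_drift` is the change in a cluster's mean cosine similarity between a baseline period and the current window. The sketch below follows that reading; the dashboard does not document the actual formula, and the function names, the `tolerance` value, and the sample similarities are all hypothetical.

```python
from statistics import mean

def cos_sim_drift(baseline_sims, window_sims):
    """Signed change in mean cosine similarity: current window minus baseline."""
    return mean(window_sims) - mean(baseline_sims)

def drift_detected(drift, tolerance=0.03):
    """Flag drift when the absolute change exceeds the tolerance (assumed value)."""
    return abs(drift) > tolerance

# Illustrative values only, not taken from the dashboard's data.
baseline = [0.90, 0.90, 0.90]
window = [0.942, 0.942, 0.942]
drift = cos_sim_drift(baseline, window)
print(f"cos_sim_drift: {drift:+.3f}", drift_detected(drift))
```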
Live Prompt Inventory
Representative prompts, assigned clusters, and runtime profile.
All clusters
Prompt ID     Asked Prompt                                  Cluster        P50 Latency  Cost/1k  Status
PR-8221-A-01  Rewrite this query plan for lower scan cost   Tech-alpha     245 ms       $0.0014  Healthy
PR-4432-B-02  Find the regression in this auth middleware   Tech-alpha     198 ms       $0.0011  Healthy
PR-1140-C-03  Summarize failures from the last deploy logs  Tech-alpha     312 ms       $0.0017  Healthy
PR-9104-Z-04  Draft a friendlier onboarding email sequence  Creative-beta  842 ms       $0.0028  Review
PR-7719-D-05  Turn this changelog into a launch story       Creative-beta  690 ms       $0.0022  Healthy
PR-5520-M-06  Make this support answer concise and warm     Creative-beta  718 ms       $0.0025  Healthy
Showing 1-6 of 50 prompt families
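As a rough illustration of how the inventory rows could roll up into a per-cluster runtime profile, the sketch below groups the six visible rows by cluster and reports the median P50 latency and the number of prompts not marked Healthy. The row layout and the `summarize_by_cluster` helper are assumptions for illustration; the values are copied from the rows shown above.

```python
from collections import defaultdict
from statistics import median

# (prompt_id, cluster, p50_latency_ms, cost_per_1k_usd, status)
ROWS = [
    ("PR-8221-A-01", "Tech-alpha", 245, 0.0014, "Healthy"),
    ("PR-4432-B-02", "Tech-alpha", 198, 0.0011, "Healthy"),
    ("PR-1140-C-03", "Tech-alpha", 312, 0.0017, "Healthy"),
    ("PR-9104-Z-04", "Creative-beta", 842, 0.0028, "Review"),
    ("PR-7719-D-05", "Creative-beta", 690, 0.0022, "Healthy"),
    ("PR-5520-M-06", "Creative-beta", 718, 0.0025, "Healthy"),
]

def summarize_by_cluster(rows):
    """Group rows by cluster; report median P50 latency and non-Healthy count."""
    grouped = defaultdict(list)
    for _prompt_id, cluster, latency_ms, _cost, status in rows:
        grouped[cluster].append((latency_ms, status))
    return {
        cluster: {
            "median_p50_ms": median(lat for lat, _ in items),
            "needs_review": sum(1 for _, s in items if s != "Healthy"),
        }
        for cluster, items in grouped.items()
    }

summary = summarize_by_cluster(ROWS)
slowest = max(summary, key=lambda c: summary[c]["median_p50_ms"])
print(slowest, summary[slowest])
```

On these six rows, Creative-beta is the slower cluster by median P50 latency and holds the only prompt in Review status.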