Cache Policy: Context aware
Context Aware Cache
Map what users are asking the agent in real time, grouped by project namespace, intent, cost, latency, and drift risk.
Cluster Alpha: stableVariance monitor: active
Cluster Density Map
Every point is an HDBSCAN prompt; shaded regions represent cluster density from Postgres.
DIMENSIONS: 1536 / KL_DIVERGENCE: 0.823 / PERPLEXITY: 30
Technical QueriesCreative WritingSafety ReviewLong-tail Prompts
Actionable Insight
Semantic cache opportunity
This cluster has 50 prompts with avg cosine similarity 0.94. At a cache threshold matching that observed similarity, 0.94 would serve 47 of them without hitting the LLM.
Estimated savings$525/mo
Efficiency Audit
Cache coverage across 50 plotted prompt datapoints.
94.0%
Variance Monitor
Drift detected in the highest-latency HDBSCAN cluster.
cos_sim_drift: +0.042
affected_cluster: id_882
window: 2026-04-28T02:00Z
affected_cluster: id_882
window: 2026-04-28T02:00Z
Live Prompt Inventory
Representative prompts, assigned clusters, and runtime profile.
All clusters
| Prompt ID | Asked Prompt | Cluster | P50 Latency | Cost/1k | Status |
|---|---|---|---|---|---|
| PR-8221-A-01 | Rewrite this query plan for lower scan cost | Tech-alpha | 245ms | $0.0014 | Healthy |
| PR-4432-B-02 | Find the regression in this auth middleware | Tech-alpha | 198ms | $0.0011 | Healthy |
| PR-1140-C-03 | Summarize failures from the last deploy logs | Tech-alpha | 312ms | $0.0017 | Healthy |
| PR-9104-Z-04 | Draft a friendlier onboarding email sequence | Creative-beta | 842ms | $0.0028 | Review |
| PR-7719-D-05 | Turn this changelog into a launch story | Creative-beta | 690ms | $0.0022 | Healthy |
| PR-5520-M-06 | Make this support answer concise and warm | Creative-beta | 718ms | $0.0025 | Healthy |
Showing 1-6 of 50 prompt families