Veritas · LLM gateway

A semantic cache you can prove is correct.

A provider-agnostic LLM gateway whose two-tier cache is measured the way the commercial tier never is: not just a hit rate, but the precision and false-positive rate of those hits — so a paraphrase reuses an answer, but a near-identical question with the opposite answer does not.

Most semantic caches report only how often they hit. GPTCache, for one, publishes hit-ratio and recall but no precision — so a high hit rate can quietly include wrong answers. Veritas scores the metric they omit and gates CI on it.

Open the live playground →

Runs fully offline with no API key (deterministic mock provider + committed eval embeddings). Add an ANTHROPIC_API_KEY or OPENAI_API_KEY to route to a live model.