Unkey Semantic Caching demo

Try asking the chatbot an in-depth question. The first time you ask, the response is generated and streamed as normal. When you ask again, the response is served from the cache almost instantly.

Because the caching is semantic, questions that mean the same thing but are phrased differently return the same cached answer.
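
To make the idea concrete, here is a minimal sketch of a semantic cache lookup. It is not Unkey's implementation. Every name in it (`embed`, `cosine`, `SemanticCache`, `answer`, the 0.6 threshold) is hypothetical and chosen for illustration. The `embed` function is a toy word-count vector that only captures word overlap; a real semantic cache would presumably use an embedding model so that paraphrases with different vocabulary also match.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model: lowercase word counts (assumption)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class SemanticCache:
    """Stores (vector, answer) pairs and matches by similarity, not exact text."""

    def __init__(self, threshold: float = 0.6):
        self.threshold = threshold  # hypothetical cutoff; tune per embedding model
        self.entries = []

    def get(self, question: str):
        """Return the most similar cached answer, or None if below the threshold."""
        vec = embed(question)
        best_score, best_answer = 0.0, None
        for cached_vec, cached_answer in self.entries:
            score = cosine(vec, cached_vec)
            if score > best_score:
                best_score, best_answer = score, cached_answer
        return best_answer if best_score >= self.threshold else None

    def put(self, question: str, answer_text: str) -> None:
        self.entries.append((embed(question), answer_text))


def answer(cache: SemanticCache, question: str, generate):
    """Return (response, was_cached); call generate() only on a cache miss."""
    cached = cache.get(question)
    if cached is not None:
        return cached, True
    fresh = generate(question)
    cache.put(question, fresh)
    return fresh, False
```

Passing a model call as `generate`, the first question is a miss and invokes the model; a later, similarly worded question clears the threshold and reuses the stored answer without calling the model again. The threshold is the main design choice: set it too low and unrelated questions share answers, too high and rephrasings miss the cache.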