Unkey Semantic Caching demo
Try prompting the chatbot with an in-depth question. The first time you ask, it will be streamed as normal. Subsequent times, it will be served from the cache near-instantly.
Because the caching is semantic, identical questions with different phrasing will return the same answer from the cache.