Prefix Caching in AI

Explains prefix caching for reusing attention KV computations to speed up shared-prefix AI inference.
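As a rough illustration of the idea, the toy sketch below caches per-token key/value entries by token prefix, so a second prompt that starts with the same tokens only pays for its new suffix. Everything here is an assumption made for the example: `PrefixCache`, `prefill`, and `_compute_kv` are hypothetical names, and the "KV" values are simple integers standing in for real attention tensors. It stores one entry per prefix for clarity, which real inference servers would replace with a more compact structure.

```python
from typing import Dict, List, Tuple

KV = Tuple[int, int]  # stand-in for one token's (key, value) pair


class PrefixCache:
    """Toy prefix cache mapping a token prefix to its per-token KV entries."""

    def __init__(self) -> None:
        self._store: Dict[Tuple[int, ...], List[KV]] = {}
        self.computed_tokens = 0  # tokens whose KV was actually computed

    def _compute_kv(self, tokens: List[int], pos: int) -> KV:
        # Hypothetical stand-in for an attention layer's K/V computation.
        # As in a causal model, the entry at `pos` depends only on
        # tokens[: pos + 1], which is what makes prefix reuse valid.
        self.computed_tokens += 1
        return (sum(tokens[: pos + 1]), tokens[pos] * (pos + 1))

    def _longest_cached_prefix(self, tokens: List[int]) -> int:
        # Try the longest prefix first and shrink until one is cached.
        for n in range(len(tokens), 0, -1):
            if tuple(tokens[:n]) in self._store:
                return n
        return 0

    def prefill(self, tokens: List[int]) -> List[KV]:
        # Reuse KV entries for the cached prefix, compute only the rest.
        n = self._longest_cached_prefix(tokens)
        kv = list(self._store[tuple(tokens[:n])]) if n else []
        for pos in range(n, len(tokens)):
            kv.append(self._compute_kv(tokens, pos))
            # Record every prefix so later prompts can share any of them.
            self._store[tuple(tokens[: pos + 1])] = list(kv)
        return kv


if __name__ == "__main__":
    cache = PrefixCache()
    system_prompt = [1, 2, 3]              # shared prefix, e.g. a system prompt
    cache.prefill(system_prompt + [4, 5])  # computes all 5 tokens
    cache.prefill(system_prompt + [6])     # reuses 3 tokens, computes 1
    print(cache.computed_tokens)
```

In this sketch the second call computes one token instead of four, and the reused entries are identical to what a cold computation would produce, because each entry depends only on the tokens before it.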