b9489
b9489
View on GitHubView PackagePublished: Jun 3, 2026

Release Notes

cuda: reserve space for quantize kv-cache at startup (#23907)

  • cuda: reserve space for quantize kv-cache at startup

  • address review comments

  • remove forward decl

Co-authored-by: Johannes Gäßler [email protected]

  • remove assert in ggml-cuda.cu

Co-authored-by: Johannes Gäßler [email protected]


Co-authored-by: Johannes Gäßler [email protected]

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI: