v0.31.1
v0.31.1
View on GitHubView PackagePublished: Jun 30, 2026

Release Notes

What's Changed

  • mlx: tighten up gemma4 moe loading code by @pdevine in https://github.com/ollama/ollama/pull/16964
  • mlx: bump to latest version to include new small batch matmul kernel @jessegross @dhiltgen
  • llama.cpp: bump to b9840 @dhiltgen
  • improved gemma4 MTP performance @jessegross

Full Changelog: https://github.com/ollama/ollama/compare/v0.31.0...v0.31.1