v0.31.1
v0.31.1
Release Notes
What's Changed
- mlx: tighten up gemma4 moe loading code by @pdevine in https://github.com/ollama/ollama/pull/16964
- mlx: bump to latest version to include new small batch matmul kernel @jessegross @dhiltgen
- llama.cpp: bump to b9840 @dhiltgen
- improved gemma4 MTP performance @jessegross
Full Changelog: https://github.com/ollama/ollama/compare/v0.31.0...v0.31.1