nomic-ai/gpt4all | My Release Notes

Never miss a release that matters

AI-powered summaries of every GitHub release.

AI Summaries

Changelogs condensed into clear, actionable insights.

Always Free

Track up to 5 packages at no cost, forever.

Weekly Digest

A curated summary of every release, delivered weekly.

Get Started Free

nomic-ai/gpt4allv3.x

20 releases

TL;DR

GPT4All now supports connecting to and using remote models from Groq, OpenAI, and Mistral, expanding access to powerful language models.

New

Remote Model Support: Easily configure and use models hosted by Groq, OpenAI, and Mistral (cloud-based AI services).
Granite Model: Added support for the new Granite model.

Fixes Worth Knowing

Whitespace: Improved output formatting for DeepSeek-R1 models.
Stability: Resolved several crash issues.

Before You Upgrade

Ensure your GPU meets the CUDA 5.0 compatibility requirement if you intend to use the CUDA backend.

v3.10.0v3.10.0

Feb 25, 2025

v3.9.0v3.9.0

Feb 5, 2025

v3.8.0v3.8.0

Jan 31, 2025

v3.7.0v3.7.0

Jan 23, 2025

v3.6.1v3.6.1

Dec 20, 2024

v3.6.0v3.6.0

Dec 19, 2024

v3.5.3v3.5.3

Dec 16, 2024

v3.5.2v3.5.2

Dec 14, 2024

v3.5.1v3.5.1

Dec 10, 2024

v3.5.0

Dec 9, 2024

v3.4.2v3.4.2

Oct 16, 2024

v3.4.1v3.4.1

Oct 11, 2024

v3.4.0v3.4.0

Oct 8, 2024

v3.3.1v3.3.1

Sep 27, 2024

v3.3.0v3.3.0

Sep 23, 2024

v3.2.1v3.2.1

Aug 13, 2024

v3.2.0v3.2.0

Aug 12, 2024

v3.1.1v3.1.1

Jul 27, 2024

v3.1.0v3.1.0

Jul 24, 2024

v3.0.0v3.0.0

Jul 2, 2024

nomic-ai/gpt4allv-1.x

2 releases

TL;DR

GPT4All now includes a beta web search feature (internet access) powered by the Llama 3.1 model, allowing it to answer questions using current information.

Breaking

Model Renaming: Rename Meta-Llama-3.1-8B-Instruct-128k-Q4_0.gguf to Meta-Llama-3.1-8B-Instruct-Q4_0.gguf.
Prompt/System Template Update: Update prompt and system templates with provided code snippets.

New

Web Search: Access the internet to provide more informed responses. (Requires setup - see link in release notes)

Fixes Worth Knowing

Improved web search context injection and source display.
Fixed RoPE scaling issue with Llama 3.1.

Before You Upgrade

Rename the specified model file.
Update your prompt and system templates with the provided code. See the wiki for detailed instructions: https://github.com/nomic-ai/gpt4all/wiki/Web-Search-Beta-Release

v3.1.1-web_search_beta_2v3.1.1-web_search_beta_2

Jul 27, 2024

v3.1.0-web_search_betav3.1.0-web_search_beta

Jul 25, 2024

nomic-ai/gpt4allv2.x

14 releases

TL;DR

GPT4All now supports NVIDIA GPUs (CUDA) via llama.cpp, significantly accelerating prompt processing and generation on compatible hardware.

Breaking

Unsupported models (Mamba, Persimmon, PLaMo) have been removed from the allowed list.

New

CUDA Support: Utilize NVIDIA GPUs for faster performance with compatible model types.
InternLM Models: Added support for the InternLM family of models.

Fixes Worth Knowing

Message sending is now blocked while the LLM is responding.
Improved chat title generation quality.
Corrected model loading progress display.
Resolved several memory leaks for improved stability.

Before You Upgrade

If you have an NVIDIA GPU, ensure you have the latest drivers installed to take advantage of the CUDA backend. You can select it in the Settings menu.

v2.8.0v2.8.0

May 24, 2024

v2.7.5v2.7.5

May 3, 2024

v2.7.4v2.7.4

Apr 26, 2024

v2.7.3v2.7.3

Mar 13, 2024

v2.7.2v2.7.2

Mar 13, 2024

v2.7.1v2.7.1

Feb 26, 2024

v2.7.0v2.7.0

Feb 8, 2024

v2.5.1v2.5.1

Feb 4, 2024

v2.5.0v2.5.0

Feb 4, 2024

v2.6.2v2.6.2

Feb 1, 2024

v2.6.1v2.6.1

Jan 11, 2024

v2.5.4v2.5.4

Nov 21, 2023

v2.5.3v2.5.3

Nov 20, 2023

v2.5.2v2.5.2

Oct 30, 2023

nomic-ai/gpt4allv2.xprerelease

2 releases

TL;DR

GPT4All now supports NVIDIA GPUs (CUDA) via llama.cpp, significantly improving performance for compatible models and quantization types.

Breaking

Removed support for Mamba, Persimmon, and PLaMo models (less common model architectures).

New

CUDA Support: Utilize NVIDIA GPUs for faster prompt processing and generation.
InternLM Models: Added support for the InternLM family of models (open-source language models).

Fixes Worth Knowing

Message sending is now blocked while the LLM is responding.
Improved chat title generation quality.
Corrected model loading progress display.

Before You Upgrade

If you have an NVIDIA GPU, ensure you have the latest drivers installed to take advantage of the CUDA backend. You can select it in Settings.

v2.8.0-pre1v2.8.0-pre1

May 15, 2024

v2.5.0-pre1v2.5.0-pre1

Oct 6, 2023