Never miss a release that matters

AI-powered summaries of every GitHub release.

AI Summaries

Changelogs condensed into clear, actionable insights.

Always Free

Track up to 5 packages at no cost, forever.

Weekly Digest

A curated summary of every release, delivered weekly.

Get Started Free
huggingface/transformersv5.x
18 releases

TL;DR

The transformers library now supports Gemma 4 Unified, a multimodal model without dedicated vision/audio towers, alongside new models like Sapiens2 (human-centric vision), DeepSeek-OCR-2 (OCR tasks), and Mellum (code generation).

Breaking

  • The Gemma4 vision pooler now casts inputs to float32 to prevent potential overflow errors when using float16 precision. (This affects models using the Gemma4 vision component.)

New

  • Gemma 4 Unified: A simplified multimodal model offering strong performance without separate vision and audio encoders. (Multimodal models process both text and images/audio.)
  • Sapiens2: New vision transformers designed for human-centric tasks like pose estimation. (Pose estimation identifies body positions.)
  • Mellum: A code-focused Mixture-of-Experts model for code generation. (Mixture-of-Experts models use multiple sub-models.)

Fixes Worth Knowing

  • Fixed a potential float16 overflow issue in the Gemma4 vision pooler, improving stability.

Before You Upgrade

  • If you are using the Gemma4 vision pooler with float16 precision, be aware of the change to float32 casting, which may slightly alter results.
huggingface/transformersv5.xprerelease
4 releases

TL;DR

The transformers library expands model support with GLM-4.7, GLM-Image, LWDetr, LightOnOCR, and MiniMax-M2, alongside numerous bug fixes and performance improvements focused on generation and stability.

Breaking

  • Deprecated classes have been removed.
  • dtype per sub config is deprecated.
  • Unsafe torch.load() has been fixed, potentially impacting custom loading procedures (security fix).

New

  • Added support for GLM-4.7 and GLM-Image models.
  • Expanded model coverage with LWDetr and LightOnOCR.

Fixes Worth Knowing

  • Resolved generation length issues with qwen2_5_omni and DiT models.
  • Corrected bugs in Fuyu processor width calculation.
  • Fixed failing tests for several models including Bart, llava, Pix2Struct, and others.
  • Addressed a crash when using FSDP2 with Tensor Parallelism.
  • Improved stability with FlashAttention and quantized models.
  • Resolved UTF-8 encoding issues on Windows.

Before You Upgrade

  • Review
Jan 26, 2026
Jan 8, 2026
Jan 8, 2026
Dec 1, 2025
huggingface/transformersv4.x
169 releases

TL;DR

Qwen models (image and language) now load and function correctly, resolving issues with model type recognition and cached tokenizers.

Fixes Worth Knowing

  • Grouped beam search (advanced decoding) now correctly uses configuration parameters.
  • Offline tokenizers (pre-downloaded vocabularies) now load properly for Mistral models.
  • Learning rate scheduler parsing is more robust.
Jan 16, 2026
Jan 13, 2026
Jan 13, 2026
Nov 25, 2025
Nov 24, 2025
Oct 14, 2025
Sep 17, 2025
Sep 4, 2025
Aug 22, 2025
Aug 21, 2025
Aug 13, 2025
Jul 29, 2025
Jul 22, 2025
Jul 11, 2025
Jul 4, 2025
Jun 26, 2025
May 30, 2025
May 22, 2025
May 21, 2025
Apr 14, 2025
Apr 10, 2025
Apr 8, 2025
Mar 28, 2025
Mar 27, 2025
Mar 25, 2025
Mar 21, 2025
Feb 7, 2025
Jan 30, 2025
Jan 20, 2025
v4.47.1v4.47.1
Dec 17, 2024
Nov 18, 2024
Nov 5, 2024
Oct 29, 2024
Oct 24, 2024
Oct 7, 2024
Sep 26, 2024
Aug 22, 2024
Aug 20, 2024
Aug 6, 2024
Aug 5, 2024
Jul 26, 2024
Jul 24, 2024
Jul 23, 2024
Jul 11, 2024
Jun 28, 2024
Jun 28, 2024
Jun 27, 2024
May 30, 2024
v4.40.2v4.40.2
May 6, 2024
Apr 2, 2024
Mar 28, 2024
Mar 22, 2024
Mar 21, 2024
v4.38.2v4.38.2
Mar 1, 2024
v4.38.1v4.38.1
Feb 22, 2024
Jan 29, 2024
Jan 24, 2024
Dec 18, 2023
Dec 14, 2023
Nov 15, 2023
Nov 14, 2023
Oct 18, 2023
Sep 27, 2023
Sep 15, 2023
Aug 28, 2023
Aug 22, 2023
Jun 13, 2023
Jun 9, 2023
May 16, 2023
May 11, 2023
Apr 14, 2023
Mar 29, 2023
Mar 23, 2023
Mar 20, 2023
Mar 15, 2023
Feb 9, 2023
Oct 11, 2022
Sep 27, 2022
Sep 16, 2022
Sep 5, 2022
Aug 24, 2022
Aug 4, 2022
Jun 21, 2022
Jun 10, 2022
Jun 9, 2022
May 16, 2022
May 13, 2022
Jan 31, 2022
Jan 28, 2022
v4.15.0v4.15.0
Dec 22, 2021
Dec 15, 2021
Nov 17, 2021
Nov 16, 2021
Nov 3, 2021
Oct 29, 2021
Oct 29, 2021
Oct 6, 2021
Sep 30, 2021
Sep 29, 2021
Sep 22, 2021
Sep 10, 2021
Sep 10, 2021
Aug 9, 2021
Jul 26, 2021
Jun 30, 2021
Jun 24, 2021
May 20, 2021
Apr 13, 2021
Mar 18, 2021
Feb 24, 2021
Feb 9, 2021
Feb 9, 2021
Jan 21, 2021
Jan 14, 2021
huggingface/transformersv-1.x
25 releases

TL;DR

The transformers library now supports Vault-Gemma, a new 1B parameter text generation model (privacy-focused language model) from Google, offering a privacy-preserving alternative to existing models.

New

  • Vault-Gemma Support: Added the google/vaultgemma-1b model, trained with differential privacy for enhanced data security.
  • Chat Interface: Interact with Vault-Gemma directly using the transformers chat command-line tool.

Before You Upgrade

Install Vault-Gemma specifically using pip install git+https://github.com/huggingface/[email protected] as it’s a preview release and doesn’t follow standard versioning.

v4.56.1-Vault-Gemma-previewVault-Gemma (based on v4.56.1)
Sep 12, 2025
v4.56.0-Embedding-Gemma-previewEmbedding Gemma (based on v4.56.0)
Sep 4, 2025
4.55.0-GLM-4.5V-previewGLM-4.5V preview based on 4.55.0
Aug 11, 2025
Jul 23, 2025
v4.53.2-modernbert-decoder-previewModernBERT Decoder (based on v4.53.2)
Jul 16, 2025
v4.52.4-Kyutai-STT-previewKyutai-STT (based on v4.52.4)
Jun 24, 2025
v4.52.4-VJEPA-2-previewV-JEPA 2 (based on v4.52.4)
Jun 11, 2025
v4.52.4-ColQwen2-previewColQwen2 (based on v4.52.4)
Jun 2, 2025
v4.51.3-CSM-previewCSM (based on v4.51.3)
May 8, 2025
v4.51.3-GraniteMoeHybrid-previewGraniteMoeHybrid (based on v4.51.3)
May 8, 2025
v4.51.3-D-FINE-previewD-FINE (based on v4.51.3)
May 8, 2025
v4.51.3-SAM-HQ-previewSAM-HQ (based on v4.51.3)
May 8, 2025
v4.51.3-BitNet-previewBitNet (based on v4.51.3)
May 8, 2025
v4.51.3-LlamaGuard-previewLlamaGuard-4 (based on v4.51.3)
Apr 30, 2025
v4.51.3-Qwen2.5-Omni-previewQwen2.5-Omni (based on 4.51.3)
Apr 24, 2025
Apr 22, 2025
v4.51.3-Janus-previewJanus (based on v4.51.3)
Apr 22, 2025
v4.51.3-TimesFM-previewTimesFM (based on v4.51.3)
Apr 22, 2025
v4.51.3-MLCD-previewMLCD (based on 4.51.3)
Apr 22, 2025
Mar 28, 2025
Mar 18, 2025
Mar 18, 2025
Feb 21, 2025
Feb 20, 2025
huggingface/transformersv4.xprerelease
2 releases

TL;DR

Aya Vision, a new state-of-the-art multilingual multimodal model (handles images & text), is now available, enabling image understanding and text generation in 23 languages.

New

  • Aya Vision Models: Added 8B and 32B parameter models for multimodal tasks.
  • Multilingual Support: Supports 23 languages for both visual and textual understanding.

Before You Upgrade

Install using pip install git+https://github.com/huggingface/[email protected] to access the Aya Vision models.

huggingface/transformersv3.x
10 releases

TL;DR

The transformers library now uses Git repositories for model storage, enabling versioning, access control, and scalability, fundamentally changing how models are downloaded and shared.

Breaking

  • Model uploads using the previous system are no longer supported; upgrade to this release or use the new CLI tools.
  • TensorFlow users: pinned sentencepiece to 0.1.91 to resolve build issues.

New

  • Git-backed Model Storage: Models are now stored in Git repositories (with S3 for large files), providing versioning via tags, branches, or commit hashes (e.g., AutoTokenizer.from_pretrained("model", revision="v2.0.1")). You can even clone model repositories locally.
  • TensorFlow 2.0 Support: Added functionality for state-of-the-art sequence-to-sequence transformers in TensorFlow.
  • Seq2Seq Trainer: A specialized Trainer for sequence-to-sequence models is available, improving API support and performance.

Fixes Worth Knowing

  • Fixed issues with pipelines (text generation, QA) and tokenizers, improving stability and functionality.
  • Improved error messages
huggingface/transformersv2.x
18 releases

TL;DR

The release introduces Longformer, a new model for processing long sequences of text, alongside several community notebooks demonstrating its use and other models.

Breaking

  • Model instantiation for BART, Flaubert, Japanese BERT variants, Finnish BERT variants, Dutch BERT, and ALBERT from TensorFlow now requires the full model ID (e.g., "cl-tohoku/bert-base-japanese") instead of relying on hardcoded URLs.

New

  • Longformer Support: Added the Longformer model architecture, tokenizer, and pre-trained weights for tasks like question answering and sequence classification.
  • Community Notebooks: Several new notebooks are available demonstrating fine-tuning and pre-training techniques for various models, including Longformer, BART, and T5.

Fixes Worth Knowing

  • Corrected tokenizer behavior for summarization pipelines and fast tokenizers.
  • Fixed issues with MNLI and SST-2 datasets.
  • Improved robustness of the max_len attribute and added deprecation warnings.
  • Fixed tokenization of extra ID symbols in the T5 tokenizer.

Before You Upgrade

  • Update your code to use the full model ID when instantiating
huggingface/transformersv1.x
3 releases

TL;DR

The transformers library now supports DistilBERT, a faster and lighter version of BERT, alongside new checkpoints for GPT-2 Large and XLM, significantly expanding model options for various natural language processing (NLP) tasks.

Breaking

  • A new dependency, sacremoses (a Moses tokenizer port), is required for XLM support.
  • XLM tokenization in Thai, Japanese, and Chinese may require additional, optional dependencies (pythainlp, kytea, jieba) which must be installed separately.

New

  • DistilBERT: A distilled version of BERT offering improved speed and efficiency.
  • GPT-2 Large: The 774M parameter GPT-2 model is now available.
  • AutoModels: Generic classes for easier model instantiation using from_pretrained().

Fixes Worth Knowing

  • Improved multi-GPU training stability.
  • Corrected saving and reloading of models with pruned heads.
  • Fixed issues with GPT-2 and RoBERTa tokenizers related to sentence spacing.
  • Enhanced XLM tokenization for multilingual inputs.
  • Added shortcuts for accessing special token IDs (e
huggingface/transformersv0.x
9 releases

TL;DR

This release updates the transformers library with improved model saving/loading and replaces the old learning rate warmup with more flexible scheduling options.

Breaking

  • warmup_linear in OpenAIAdam and BertAdam is removed; use the new schedule classes instead (learning rate adjustments).

New

  • BERT language model fine-tuning scripts are added (scripts for training).
  • GLUE task support is expanded in run_classifier.py (natural language understanding benchmark).

Fixes Worth Knowing

  • Tokenizers now support sequences longer than 512 tokens (input length).
  • GPT-2 loss computation and FP16 training stability are improved (generation quality).
  • Model serialization is more reliable (saving/loading models).