huggingface/transformers | My Release Notes

huggingface/transformers v5.x

27 releases

TL;DR

This update introduces support for the multimodal Inkling model and significantly boosts performance for large inputs via FlashAttention kernel integration with StaticCache.

Breaking

GPTNeoX and GPTBigCode weight naming and attention backend behavior have changed to ensure vLLM (high-throughput LLM serving) compatibility.

New

Model Additions: Added Inkling (multimodal), TIPSv2, and TIPSv2 DPT.
Generation Enhancements: Added Multi-Token Prediction (MTP) decoding and static ensemble verification for speculative decoding.

Fixes Worth Knowing

Resolved crashes in greedy assisted generation when using different tokenizers.
Fixed DeepGEMM (low-precision matrix multiplication) issues on multi-device setups.
Corrected SDPA prefill and assisted decoding bugs specifically affecting Inkling and OlmoHybrid models.

Before You Upgrade

If using GPTNeoX or GPTBigCode, update your code to handle the embed_out to lm_head remapping.
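For custom code that reads or writes checkpoint keys directly, the remapping can be sketched as a plain key rename. This is a minimal illustration, not the library's own conversion logic: the summary names only `embed_out` and `lm_head`, so the exact key strings below (`embed_out.weight`, `gpt_neox.embed_in.weight`) and the helper name `remap_output_head_keys` are assumptions. Check them against your checkpoint before relying on this.

```python
from typing import Dict, TypeVar

T = TypeVar("T")

# Assumed prefixes: the release summary only mentions "embed_out" and "lm_head".
OLD_PREFIX = "embed_out."
NEW_PREFIX = "lm_head."


def remap_output_head_keys(state_dict: Dict[str, T]) -> Dict[str, T]:
    """Return a copy of state_dict with the output-head prefix renamed.

    Hypothetical helper for illustration; values are passed through untouched.
    """
    remapped: Dict[str, T] = {}
    for key, value in state_dict.items():
        if key.startswith(OLD_PREFIX):
            key = NEW_PREFIX + key[len(OLD_PREFIX):]
        remapped[key] = value
    return remapped


# Strings stand in for tensors so the sketch runs without any ML dependency.
old = {"embed_out.weight": "W_out", "gpt_neox.embed_in.weight": "W_in"}
new = remap_output_head_keys(old)
print(sorted(new))  # ['gpt_neox.embed_in.weight', 'lm_head.weight']
```

The helper returns a new dictionary rather than mutating its input, so the original checkpoint mapping stays available for comparison.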

v5.14.1Patch release: v5.14.1

Jul 16, 2026

v5.14.0Release v5.14.0

Jul 15, 2026

v5.13.1Patch release v5.13.1

Jul 11, 2026

v5.13.0Release v5.13.0

Jul 3, 2026

v5.12.1Patch release v5.12.1

Jun 15, 2026

v5.10.3Patch release v5.10.3

Jun 15, 2026

v5.12.0Release v5.12.0

Jun 12, 2026

v5.11.0Release v5.11.0

Jun 10, 2026

v5.10.2Patch release v5.10.2

Jun 4, 2026

v5.10.1Release v5.10.1

Jun 3, 2026

v5.9.0Release v5.9.0

May 20, 2026

v5.8.1Patch release v5.8.1

May 13, 2026

v5.8.0Release 5.8.0

May 5, 2026

v5.7.0Release v5.7.0

Apr 28, 2026

v5.6.2Patch release v5.6.2

Apr 23, 2026

v5.6.1Patch release v5.6.1

Apr 23, 2026

v5.6.0Release v5.6.0

Apr 22, 2026

v5.5.4Patch release v5.5.4

Apr 13, 2026

v5.5.3Patch release: v5.5.3

Apr 9, 2026

v5.5.2Patch release: v5.5.2

Apr 9, 2026

v5.5.1Patch release v5.5.1

Apr 9, 2026

v5.5.0Release v5.5.0

Apr 2, 2026

v5.4.0Release v5.4.0: PaddlePaddle models 🙌, Mistral 4, PI0, VidEoMT, UVDoc, SLANeXt, Jina Embeddings v3

Mar 27, 2026

v5.3.0v5.3.0: EuroBERT, VibeVoice ASR, TimesFM2.5, PP-DocLayoutV2, OlmoHybrid, ModernVBert, Higgs Audio V2

Mar 4, 2026

v5.2.0v5.2.0: GLM-5, Qwen3.5, Voxtral Realtime, VibeVoice Acoustic Tokenizer

Feb 16, 2026

v5.1.0v5.1.0: EXAONE-MoE, PP-DocLayoutV3, Youtu-LLM, GLM-OCR

Feb 5, 2026

v5.0.0Transformers v5

Jan 26, 2026

huggingface/transformers v5.x prerelease

4 releases

TL;DR

The transformers library expands model support with GLM-4.7, GLM-Image, LWDetr, LightOnOCR, and MiniMax-M2, alongside numerous bug fixes and performance improvements focused on generation and stability.

Breaking

Deprecated classes have been removed.
Setting dtype per sub-config is deprecated.
Unsafe use of torch.load() has been fixed (security fix); custom loading procedures may be affected.

New

Added support for GLM-4.7 and GLM-Image models.
Expanded model coverage with LWDetr and LightOnOCR.

Fixes Worth Knowing

Resolved generation length issues with qwen2_5_omni and DiT models.
Corrected bugs in Fuyu processor width calculation.
Fixed failing tests for several models including Bart, llava, Pix2Struct, and others.
Addressed a crash when using FSDP2 with Tensor Parallelism.
Improved stability with FlashAttention and quantized models.
Resolved UTF-8 encoding issues on Windows.

Before You Upgrade

Review

v5.0.0rc3Release candidate v5.0.0rc3

Jan 26, 2026

v5.0.0rc2Release candidate 5.0.0rc2

Jan 8, 2026

v5.0.0rc1Release candidate 5.0.0rc1

Jan 8, 2026

v5.0.0rc0Transformers v5.0.0rc0

Dec 1, 2025

huggingface/transformers v4.x

169 releases

TL;DR

Qwen models (image and language) now load and function correctly, resolving issues with model type recognition and cached tokenizers.

Fixes Worth Knowing

Grouped beam search (advanced decoding) now correctly uses configuration parameters.
Offline tokenizers (pre-downloaded vocabularies) now load properly for Mistral models.
Learning rate scheduler parsing is more robust.

v4.57.6Patch release v4.57.6

Jan 16, 2026

v4.57.5Patch release v4.57.5

Jan 13, 2026

v4.57.4Patch release v4.57.4

Jan 13, 2026

v4.57.3Patch release v4.57.3

Nov 25, 2025

v4.57.2Patch Release v4.57.2

Nov 24, 2025

v4.57.1Patch release v4.57.1

Oct 14, 2025

v4.57.0v4.57.0: Qwen3-Next, Vault Gemma, Qwen3 VL, LongCat Flash, Flex OLMO, LFM2 VL, BLT, Qwen3 OMNI MoE, Parakeet, EdgeTAM, OLMO3

Oct 3, 2025

v4.56.2 Patch release v4.56.2

Sep 17, 2025

v4.56.1Patch release v4.56.1

Sep 4, 2025

v4.56.0v4.56: Dino v3, X-Codec, Ovis 2, MetaCLIP 2, Florence 2, SAM 2, Kosmos 2.5, HunYuan, GLMV-4.5

Aug 29, 2025

v4.55.4Patch v4.55.4

Aug 22, 2025

v4.55.3Patch release v4.55.3

Aug 21, 2025

v4.55.2Patch release 4.55.2: for FA2 users!

Aug 13, 2025

v4.55.1Patch release 4.55.1

Aug 13, 2025

v4.55.0v4.55.0: New openai GPT OSS model!

Aug 5, 2025

4.54.1Patch release 4.54.1

Jul 29, 2025

v4.54.0v4.54.0: Kernels, Transformers Serve, Ernie, Voxtral, LFM2, DeepSeek v2, ModernBERT Decoder...

Jul 25, 2025

v4.53.3Patch release v4.53.3

Jul 22, 2025

v4.53.2Patch Release v4.53.2

Jul 11, 2025

v4.53.1Patch Release v4.53.1

Jul 4, 2025

v4.53.0Release v4.53.0

Jun 26, 2025

v4.52.4Patch release: v4.52.4

May 30, 2025

v4.52.3Patch release v4.52.3

May 22, 2025

v4.52.2Patch release v4.52.2

May 21, 2025

v4.52.1v4.52.1: Qwen2.5-Omni, SAM-HQ, GraniteMoeHybrid, D-FINE, CSM, BitNet, LlamaGuard, TimesFM, MLCD, Janus, InternVL

May 20, 2025

v4.51.3Patch release v4.51.3

Apr 14, 2025

v4.51.2Patch Release 4.51.2

Apr 10, 2025

v4.51.1Patch release v4.51.1

Apr 8, 2025

v4.51.0v4.51.0: Llama 4, Phi4-Multimodal, DeepSeek-v3, Qwen3

Apr 5, 2025

v4.50.3Patch release v4.50.3

Mar 28, 2025

v4.50.2Patch release v4.50.2

Mar 27, 2025

v4.50.1Patch release v4.50.1

Mar 25, 2025

v4.50.0Release v4.50.0

Mar 21, 2025

v4.49.0v4.49.0: Helium, Qwen2.5-VL, SuperGlue, Granite Vision, Zamba2, GOT-OCR 2.0, DAB-DETR, Depth Pro, RT-DETRv2, GPTQModel

Feb 17, 2025

v4.48.3Patch release v4.48.3

Feb 7, 2025

v4.48.2Patch release v4.48.2

Jan 30, 2025

v4.48.1Patch release v4.48.1

Jan 20, 2025

v4.48.0v4.48.0: ModernBERT, Aria, TimmWrapper, ColPali, Falcon3, Bamba, VitPose, DinoV2 w/ Registers, Emu3, Cohere v2, TextNet, DiffLlama, PixtralLarge, Moonshine

Jan 10, 2025

v4.47.1v4.47.1

Dec 17, 2024

v4.47.0 v4.47.0: PaliGemma-2, I-JEPA, OLMo-2, LayerSkip, Tensor Parallel

Dec 5, 2024

v4.46.3Patch release v4.46.3

Nov 18, 2024

v4.46.2Patch release v4.46.2

Nov 5, 2024

v4.46.1Patch release v4.46.1

Oct 29, 2024

v4.46.0Release v4.46.0

Oct 24, 2024

v4.45.2Release v4.45.2

Oct 7, 2024

v4.45.1Patch Release v4.45.1

Sep 26, 2024

v4.45.0Llama 3.2, mllama, Qwen2-Audio, Qwen2-VL, OLMoE, Llava Onevision, Pixtral, FalconMamba, Modular Transformers

Sep 25, 2024

v4.44.2Release v4.44.2

Aug 22, 2024

v4.44.1Patch release v4.44.1

Aug 20, 2024

v4.44.0Release v4.44.0

Aug 6, 2024

v4.43.4v4.43.4 Patch Release

Aug 5, 2024

v4.43.3v4.43.3 Patch deepspeed

Jul 26, 2024

v4.43.2v4.43.2: Patch release

Jul 24, 2024

v4.43.1v4.43.1: Patch release

Jul 23, 2024

v4.43.0v4.43.0: Llama 3.1, Chameleon, ZoeDepth, Hiera

Jul 23, 2024

v4.42.4Patch release v4.42.4

Jul 11, 2024

v4.42.3Patch release v4.42.3

Jun 28, 2024

v4.42.2Patch release v4.42.2

Jun 28, 2024

v4.42.1v4.42.1: Patch release

Jun 27, 2024

v4.42.0v4.42.0: Gemma 2, RTDETR, InstructBLIP, LLAVa Next, New Model Adder

Jun 27, 2024

v4.41.2Release v4.41.2

May 30, 2024

v4.41.1Release v4.41.1 Fix PaliGemma finetuning, and some small bugs

May 22, 2024

v4.41.0v4.41.0: Phi3, JetMoE, PaliGemma, VideoLlava, Falcon2, FalconVLM & GGUF support

May 17, 2024

v4.40.2v4.40.2

May 6, 2024

v4.40.1v4.40.1: fix `EosTokenCriteria` for `Llama3` on `mps`

Apr 23, 2024

v4.40.0v4.40.0: Llama 3, Idefics 2, Recurrent Gemma, Jamba, DBRX, OLMo, Qwen2MoE, Grounding Dino

Apr 18, 2024

v4.39.3Release v4.39.3

Apr 2, 2024

v4.39.2Patch release v4.39.2

Mar 28, 2024

v4.39.1Patch release v4.39.1

Mar 22, 2024

v4.39.0Release v4.39.0

Mar 21, 2024

v4.38.2v4.38.2

Mar 1, 2024

v4.38.1v4.38.1

Feb 22, 2024

v4.38.0v4.38: Gemma, Depth Anything, Stable LM; Static Cache, HF Quantizer, AQLM

Feb 21, 2024

v4.37.2Patch release v4.37.2

Jan 29, 2024

v4.37.1Patch release: v4.37.1

Jan 24, 2024

v4.37.0v4.37 Qwen2, Phi-2, SigLIP, ViP-LLaVA, Fast2SpeechConformer, 4-bit serialization, Whisper longform generation

Jan 22, 2024

v4.36.2Patch release: v4.36.2

Dec 18, 2023

v4.36.1Patch release: v4.36.1

Dec 14, 2023

v4.36.0v4.36: Mixtral, Llava/BakLlava, SeamlessM4T v2, AMD ROCm, F.sdpa wide-spread support

Dec 11, 2023

v4.35.2Patch release: v4.35.2

Nov 15, 2023

v4.35.1Patch release: v4.35.1

Nov 14, 2023

v4.35.0Safetensors serialization by default, DistilWhisper, Fuyu, Kosmos-2, SeamlessM4T, Owl-v2

Nov 2, 2023

v4.34.1Patch release: v4.34.1

Oct 18, 2023

v4.34.0v4.34: Mistral, Persimmon, Prompt templating, Flash Attention 2, Tokenizer refactor

Oct 3, 2023

v4.33.3Patch release: v4.33.3

Sep 27, 2023

v4.33.2Patch release: v4.33.2

Sep 15, 2023

v4.33.1Falcon, Code Llama, ViTDet, DINO v2, VITS

Sep 6, 2023

v4.32.1Patch release: v4.32.1

Aug 28, 2023

v4.32.0IDEFICS, GPTQ Quantization

Aug 22, 2023

v4.31.0v4.31.0: Llama v2, MusicGen, Bark, MMS, EnCodec, InstructBLIP, Umt5, MRa, vIvIt

Jul 18, 2023

v4.30.2v4.30.2: Patch release

Jun 13, 2023

v4.30.1v4.30.1 Patch release

Jun 9, 2023

v4.30.0v4.30.0: 100k, Agents improvements, Safetensors core dependency, Swiftformer, Autoformer, MobileViTv2, timm-as-a-backbone

Jun 8, 2023

v4.29.2v4.29.2: Patch release

May 16, 2023

v4.29.1V4.29.1: Patch release

May 11, 2023

v4.29.0v4.29.0: Transformers Agents, SAM, RWKV, FocalNet, OpenLLaMa

May 10, 2023

v4.28.1v4.28.1: Patch release

Apr 14, 2023

v4.28.0v4.28.0: LLaMa, Pix2Struct, MatCha, DePlot, MEGA, NLLB-MoE, GPTBigCode

Apr 13, 2023

v4.27.4v4.27.4: Patch release

Mar 29, 2023

v4.27.3v4.27.3: Patch release

Mar 23, 2023

v4.27.2v4.27.2: Patch release

Mar 20, 2023

v4.27.1v4.27.1: Patch release

Mar 15, 2023

v4.27.0BridgeTower, Whisper speedup, DETA, SpeechT5, BLIP-2, CLAP, ALIGN, API updates

Mar 15, 2023

v4.26.1V4.26.1: Patch release

Feb 9, 2023

v4.26.0v4.26.0: Generation configs, image processors, backbones and plenty of new models!

Jan 25, 2023

v4.25.1PyTorch 2.0 support, Audio Spectogram Transformer, Jukebox, Switch Transformers and more

Dec 2, 2022

v4.24.0v4.24.0: ESM-2/ESMFold, LiLT, Flan-T5, Table Transformer and Contrastive search decoding

Nov 1, 2022

v4.23.1v4.23.1 Patch release

Oct 11, 2022

v4.23.0v4.23.0: Whisper, Deformable DETR, Conditional DETR, MarkupLM, MSN, `safetensors`

Oct 10, 2022

v4.22.2# v4.22.2 Patch release

Sep 27, 2022

v4.22.1v4.22.1: Patch release:

Sep 16, 2022

v4.22.0v4.22.0: Swin Transformer v2, VideoMAE, Donut, Pegasus-X, X-CLIP, ERNIE

Sep 14, 2022

v4.21.3v4.21.3: Patch release

Sep 5, 2022

v4.21.2v4.21.2: Patch release

Aug 24, 2022

v4.21.1# v4.21.1: Patch release

Aug 4, 2022

v4.21.0v4.21.0: TF XLA text generation - Custom Pipelines - OwlViT, NLLB, MobileViT, Nezha, GroupViT, MVP, CodeGen, UL2

Jul 27, 2022

v4.20.1# v4.20.1 Patch release

Jun 21, 2022

v4.20.0v4.20.0 Big Model inference, BLOOM, CvT, GPT Neo-X, LayoutLMv3, LeViT, LongT5, M-CTC-T, Trajectory Transformer and Wav2Vec2-Conformer

Jun 16, 2022

v4.19.4# v4.19.4 Patch release

Jun 10, 2022

v4.19.3# v4.19.3 Patch release

Jun 9, 2022

v4.19.2v4.19.2: Patch release

May 16, 2022

v4.19.1v4.19.1 Patch release

May 13, 2022

v4.19.0v4.19.0: OPT, FLAVA, YOLOS, RegNet, TAPEX, Data2Vec vision, FSDP integration

May 12, 2022

v4.18.0v4.18.0: Checkpoint sharding, vision models

Apr 7, 2022

v4.17.0v4.17.0: XGLM, ConvNext, PoolFormer, PLBart, Data2Vec, MaskFormer and code in the Hub

Mar 3, 2022

v4.16.2v4.16.2: Patch release

Jan 31, 2022

v4.16.1# V4.16.1: Patch Release

Jan 28, 2022

v4.16.0v4.16.0: Nyströmformer, REALM, ViTMAE, ViLT, Swin Transformer, YOSO, ...

Jan 27, 2022

v4.15.0v4.15.0

Dec 22, 2021

v4.14.1v4.14.1: Patch release

Dec 15, 2021

v4.14.0v4.14.0: Perceiver, Keras model cards

Dec 15, 2021

v4.13.0v4.13.0: Perceiver, ImageGPT, mLUKE, Vision-Text dual encoders, QDQBert, new documentation frontend

Dec 9, 2021

v4.12.5v4.12.5: Patch release

Nov 17, 2021

v4.12.4v4.12.4: Patch release

Nov 16, 2021

v4.12.3v4.12.3: Patch release

Nov 3, 2021

v4.12.2v4.12.2: Patch release

Oct 29, 2021

v4.12.1v4.12.1: Patch release

Oct 29, 2021

v4.12.0v4.12.0: TrOCR, SEW & SEW-D, Unispeech & Unispeech-SAT, BARTPho

Oct 28, 2021

v4.11.3v4.11.3: Patch release

Oct 6, 2021

v4.11.2v4.11.2: Patch release

Sep 30, 2021

v4.11.1v4.11.1: Patch release

Sep 29, 2021

v4.11.0v4.11.0: GPT-J, Speech2Text2, FNet, Pipeline GPU utilization, dynamic model code loading

Sep 27, 2021

v4.10.3v4.10.3: Patch release

Sep 22, 2021

v4.10.2v4.10.2: Patch release

Sep 10, 2021

v4.10.1v4.10.1: Patch release

Sep 10, 2021

v4.10.0v4.10.0: LayoutLM-v2, LayoutXLM, BEiT

Aug 31, 2021

v4.9.2v4.9.2: Patch release

Aug 9, 2021

v4.9.1v4.9.1: Patch release

Jul 26, 2021

v4.9.0v4.9.0: TensorFlow examples, CANINE, tokenizer training, ONNX rework

Jul 22, 2021

v4.8.2Patch release: v4.8.2

Jun 30, 2021

v4.8.1v4.8.1: Patch release

Jun 24, 2021

v4.8.0v4.8.0 Integration with the Hub and Flax/JAX support

Jun 23, 2021

v4.7.0v4.7.0: DETR, RoFormer, ByT5, HuBERT, support for torch 1.9.0

Jun 17, 2021

v4.6.1v4.6.1: Patch release

May 20, 2021

v4.6.0v4.6.0: ViT, DeiT, CLIP, LUKE, BigBirdPegasus, MegatronBERT

May 12, 2021

v4.5.1v4.5.1: Patch release

Apr 13, 2021

v4.5.0v4.5.0: BigBird, GPT Neo, Examples, Flax support

Apr 6, 2021

v4.4.2Patch release V4.4.2

Mar 18, 2021

v4.4.0v4.4.0: S2T, M2M100, I-BERT, mBART-50, DeBERTa-v2, XLSR-Wav2Vec2

Mar 16, 2021

v4.3.3v4.3.3: Patch release

Feb 24, 2021

v4.3.2V4.3.2: Patch release

Feb 9, 2021

v4.3.1v4.3.1: Patch release

Feb 9, 2021

v4.3.0v4.3.0: Wav2Vec2, ConvBERT, BORT, Amazon SageMaker

Feb 8, 2021

v4.2.2v4.2.2: Patch release

Jan 21, 2021

v4.2.1v4.2.1 Patch release

Jan 14, 2021

v4.2.0v4.2.0: LED from AllenAI, Generation Scores, TensorFlow 2x speedup, faster import

Jan 13, 2021

v4.1.1v4.1.1: TAPAS, MPNet, model parallelization, Sharded DDP, conda, multi-part downloads.

Dec 17, 2020

v4.0.1Patch release: better error message & invalid trainer attribute

Dec 9, 2020

v4.0.0Transformers v4.0.0: Fast tokenizers, model outputs, file reorganization

Nov 30, 2020

huggingface/transformers v-1.x

25 releases

TL;DR

The transformers library now supports Vault-Gemma, a new 1B-parameter text generation model from Google trained with differential privacy, offering a privacy-preserving alternative to existing models.

New

Vault-Gemma Support: Added the google/vaultgemma-1b model, trained with differential privacy for enhanced data security.
Chat Interface: Interact with Vault-Gemma directly using the transformers chat command-line tool.

Before You Upgrade

Install Vault-Gemma specifically using pip install git+https://github.com/huggingface/transformers@v4.56.1-Vault-Gemma-preview, as it’s a preview release and doesn’t follow standard versioning.

v4.56.1-Vault-Gemma-previewVault-Gemma (based on v4.56.1)

Sep 12, 2025

v4.56.0-Embedding-Gemma-previewEmbedding Gemma (based on v4.56.0)

Sep 4, 2025

4.55.0-GLM-4.5V-previewGLM-4.5V preview based on 4.55.0

Aug 11, 2025

v4.53.2-Ernie-4.5-previewErnie-4.5 and Ernie-4.5 MoE (based on v4.53.2)

Jul 23, 2025

v4.53.2-modernbert-decoder-previewModernBERT Decoder (based on v4.53.2)

Jul 16, 2025

v4.52.4-Kyutai-STT-previewKyutai-STT (based on v4.52.4)

Jun 24, 2025

v4.52.4-VJEPA-2-previewV-JEPA 2 (based on v4.52.4)

Jun 11, 2025

v4.52.4-ColQwen2-previewColQwen2 (based on v4.52.4)

Jun 2, 2025

v4.51.3-CSM-previewCSM (based on v4.51.3)

May 8, 2025

v4.51.3-GraniteMoeHybrid-previewGraniteMoeHybrid (based on v4.51.3)

May 8, 2025

v4.51.3-D-FINE-previewD-FINE (based on v4.51.3)

May 8, 2025

v4.51.3-SAM-HQ-previewSAM-HQ (based on v4.51.3)

May 8, 2025

v4.51.3-BitNet-previewBitNet (based on v4.51.3)

May 8, 2025

v4.51.3-LlamaGuard-previewLlamaGuard-4 (based on v4.51.3)

Apr 30, 2025

v4.51.3-Qwen2.5-Omni-previewQwen2.5-Omni (based on 4.51.3)

Apr 24, 2025

v4.51.3-InternVL-previewInternVL (2.5 & 3) (based on v4.51.3)

Apr 22, 2025

v4.51.3-Janus-previewJanus (based on v4.51.3)

Apr 22, 2025

v4.51.3-TimesFM-previewTimesFM (based on v4.51.3)

Apr 22, 2025

v4.51.3-MLCD-previewMLCD (based on 4.51.3)

Apr 22, 2025

v4.50.3-DeepSeek-3Deepseek v3 (based on 4.50.3)

Mar 28, 2025

v4.49.0-Mistral-3Mistral 3 (Based on v4.49.0)

Mar 18, 2025

v4.49.0-Gemma-3Gemma 3 (Based on v4.49.0)

Mar 18, 2025

v4.49.0-SigLIP-2 SigLIP-2 (Based on v4.49.0)

Feb 21, 2025

v4.49.0-SmolVLM-2SmolVLM-2 (Based on v4.49.0)

Feb 20, 2025

v4.0.0-rc-1Transformers v4.0.0-rc-1: Fast tokenizers, model outputs, file reorganization

Nov 19, 2020

huggingface/transformers v4.x prerelease

2 releases

TL;DR

Aya Vision, a new state-of-the-art multilingual multimodal model (handles images & text), is now available, enabling image understanding and text generation in 23 languages.

New

Aya Vision Models: Added 8B and 32B parameter models for multimodal tasks.
Multilingual Support: Supports 23 languages for both visual and textual understanding.

Before You Upgrade

Install using pip install git+https://github.com/huggingface/transformers@v4.49.0-AyaVision to access the Aya Vision models.

v4.49.0-AyaVisionAya Vision (Based on v4.49.0)

Mar 4, 2025

v4.3.0.rc1v4.3.0.rc1: Wav2Vec2, ConvBERT, BORT, Amazon SageMaker

Feb 4, 2021

huggingface/transformers v3.x

10 releases

TL;DR

The transformers library now uses Git repositories for model storage, enabling versioning, access control, and scalability, fundamentally changing how models are downloaded and shared.

Breaking

Model uploads using the previous system are no longer supported; upgrade to this release or use the new CLI tools.
TensorFlow users: pinned sentencepiece to 0.1.91 to resolve build issues.

New

Git-backed Model Storage: Models are now stored in Git repositories (with S3 for large files), providing versioning via tags, branches, or commit hashes (e.g., AutoTokenizer.from_pretrained("model", revision="v2.0.1")). You can even clone model repositories locally.
TensorFlow 2.0 Support: Added functionality for state-of-the-art sequence-to-sequence transformers in TensorFlow.
Seq2Seq Trainer: A specialized Trainer for sequence-to-sequence models is available, improving API support and performance.
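Pinning a revision, as in the Git-backed storage item above, might look like the sketch below. The `revision` argument to `AutoTokenizer.from_pretrained` comes from the summary itself; the wrapper name `load_pinned_tokenizer` is hypothetical, and the model ID in the usage comment is the summary's placeholder, not a real repository.

```python
from typing import Any


def load_pinned_tokenizer(model_id: str, revision: str) -> Any:
    """Load a tokenizer at a fixed revision (tag, branch, or commit hash).

    Hypothetical wrapper for illustration. The import is deferred so that
    defining this function does not require transformers to be installed.
    """
    from transformers import AutoTokenizer

    return AutoTokenizer.from_pretrained(model_id, revision=revision)


# Usage (needs transformers installed and access to the model repository):
#   tokenizer = load_pinned_tokenizer("model", revision="v2.0.1")
```

Pinning to a tag or commit hash rather than a branch keeps results reproducible even if the model repository is updated later.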

Fixes Worth Knowing

Fixed issues with pipelines (text generation, QA) and tokenizers, improving stability and functionality.
Improved error messages.

v3.5.1v3.5.1

Nov 13, 2020

v3.5.0v3.5.0: Model versioning, TensorFlow encoder-decoder models, new scripts, refactor of the `generate` method

Nov 10, 2020

v3.4.0ProphetNet, Blenderbot, SqueezeBERT, DeBERTa

Oct 20, 2020

v3.3.1

Sep 29, 2020

v3.3.0RAG

Sep 28, 2020

v3.2.0Bert Seq2Seq models, FSMT, LayoutLM, Funnel Transformer, LXMERT

Sep 22, 2020

v3.1.0Pegasus, DPR, self-documented outputs, new pipelines and MT support

Sep 1, 2020

v3.0.2Tokenizer fixes

Jul 6, 2020

v3.0.1Patch v3.0.1: Better backward compatibility for tokenizers

Jul 3, 2020

v3.0.0New tokenizer API, TensorFlow improvements, enhanced documentation & tutorials

Jun 29, 2020

huggingface/transformers v2.x

18 releases

TL;DR

The release introduces Longformer, a new model for processing long sequences of text, alongside several community notebooks demonstrating its use and other models.

Breaking

Model instantiation for BART, Flaubert, Japanese BERT variants, Finnish BERT variants, Dutch BERT, and ALBERT from TensorFlow now requires the full model ID (e.g., "cl-tohoku/bert-base-japanese") instead of relying on hardcoded URLs.

New

Longformer Support: Added the Longformer model architecture, tokenizer, and pre-trained weights for tasks like question answering and sequence classification.
Community Notebooks: Several new notebooks are available demonstrating fine-tuning and pre-training techniques for various models, including Longformer, BART, and T5.

Fixes Worth Knowing

Corrected tokenizer behavior for summarization pipelines and fast tokenizers.
Fixed issues with MNLI and SST-2 datasets.
Improved robustness of the max_len attribute and added deprecation warnings.
Fixed tokenization of extra ID symbols in the T5 tokenizer.

Before You Upgrade

Update your code to use the full model ID when instantiating

v2.11.0Longformer

Jun 2, 2020

v2.10.0Reformer, ElectraForSequenceClassification, ONNX conversion script

May 22, 2020

v2.9.1Marian

May 14, 2020

v2.9.0Trainer, TFTrainer, Multilingual BART, Encoder-decoder improvements, Generation Pipeline

May 7, 2020

v2.8.0ELECTRA, Bad word filters, bugfixes & improvements

Apr 6, 2020

v2.7.0T5 Model, BART summarization example and reduced memory, translation pipeline

Mar 30, 2020

v2.6.0BART, organizations, community notebooks, lightning examples, dropping Python 3.5

Mar 24, 2020

v2.5.1Patch v2.5.1: AutoTokenizer slow by default, bug fixes

Feb 24, 2020

v2.5.0Rust Tokenizers, DistilBERT base cased, Model cards

Feb 19, 2020

v2.4.1Patch v2.4.1: FlauBERT for AutoModel and AutoTokenizer

Jan 31, 2020

v2.4.0FlauBERT, MMBT, UmBERTo, Dutch model, improved documentation, training from scratch, clean Python code

Jan 31, 2020

v2.3.0Downstream NLP task API (feature extraction, text classification, NER, QA), Command-Line Interface and Serving – models: T5 – community-added models: Japanese & Finnish BERT, PPLM, XLM-R

Dec 20, 2019

v2.2.2Bug fixes

Dec 20, 2019

v2.2.1Bug fixes related to input shape in TensorFlow and tokenization messages

Dec 3, 2019

v2.2.0ALBERT, CamemBERT, DistilRoberta, GPT-2 XL, and Encoder-Decoder architectures

Nov 26, 2019

v2.1.1CTRL, DistilGPT-2, Pytorch TPU, tokenizer enhancements, guideline requirements

Oct 11, 2019

v2.1.0Superseded by v2.1.1

Oct 11, 2019

v2.0.0v2.0.0 - TF 2.0/PyTorch interoperability, improved tokenizers, improved torchscript support

Sep 26, 2019

huggingface/transformers v1.x

3 releases

TL;DR

The transformers library now supports DistilBERT, a faster and lighter version of BERT, alongside new checkpoints for GPT-2 Large and XLM, significantly expanding model options for various natural language processing (NLP) tasks.

Breaking

A new dependency, sacremoses (a Moses tokenizer port), is required for XLM support.
XLM tokenization in Thai, Japanese, and Chinese may require additional, optional dependencies (pythainlp, kytea, jieba) which must be installed separately.

New

DistilBERT: A distilled version of BERT offering improved speed and efficiency.
GPT-2 Large: The 774M parameter GPT-2 model is now available.
AutoModels: Generic classes for easier model instantiation using from_pretrained().

Fixes Worth Knowing

Improved multi-GPU training stability.
Corrected saving and reloading of models with pruned heads.
Fixed issues with GPT-2 and RoBERTa tokenizers related to sentence spacing.
Enhanced XLM tokenization for multilingual inputs.
Added shortcuts for accessing special token IDs (e

1.2.0DistilBERT, GPT-2 Large, XLM multilingual models, torch.hub, bug fixes

Sep 4, 2019

1.1.0New model: RoBERTa, tokenizer sequence pair handling for sequence classification models.

Aug 15, 2019

v1.0.0v1.0.0 - Name change, new models (XLNet, XLM), unified API for models and tokenizer, access to models internals, torchscript

Jul 16, 2019

huggingface/transformers v0.x

9 releases

TL;DR

This release updates the transformers library with improved model saving/loading and replaces the old learning rate warmup with more flexible scheduling options.

Breaking

warmup_linear in OpenAIAdam and BertAdam has been removed; use the new learning rate schedule classes instead.
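To make the idea of a warmup schedule concrete, here is a standalone sketch of a linear-warmup, linear-decay learning rate multiplier. It is not the library's schedule class: the summary does not name the replacement classes, so the function name `linear_warmup_then_decay`, its parameters, and the exact curve shape are all assumptions chosen for illustration.

```python
def linear_warmup_then_decay(
    step: int, total_steps: int, warmup_fraction: float = 0.1
) -> float:
    """Return a learning-rate multiplier in [0, 1] for the given step.

    Illustrative only: ramps linearly from 0 to 1 over the first
    warmup_fraction of training, then decays linearly back to 0.
    """
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    if not 0.0 <= warmup_fraction < 1.0:
        raise ValueError("warmup_fraction must be in [0, 1)")

    progress = step / total_steps
    if progress < warmup_fraction:
        # Warmup phase: scale up in proportion to progress.
        return progress / warmup_fraction
    # Decay phase: 1.0 at the end of warmup, 0.0 at the final step.
    return max(0.0, (1.0 - progress) / (1.0 - warmup_fraction))


base_lr = 5e-5  # assumed base learning rate for the demo
for step in (0, 5, 10, 55, 100):
    lr = base_lr * linear_warmup_then_decay(step, total_steps=100)
    print(f"step {step:3d}: lr = {lr:.2e}")
```

The multiplier is kept separate from the base learning rate so the same function can drive any optimizer that accepts a per-step scale factor.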

New

BERT language model fine-tuning scripts have been added.
Support for GLUE (natural language understanding benchmark) tasks is expanded in run_classifier.py.

Fixes Worth Knowing

Tokenizers now support input sequences longer than 512 tokens.
GPT-2 loss computation and FP16 training stability are improved.
Model serialization (saving and loading) is more reliable.

v0.6.2Better model/tokenizer serialization, relax network connection requirements, new scripts and bug fixes

Apr 25, 2019

v0.6.1v0.6.1 - Small install tweak release

Feb 18, 2019

v0.6.0v0.6.0 - Adding OpenAI small GPT-2 pretrained model

Feb 18, 2019

v0.5.1Bug fix update to load the pretrained `TransfoXLModel` from s3, added fallback for OpenAIGPTTokenizer when SpaCy is not installed

Feb 13, 2019

v0.5.0Adding OpenAI GPT and Transformer-XL pretrained models, python2 support, pre-training script for BERT, SQuAD 2.0 example

Feb 11, 2019

v0.4.04x speed-up using NVIDIA apex, new multi-choice classifier and example for SWAG-like dataset, pytorch v1.0, improved model loading, improved examples...

Dec 14, 2018

v0.3.0Added two pre-trained models and one new fine-tuning class

Nov 30, 2018

v0.2.0Small improvements and a few bug fixes.

Nov 26, 2018

v0.1.2First release

Nov 17, 2018