#2302025-06-19
SAM-3 Lite-Text lands in Transformers: 88% smaller text encoder, same segmentation quality
Hugging Face Transformers now supports SAM-3 Lite-Text — a distilled MobileCLIP student that replaces SAM-3's heavy CLIP ViT-L/14 text encoder, cutting parameters from 353.72M to 42.54M while keeping vision-language segmentation quality intact.