Tag: Mistral
Ministral 3
2026-03-02 distillation Mistral text vision Introduction of the Ministral 3 models from the French commercial vendor Mistral AI. These are language-and-vision models distilled from the Mistral Small 3.1 model to even smaller sizes by a process called Cascade Distillation, which is the main topic of the whitepaper. Access: $ Basic
Embeddings from generative models
2025-08-01 theory applications attention Mistral For text generation you usually want a "decoder" model; for other text tasks you usually want an "encoder." Here we look at modifying a decoder model to change it into an encoder. Access: $ Basic