The latest Sentence Transformers update introduces native support for multimodal embedding and reranker models. This allows developers to map images and text into a shared vector space using a single library. It simplifies the pipeline for visual search and retrieval-augmented generation. Practitioners can now deploy multimodal RAG systems with significantly less boilerplate code.