The latest Sentence Transformers update adds native support for multimodal embedding and reranker models. This integration allows developers to map images and text into a shared vector space more efficiently. It streamlines the pipeline for retrieval-augmented generation. Practitioners can now deploy complex cross-modal search systems using a unified, standardized library.