Siglip Vision Encoder

Text and Vision embeddings in a shared hyperspace