Image-text Model

  • Number of models: 42

Instruction Model

Alibaba-NLP/gme-Qwen2-VL-2B-Instruct

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
32.8K | 1.5K | 2.2B | 8.2 GB | 2024-12-24 | cmn-Hans, eng-Latn

Alibaba-NLP/gme-Qwen2-VL-7B-Instruct

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
32.8K | 3.6K | 8.3B | 30.9 GB | 2024-12-24 | cmn-Hans, eng-Latn

TIGER-Lab/VLM2Vec-Full

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
131.1K | 3.1K | 4.2B | 7.7 GB | 2024-10-08 | eng-Latn

TIGER-Lab/VLM2Vec-LoRA

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
131.1K | 3.1K | not specified | not specified | 2024-10-08 | eng-Latn

ibm-granite/granite-vision-3.3-2b-embedding

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
128.0K | 128 | 3.0B | 11.1 GB | 2025-06-11 | eng-Latn

intfloat/mmE5-mllama-11b-instruct

License: mit

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
128.0K | 4.1K | 10.6B | 19.8 GB | 2025-02-12 | eng-Latn

Citation
@article{chen2025mmE5,
  title={mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data},
  author={Chen, Haonan and Wang, Liang and Yang, Nan and Zhu, Yutao and Zhao, Ziliang and Wei, Furu and Dou, Zhicheng},
  journal={arXiv preprint arXiv:2502.08468},
  year={2025}
}

jinaai/jina-clip-v1

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
8.2K | 768 | 223.0M | 849.0 MB | 2024-05-30 | eng-Latn

jinaai/jina-embeddings-v4

License: cc-by-nc-4.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
32.8K | 2.0K | 3.8B | 7.3 GB | 2025-06-24 | afr-Latn, amh-Latn, ara-Latn, asm-Latn, aze-Latn, ... (99)

microsoft/LLM2CLIP-Openai-B-16

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
not specified | 1.3K | 361.0M | not specified | 2024-11-07 | eng-Latn

microsoft/LLM2CLIP-Openai-L-14-224

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
not specified | 1.3K | 578.0M | not specified | 2024-11-07 | eng-Latn

microsoft/LLM2CLIP-Openai-L-14-336

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
not specified | 1.3K | 579.0M | not specified | 2024-11-07 | eng-Latn

nomic-ai/colnomic-embed-multimodal-3b

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
128.0K | 128 | 3.0B | 7.0 GB | 2025-03-31 | deu-Latn, eng-Latn, fra-Latn, ita-Latn, spa-Latn

nomic-ai/colnomic-embed-multimodal-7b

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
128.0K | 128 | 7.0B | 14.1 GB | 2025-03-31 | deu-Latn, eng-Latn, fra-Latn, ita-Latn, spa-Latn

nomic-ai/nomic-embed-vision-v1.5

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
2.0K | 768 | 92.9M | 355.0 MB | 2024-06-08 | eng-Latn

nvidia/llama-nemoretriever-colembed-1b-v1

License: https://huggingface.co/nvidia/llama-nemoretriever-colembed-1b-v1/blob/main/LICENSE

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
8.2K | 2.0K | 2.4B | 9.0 GB | 2025-06-27 | eng-Latn

nvidia/llama-nemoretriever-colembed-3b-v1

License: https://huggingface.co/nvidia/llama-nemoretriever-colembed-1b-v1/blob/main/LICENSE

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
8.2K | 3.1K | 4.4B | 16.4 GB | 2025-06-27 | eng-Latn

royokong/e5-v

License: not specified

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
8.2K | 4.1K | 8.4B | 15.6 GB | 2024-07-17 | eng-Latn

vidore/colSmol-256M

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
8.2K | 128 | 256.0M | 800.0 MB | 2025-01-22 | eng-Latn

vidore/colSmol-500M

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
8.2K | 128 | 500.0M | 1.2 GB | 2025-01-22 | eng-Latn

vidore/colpali-v1.1

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
16.4K | 128 | 2.9B | 4.6 GB | 2024-08-21 | eng-Latn

vidore/colpali-v1.2

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
16.4K | 128 | 2.9B | 4.6 GB | 2024-08-26 | eng-Latn

vidore/colpali-v1.3

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
16.4K | 128 | 2.9B | 4.6 GB | 2024-11-01 | eng-Latn

vidore/colqwen2-v1.0

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
32.8K | 128 | 2.2B | 7.0 GB | 2025-11-03 | eng-Latn

vidore/colqwen2.5-v0.2

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
128.0K | 128 | 3.0B | 7.0 GB | 2025-01-31 | eng-Latn

Non-instruction Model

BAAI/bge-visualized-base

License: not specified

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
512.0 | 768 | 196.0M | 1.6 GB | 2024-06-06 | eng-Latn

BAAI/bge-visualized-m3

License: not specified

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
8.2K | 1.0K | 872.9M | 4.2 GB | 2024-06-06 | eng-Latn

Cohere/Cohere-embed-v4.0

License: not specified

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
128.0K | 1.5K | not specified | not specified | 2024-12-01 | afr-Latn, amh-Ethi, ara-Arab, asm-Beng, aze-Latn, ... (109)

QuanSun/EVA02-CLIP-B-16

License: mit

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
77.0 | 512 | 149.0M | 568.0 MB | 2023-04-26 | eng-Latn

QuanSun/EVA02-CLIP-L-14

License: mit

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
77.0 | 768 | 428.0M | 1.6 GB | 2023-04-26 | eng-Latn

QuanSun/EVA02-CLIP-bigE-14

License: mit

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
77.0 | 1.0K | 4.7B | 17.5 GB | 2023-04-26 | eng-Latn

QuanSun/EVA02-CLIP-bigE-14-plus

License: mit

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
77.0 | 1.0K | 5.0B | 18.6 GB | 2023-04-26 | eng-Latn

Salesforce/blip-image-captioning-base

License: bsd-3-clause

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
512.0 | 768 | 247.0M | 942.0 MB | 2023-08-01 | eng-Latn

Salesforce/blip-image-captioning-large

License: bsd-3-clause

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
512.0 | 768 | 470.0M | 1.8 GB | 2023-12-07 | eng-Latn

Salesforce/blip-itm-base-coco

License: bsd-3-clause

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
512.0 | 768 | 247.0M | 942.0 MB | 2023-08-01 | eng-Latn

Salesforce/blip-itm-base-flickr

License: bsd-3-clause

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
512.0 | 768 | 247.0M | 942.0 MB | 2023-08-01 | eng-Latn

Salesforce/blip-itm-large-coco

License: bsd-3-clause

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
512.0 | 768 | 470.0M | 1.8 GB | 2023-08-01 | eng-Latn

Salesforce/blip-itm-large-flickr

License: bsd-3-clause

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
512.0 | 768 | 470.0M | 1.8 GB | 2023-08-01 | eng-Latn

Salesforce/blip-vqa-base

License: bsd-3-clause

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
512.0 | 768 | 247.0M | 1.4 GB | 2023-12-07 | eng-Latn

Salesforce/blip-vqa-capfilt-large

License: bsd-3-clause

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
512.0 | 768 | 247.0M | 942.0 MB | 2023-01-22 | eng-Latn

Salesforce/blip2-opt-2.7b

License: mit

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
not specified | 768 | 3.7B | 14.0 GB | 2024-03-22 | eng-Latn

Salesforce/blip2-opt-6.7b-coco

License: mit

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
not specified | 768 | 7.8B | 28.9 GB | 2024-03-31 | eng-Latn

cohere/embed-english-v3.0

License: not specified

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
not specified | 1.0K | not specified | not specified | 2024-10-24 | eng-Latn

cohere/embed-multilingual-v3.0

License: not specified

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
not specified | 1.0K | not specified | not specified | 2024-10-24 | not specified

google/siglip-base-patch16-224

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
64.0 | 768 | 203.0M | 775.0 MB | 2024-01-08 | eng-Latn

google/siglip-base-patch16-256

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
64.0 | 768 | 203.0M | 775.0 MB | 2024-01-08 | eng-Latn

google/siglip-base-patch16-256-multilingual

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
64.0 | 768 | 371.0M | 1.4 GB | 2024-01-08 | eng-Latn

google/siglip-base-patch16-384

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
64.0 | 768 | 203.0M | 776.0 MB | 2024-01-08 | eng-Latn

google/siglip-base-patch16-512

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
64.0 | 768 | 204.0M | 777.0 MB | 2024-01-08 | eng-Latn

google/siglip-large-patch16-256

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
64.0 | 1.0K | 652.0M | 2.4 GB | 2024-01-08 | eng-Latn

google/siglip-large-patch16-384

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
64.0 | 1.0K | 652.0M | 2.4 GB | 2024-01-08 | eng-Latn

google/siglip-so400m-patch14-224

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
16.0 | 1.2K | 877.0M | 3.3 GB | 2024-01-08 | eng-Latn

google/siglip-so400m-patch14-384

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
64.0 | 1.2K | 878.0M | 3.3 GB | 2024-01-08 | eng-Latn

google/siglip-so400m-patch16-256-i18n

License: apache-2.0

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
64.0 | 1.2K | 1.1B | 4.2 GB | 2024-01-08 | eng-Latn

kakaobrain/align-base

License: not specified

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
64.0 | 768 | 176.0M | 671.0 MB | 2023-02-24 | eng-Latn

laion/CLIP-ViT-B-16-DataComp.XL-s13B-b90K

License: mit

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
77.0 | 512 | 150.0M | 572.0 MB | 2023-04-26 | eng-Latn

laion/CLIP-ViT-B-32-DataComp.XL-s13B-b90K

License: mit

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
77.0 | 512 | 151.0M | 576.0 MB | 2023-04-26 | eng-Latn

laion/CLIP-ViT-B-32-laion2B-s34B-b79K

License: mit

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
77.0 | 512 | 151.0M | 577.0 MB | 2022-09-15 | eng-Latn

laion/CLIP-ViT-H-14-laion2B-s32B-b79K

License: mit

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
77.0 | 1.0K | 986.0M | 3.7 GB | 2022-09-15 | eng-Latn

laion/CLIP-ViT-L-14-DataComp.XL-s13B-b90K

License: mit

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
77.0 | 768 | 428.0M | 1.6 GB | 2023-04-26 | eng-Latn

laion/CLIP-ViT-L-14-laion2B-s32B-b82K

License: mit

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
77.0 | 768 | 428.0M | 1.6 GB | 2022-09-15 | eng-Latn

laion/CLIP-ViT-bigG-14-laion2B-39B-b160k

License: mit

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
77.0 | 1.3K | 2.5B | 9.5 GB | 2023-01-23 | eng-Latn

laion/CLIP-ViT-g-14-laion2B-s34B-b88K

License: mit

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
77.0 | 1.0K | 1.4B | 5.1 GB | 2023-03-06 | eng-Latn

openai/clip-vit-base-patch16

License: not specified

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
77.0 | 512 | 151.0M | 576.0 MB | 2021-02-26 | eng-Latn

openai/clip-vit-base-patch32

License: not specified

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
77.0 | 512 | 151.0M | 576.0 MB | 2021-02-26 | eng-Latn

openai/clip-vit-large-patch14

License: not specified

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
77.0 | 768 | 428.0M | 1.6 GB | 2021-02-26 | eng-Latn

voyageai/voyage-multimodal-3

License: mit

Max Tokens | Embedding dimension | Parameters | Required Memory | Release date | Languages
32.8K | 1.0K | not specified | not specified | 2024-11-10 | not specified
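
The listings above abbreviate token limits, embedding dimensions, and parameter counts (for example 32.8K, 223.0M, 8.3B). To compare models programmatically, these strings can be expanded into integers. The following is a minimal sketch, not part of the page itself: the helper name `parse_count` is hypothetical, and it assumes K, M, and B are decimal multipliers (thousand, million, billion), which the page does not state. Because the page shows rounded values, the results are approximate (a limit shown as 32.8K expands to 32800, not necessarily the exact underlying figure).

```python
import re

# Assumption: K, M and B are decimal multipliers; the page does not define them.
_SUFFIXES = {"": 1, "K": 1e3, "M": 1e6, "B": 1e9}


def parse_count(text):
    """Expand an abbreviated count such as '32.8K' into an integer.

    Returns None for values that are not numeric, e.g. 'not specified'.
    """
    match = re.fullmatch(r"(\d+(?:\.\d+)?)([KMB]?)", text.strip())
    if match is None:
        return None
    number, suffix = match.groups()
    # round() absorbs floating-point noise from the multiplication.
    return round(float(number) * _SUFFIXES[suffix])


if __name__ == "__main__":
    for raw in ["32.8K", "768", "223.0M", "8.3B", "not specified"]:
        print(raw, "->", parse_count(raw))
```

Returning None for "not specified" keeps missing values distinct from zero, so they can be filtered out before sorting or comparing models.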