Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
veryhungryhippo 's Collections
LLM Benchmarks
Multimodal-LLM
Diffusion Inpainting
Diffusion Models Fundamental Papers (Read First)

Multimodal-LLM

updated Jul 14
Upvote
-

  • VLM2Vec-V2: Advancing Multimodal Embedding for Videos, Images, and Visual Documents

    Paper • 2507.04590 • Published Jul 7 • 16

  • Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning

    Paper • 2507.05255 • Published Jul 7 • 74
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs