Datasets used to train SmolDocling
HuggingFaceM4
Team
company
WebSight is a dataset of 823,000 HTML/CSS code samples representing synthetically generated English websites, each paired with a corresponding screenshot.
- HuggingFaceM4/WebSight
  Viewer • Updated • 2.75M • 6.18k • 387
- HuggingFaceM4/VLM_WebSight_finetuned
  Text Generation • 8B • Updated • 164 • 192
- Screenshot to HTML
  ⚡ 921 • Convert screenshot to HTML code and preview
- Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
  Paper • 2403.09029 • Published • 57
Collection gathering artifacts related to OBELICS
- OBELICS Interactive Map
  16 • Explore the OBELICS dataset with an interactive map
- OBELICS Web Document Visualization
  10
- HuggingFaceM4/OBELICS
  Viewer • Updated • 276M • 22.7k • 165
- OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents
  Paper • 2306.16527 • Published • 47
Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation.
- IDEFICS2 Playground
  169 • Chat with a visual AI assistant using text and images
- HuggingFaceM4/idefics2-8b
  Image-Text-to-Text • 8B • Updated • 139k • 620
- HuggingFaceM4/idefics2-8b-chatty
  Image-Text-to-Text • 8B • Updated • 163 • 95
- HuggingFaceM4/idefics2-8b-base
  Image-Text-to-Text • 8B • Updated • 1.62k • 28
Collection assembling all the models and spaces related to IDEFICS