Small multilingual LLMs for annotating and curating LLM training data.
AI & ML interests
Open, Multilingual, European, Generative, Foundational LLM
Recent Activity
Organization Card
Europe's leading AI companies and research institutions combine their forces and expertise to develop next-generation open-source language models in an unprecedented collaboration to advance European AI capabilities, the OpenEuroLLM project
models
9
openeurollm/datamix-2b-en-80pct-DPO-HelpSteer3-16k
Text Generation
•
2B
•
Updated
•
1
openeurollm/datamix-2b-70-30
Updated
•
52
openeurollm/datamix-2b-60-40
Updated
•
62
openeurollm/datamix-2b-50-50
Updated
•
60
openeurollm/datamix-9b-60-40
Updated
•
1.47k
openeurollm/datamix-2b-en
Updated
•
2
openeurollm/datamix-2b-90-10
Updated
•
65
openeurollm/datamix-2b-80-20
Updated
•
56
openeurollm/hplt2c-nemotron-cc_eurollm_500BT
Updated
datasets
8
openeurollm/propella-annotations
Viewer
•
Updated
•
3.87B
•
124
•
3
openeurollm/battle-annotations
Viewer
•
Updated
•
165
•
13
openeurollm/contaminated-documents
Viewer
•
Updated
•
40.3k
•
2
openeurollm/evaluation_singularity_images
Updated
•
10
openeurollm/ArenaHard-EU-v0-bis
Viewer
•
Updated
•
30
•
1
openeurollm/ArenaHard-EU-v0
Viewer
•
Updated
•
360
•
23
openeurollm/nemotron-cc-10K-sample-translated-judged
Viewer
•
Updated
•
9.07M
•
11
openeurollm/nemotron-cc-10K-sample-translated
Viewer
•
Updated
•
450k
•
334