--- license: apache-2.0 language: - uk metrics: - f1 - precision - recall base_model: - 51la5/roberta-large-NER pipeline_tag: token-classification library_name: spacy model-index: - name: roberta-large-ner-uk results: - task: name: NER type: token-classification metrics: - name: NER Precision type: precision value: 0.9468 - name: NER Recall type: recall value: 0.9416 - name: NER F1 type: f1 value: 0.9442 tags: - ner - uk datasets: - lang-uk/UberText-NER-Silver --- # roberta-large-ner-uk A transformer-based NER model for Ukrainian, trained on a combination of human-annotated data (NER-UK 2.0) and high-quality silver-standard annotations (UberText-NER-Silver). Based on `roberta-large-NER`, this model achieves state-of-the-art performance on a wide range of named entities in Ukrainian. ## Model Details - **Model type:** Transformer-based encoder (spaCy pipeline) - **Language (NLP):** Ukrainian - **License:** Apache 2.0 - **Finetuned from model:** `51la5/roberta-large-NER` - **Entity Types (13):** `PERS`, `ORG`, `LOC`, `DATE`, `TIME`, `JOB`, `MON`, `PCT`, `PERIOD`, `DOC`, `QUANT`, `ART`, `MISC` ## Usage ```python import spacy nlp = spacy.load("roberta-large-ner-uk") doc = nlp("Президент України Володимир Зеленський виступив у Брюсселі.") print([(ent.text, ent.label_) for ent in doc.ents]) ``` ## Authors [Vladyslav Radchenko](https://huggingface.co/pofce), [Nazarii Drushchak](https://huggingface.co/ndrushchak)