December 16, 2024
OpenNER 1.0: Standardized Open-Access Named Entity Recognition Datasets in 50+ Languages
title: OpenNER 1.0: Standardized Open-Access Named Entity Recognition Datasets in 50+ Languages
publish date:
2024-12-12
authors:
Chester Palen-Michel et al.
paper id:
2412.09587v1
abstract:
We present OpenNER 1.0, a standardized collection of openly available named entity recognition (NER) datasets. OpenNER contains 34 datasets spanning 51 languages, annotated in varying named entity ontologies. We correct annotation format issues, standardize the original datasets into a uniform representation, map entity type names to be more consistent across corpora, and provide the collection in a structure that enables research in multilingual and multi-ontology NER. We provide baseline models using three pretrained multilingual language models to compare the performance of recent models and facilitate future research in NER.
Edited and compiled by: wanghaisheng. Updated: December 16, 2024