JRC-Names-a multilingual named entity resource


JRC-Names is a highly multilingual named entity resource for person and organisation names (called 'entities'). It consists of large lists of names and their many spelling variants (up to hundreds for a single person), including across scripts (Latin, Greek, Arabic, Cyrillic, Japanese, Chinese, etc.). The named entity resource file with the list of spelling variants is accompanied by Java-implemented demonstrator software that (a) allows to produce - for any input name - a list of known spelling variants, and that (b) analyses UTF8-encoded text files to find known entity mentions, returning the name variant found, the preferred display name for that entity, the unique name identifier for that name, the position of the entity name in the text, and its length in characters.

Usage conditions

By downloading and/or using JRC-Names, you agree to the usage conditions formulated in the licence, which is available at http://optima.jrc.it/Resources/LICENCE-EULA_JRC-Names_2011.pdf.
Privacy statement

JRC-Names is subject to a privacy statement.

You don’t have the permission to edit this resource.