MultiUN: Multilingual UN Parallel Text 2000—2009-AR_ZH

MultiUN subcorpus AR-ZH

This is a collection of translated documents from the United Nations originally compiled by Andreas Eisele and Yu Chen (see http://www.euromatrixplus.net/multi-un/). Please cite MultiUN: A Multilingual corpus from United Nation Documents, Andreas Eisele and Yu Chen, LREC 2010

7 languages, 21 bitexts
total number of files: 420,404
total number of tokens: 1.64G
total number of sentence fragments: 69.30M

Please cite the following article if you use any part of the corpus in your own work:
Jörg Tiedemann, 2012, Parallel Data, Tools and Interfaces in OPUS. In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012)

You don’t have the permission to edit this resource.
  • United Nations translated documents