OpenOffice3 subcorpus DE-EN (TMX)

A collection of documents from http://www.openoffice.org/.

8 languages, 28 bitexts
total number of files: 18,120
total number of tokens: 3.56M
total number of sentence fragments: 0.62M

Please cite the following article if you use any part of the corpus in your own work:
Jörg Tiedemann, 2009, News from OPUS - A Collection of Multilingual Parallel Corpora with Tools and Interfaces. In N. Nicolov, K. Bontcheva, G. Angelova, and R. Mitkov (eds.), Recent Advances in Natural Language Processing (vol V), pages 237-248, John Benjamins, Amsterdam/Philadelphia.

ATTENTION
Please check the Public Documentation License (PDL) at http://www.openoffice.org/licenses/pdl.pdf
