QTLP English CC Corpus for the Automotive Domain

QTLP_AUTOMOTIVE_EN_CC

The data set contains documents that were acquired from the web; were automatically detected to be in the English language; were automatically classified as relevant to the "Automotive" (AUTO) domain; and were automatically detected to be available under a Creative Commons license. The documents have been classified (based on specific patterns which were detected in the URL or the title of the documents) into one of the genre categories: "Reference", "News/Journalism", "Discussion", "Commercial" and "Other".

NOTE
If you want your webpage/website to be removed from these corpora, please contact us.

You don’t have the permission to edit this resource.Sorry, the size of data you want to process exceeds the limit set for simple registered users. Please contact us at qt21-helpdesk@ilsp.gr for more information on how to acquire the necessary rights.