QTLP English Corpus for the Medical Domain


The data set contains documents that were acquired from the web, automatically detected to be in English, and automatically classified as relevant to the "MEDICAL" (MED) domain. Based on specific patterns detected in the URL or the title, each document has also been classified into one of the following genre categories: "Reference", "News/Journalism", "Discussion", "Commercial" and "Other".
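
The pattern-based genre assignment described above could look roughly like the following sketch. The actual patterns used for this corpus are not given here, so the regular expressions, the `GENRE_PATTERNS` table and the `classify_genre` function are all hypothetical and serve only to illustrate the general idea of matching on the URL and title, with "Other" as the fallback.

```python
import re

# Hypothetical patterns, for illustration only: the rules actually used
# to build the corpus are not documented in this description.
GENRE_PATTERNS = [
    ("Reference", re.compile(r"wiki|encyclopedia|glossary|dictionary", re.I)),
    ("News/Journalism", re.compile(r"news|press|journal", re.I)),
    ("Discussion", re.compile(r"forum|thread|blog|comment", re.I)),
    ("Commercial", re.compile(r"shop|store|product|pricing", re.I)),
]


def classify_genre(url: str, title: str) -> str:
    """Return the first genre whose pattern matches the URL or title."""
    text = f"{url} {title}"
    for genre, pattern in GENRE_PATTERNS:
        if pattern.search(text):
            return genre
    # No pattern matched: fall back to the catch-all category.
    return "Other"


if __name__ == "__main__":
    print(classify_genre("https://example.org/wiki/Aspirin", "Aspirin"))
    print(classify_genre("https://example.com/about", "About us"))
```

Patterns are checked in list order, so a document that matches several genres is assigned to the first one listed; a real pipeline would need an explicit rule for such ties.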

If you want your webpage/website to be removed from these corpora, please contact us.
