Online CQP Demos
The EUROPARL parallel corpus contains proceedings of the European Parliament from the years 1996–2003. This Web interface gives access to EUROPARL version 3, which is distributed as part of the OPUS collection of freely available parallel corpora. The Web interface covers six languages (English, German, French, Spanish, Italian and Dutch), with close to 40 million words in each language and full pairwise alignments. The texts have been POS-tagged and lemmatised with the IMS TreeTagger. Some meta-information on dates and speakers is also included. The EUROPARL corpus was originally compiled and sentence-aligned by Philipp Koehn; an improved and extended version (running up to 2009) can be downloaded from the Europarl corpus homepage.