Skip to content

Parallel Corpus of documents from the Technical Regulations Information System for German-Spanish (v0.2)

Specialized parallel corpus Spanish-German (ES-ES, DE-AT and DE-DE), texts from the European Commission between 1997-2010. The texts are technical regulations in a variety of domains. To be sentence aligned. The current version 0.2 contains the files for 1999-2010 in the domain B00: Construction (205 files; 70,648 sentences; 1,563,000 words). This version also fixed some formatting errors detected in version 0.1 of the corpus. Versions 0.1 and 0.2 are kept as individual records because they are (currently) intended to be downloaded individually. Neither version are currently available for download, as the licensing terms and the distributor is still under negotiation. The current contact person is therefore the corpus developer, Carla Parra.

Specialized parallel corpus Spanish-German (ES-ES, DE-AT and DE-DE), texts from the European Commission between 1997-2010. The texts are technical regulations in a variety of domains. To be sentence aligned. The current version 0.2 contains the files for 1999-2010 in the domain B00: Construction (205 files; 70,648 sentences; 1,563,000 words). This version also fixed some formatting errors detected in version 0.1 of the corpus. Versions 0.1 and 0.2 are kept as individual records because they are (currently) intended to be downloaded individually. Neither version are currently available for download, as the licensing terms and the distributor is still under negotiation. The current contact person is therefore the corpus developer, Carla Parra.

Extended metadata

Download resources

Download metadata