Spanish-German website parallel corpus (Processed)

Provided by Directorate-General for Communications Networks, Content and Technology

Dataset information

Catalog

Directorate-General for Communications Networks, Content and Technology

Country of origin

EU institutions

Updated

2019.04.11 00:00

Created

Available languages

English

Keywords

group-resources-for-language-technologies

Datasource

Official portal for European data

Quality scoring

250

Dataset description

This is a parallel corpus of bilingual texts crawled from multilingual websites. It contains 2,840 translation units (TUs). Crawling period: 15/11/2016 - 23/01/2017.

The source data already went through a strict validation process, which discarded:
- TUs from crawled websites that do not comply with the PSI directive,
- TUs with more than 99% of misspelled tokens,
- TUs identified during the manual validation process, and all TUs from websites whose error rate in the sample extracted for manual validation is strictly above the following thresholds: 50% of TUs with language identification errors, 50% of TUs with alignment errors, 50% of TUs with tokenization errors, 20% of TUs identified as machine-translated content, 50% of TUs with translation errors.

This dataset was created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) actions SMART 2014/1074 and SMART 2015/1091. For further information on the project: http://lr-coordination.eu.
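The discard rules in the description can be read as two simple checks, sketched below in Python. This is only an illustration of the stated thresholds, not the ELRC validation code: the function names, the dictionary keys, and the reading that a website is dropped as soon as any one rate exceeds its threshold are all assumptions.

```python
# Site-level thresholds as stated in the dataset description.
# The key names are hypothetical labels chosen for this sketch.
SITE_THRESHOLDS = {
    "language_identification": 0.50,
    "alignment": 0.50,
    "tokenization": 0.50,
    "machine_translated": 0.20,
    "translation": 0.50,
}


def site_is_discarded(error_rates):
    """Return True if any sampled error rate is strictly above its threshold.

    Assumption: exceeding a single threshold is enough to discard the site.
    Missing error types are treated as a rate of 0.0.
    """
    return any(
        error_rates.get(name, 0.0) > limit
        for name, limit in SITE_THRESHOLDS.items()
    )


def tu_is_discarded(misspelled_tokens, total_tokens):
    """Return True if more than 99% of the TU's tokens are misspelled.

    Integer arithmetic avoids floating-point rounding at the boundary.
    Assumption: an empty TU is not discarded by this rule.
    """
    if total_tokens == 0:
        return False
    return misspelled_tokens * 100 > 99 * total_tokens
```

Under this reading, a site whose sample shows exactly 20% machine-translated TUs is kept, because the description says the rate must be strictly above the threshold.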

FAQ

Frequently Asked Questions

Some basic information about API Store ®.

The operation and development of the APIs are currently fully funded by the company Apitalks, and their use is free of charge.

Yes, you can.

All important information, such as the time of the last update and the license, is included in the response to each API call.

In case of a major update that is not compatible with the previous version of an API, we keep both versions available for 30 days so that you have enough time to migrate to the new version. We will inform you about the changes in advance by e-mail.

API Store ®

API Store provides access to European Open Data via a scalable and reliable REST API.