Please see https://ivdnt.org (many pages are also available in English) for information about all our various projects. This page focuses on the APIs (application programming interfaces).
Open source projects
Actively developed projects
- BlackLab, a corpus search engine that supports advanced token-based querying as well as some (dependency) relations querying and parallel corpora.
- corpus-frontend, a frontend for BlackLab that allows users to search and browse corpora.
Older projects
- OpenConvert, a text conversion tool (from e.g. Word, HTML, txt) to corpus formats TEI or FoLiA).
- MBMP-morphological parser, a memory-based morphological parser for Python
- COBALT, a corpus annotation tool
- AttestationTool, a multi-purpose GUI used in the production of computational lexica and gold standard data for NE tagging
Publicly available APIs
We can a number of APIs that can be publicly accessed.
Please note:
- our data is copyrighted. Small scale personal use is fine, but for larger scale and/or commercial use, you need a license. (contact us)
- please don’t overload the servers with requests
- we cannot provide official support
- we cannot guarantee availability now or in the future
Corpora
Corpora are large text collections. More information about our corpora can be found on our main website.
Our corpora can be accessed through the BlackLab API, which is documented here.
Here are the API endpoints:
- Brieven als Buit, a corpus with Dutch letters sent by sailors from the second half of the 17th to the early 19th centuries. (plus supplement)
- Corpus Hedendaags Nederlands (login required), a large corpus with contemporary Dutch texts.
- OpenSonar/CGN (login required), a reference corpus of contemporary written Dutch (OpenSonar) and spoken Dutch (CGN).
- Corpus Gysseling, corpus of 13th century Middle Dutch documents.
- Couranten, 17th century Dutch newspapers available in Delpher.
- Corpus Juridisch Nederlands, a collection of law texts from the period 1814-1989.
- Gekaapte brieven (hijacked letters), a collection of around 6000 letters and other documents that were passed to Dutch ships as mail in the 17th century.
Dictionaries
We have a number of contemporary and historical dictionaries available through APIs.