Corpora

Ponyland houses various corpora useful for language and speech research, including CGN, SoNaR and BNC. They are all available at /vol/bigdata/corpora. This is the full list, automatically updated daily:

search?q=dynamic%3Aponyland%3Acorpora&btnI=lucky

We also have a folder /vol/bigdata/datasets for smaller, more specific, personally collected 'corpora'.
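
To check what is currently available from a script, something like the following minimal sketch may help. It assumes each corpus sits in its own top-level subdirectory of /vol/bigdata/corpora, which this page does not state explicitly; the function name list_corpora is made up for illustration. The same function can be pointed at /vol/bigdata/datasets.

```python
from pathlib import Path

# Path taken from this page; it only exists on the Ponyland servers.
CORPORA_ROOT = Path("/vol/bigdata/corpora")


def list_corpora(root=CORPORA_ROOT):
    """Return the sorted names of the subdirectories directly under root.

    Assumption: one corpus per top-level subdirectory. Returns an empty
    list if root does not exist or is not a directory.
    """
    root = Path(root)
    if not root.is_dir():
        return []
    return sorted(p.name for p in root.iterdir() if p.is_dir())


if __name__ == "__main__":
    for name in list_corpora():
        print(name)
```

Calling list_corpora("/vol/bigdata/datasets") would list the smaller datasets in the same way.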

wiki/ponyland/corpora.txt · Last modified: 2019/04/25 15:23 (external edit)