
Corpus resources:
Corpora and electronic text databases

This page contains links to lists of available corpora and descriptions of individual corpus projects. Because of the nature of the WWW, there is considerable overlap between some of the lists. Some of the corpora linked here are freely available; others are available only for a fee.

NOTE: This page is not actively maintained. For more up-to-date information, you might try the ACL wiki page on resources by language.

Jump to:

Lists of corpora

Pages for specific corpora, by language:

Multilingual

Modern English

Earlier English

Basque

Catalan

Czech

French

Galician

German

Italian

Hebrew

Japanese

Norwegian

Portuguese

Slovene

Russian

Serbo-Croatian

Spanish

Turkish

Other Databases

-----
Emily M. Bender (bender at csli dot stanford dot edu)
Last modified: June 17, 2004