• user: anonymous

Open corpora

Corpus name Language Tokens Words
ACL Anthology Reference Corpus (ARC) English 49,348,397 38,792,655 info open
British Academic Spoken English Corpus (BASE) English 1,252,256 1,186,290 info open
British Academic Written English Corpus (BAWE) English 8,336,262 6,964,411 info open
Brown English 1,175,675 1,007,299 info open