LT World

Sections
Personal tools
Log in

Skip to content. | Skip to navigation

Supporters

provided by

dfki logo

with support by

eu star logofp7 logo

through

meta logo
clarin logo

as well as by

bmbf logo

through

take logo

You are here: Home kb Resources & Tools Language Data The Brown University Standard Corpus of Present-Day American English (Corpus BROWN )

The Brown University Standard Corpus of Present-Day American English (Corpus BROWN )


http://khnt.aksis.uib.no/icame/manuals/brown/

1,014,312 words sampled from 15 text categories

  • English

  • Monolingual

human annotation, native speakers of American English

POS

morphosyntactically

syntactic dependencies, POS

  • Brown University
  • Department of Cognitive & Linguistic Sciences, Brown University

The Corpus consists of 500 samples, distributed across 15 genres in rough proportion to the amount published in 1961 in each of those genres. All works sampled were published in 1961; as far as could be determined they were first published then, and were written by native speakers of American English. Each sample began at a random sentence-boundary in the article or other unit chosen, and continued up to the first sentence boundary after 2,000 words. In a very few cases miscounts led to samples being just under 2,000 words.


http://www.ldc.upenn.edu/cgi-bin/ldc/textcorpus?doc=yes&corpus=BROWN

  • GNU GPL

linguistic analysis