LT World


Lancaster Parsed Corpus (ICAME)


  • English

  • Monolingual

morphosyntactically

syntactic trees, syntactic dependencies, POS

  • International Computer Archive of Modern and Medieval English (ICAME)

  • Treebank
  • POS-tagged Text Corpus

ICAME is an international organization of linguists and information scientists working with English machine-readable texts. The aim of the organization is to collect and distribute information on English language material available for computer processing and on linguistic research completed or in progress on the material, to compile an archive of English text corpora in machine-readable form, and to make material available to research institutions.

 

This is a parsed subcorpus of the Lancaster-Oslo/Bergen (LOB) Corpus, compiled by Roger Garside, Geoffrey Leech and Tamas Varadi. It can now be obtained through ICAME, under conditions similar to those applying to other corpus holdings.

 

The Lancaster Parsed Corpus is a treebank consisting of sentences of the LOB Corpus, amounting altogether to over 133,000 words. Each sentence in the Parsed Corpus is annotated with a phrase-structure parse, represented as labelled bracketing that marks the boundaries of sentence, clause, phrase, and coordinated word constituents. The labels correspond to well-known "consensual" constituents such as noun phrases, relative clauses, and infinitive clauses. The annotations also include the word tags used for the Tagged LOB Corpus. See ICAME Journal 16, p. 124.

 

Example:
A07 418
[S[Na I_PP1A Na] [V can_MD n't_XNOT make_VB V][N a_AT club_NN N][Tb[V
pay_VB V] [N a_AT player_NN N][N[D so_QL much_AP D][N a_AT week_NN N]N]
Tb]._. S]


B04 248
[S[N \OMr_NPT Henry_NP Newton_NP [Po of_INO [N Acton_NP N] Po]N][V does_DOZ
not_XNOT want_VB V][N his_PP$ daughter_NN N] [Ti [Vi to_TO marry_VB Vi][N
a_AT Scotsman_NNP N]Ti] ._. S]
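
The labelled bracketing shown in the example can be read mechanically. The following is a minimal sketch, not part of the corpus distribution, of how such a string might be turned into a tree. It assumes that a constituent opens with `[Label` and closes with `Label]`, that every word appears as `word_TAG`, and that labels are purely alphabetic as in the two sentences above; labels containing other symbols would need a wider pattern. The names `parse_bracketing`, `tagged_words` and `SAMPLE` are hypothetical, and the sample string is one clause taken from sentence A07 418.

```python
import re

# One token per match: "[Label" opens a constituent, "Label]" closes it,
# and "word_TAG" is a tagged word (a leaf of the tree).
TOKEN = re.compile(
    r"\[(?P<open>[A-Za-z]+)"
    r"|(?P<close>[A-Za-z]+)\]"
    r"|(?P<leaf>[^\s\[\]]+_[^\s\[\]]+)"
)


def parse_bracketing(text):
    """Parse a labelled-bracketing string into a list of (label, children) nodes.

    Leaves are (word, tag) pairs. Raises ValueError when a closing label
    does not match the innermost open constituent, or when one is left open.
    """
    root = ("ROOT", [])
    stack = [root]
    for match in TOKEN.finditer(text):
        if match.group("open"):
            node = (match.group("open"), [])
            stack[-1][1].append(node)
            stack.append(node)
        elif match.group("close"):
            label = match.group("close")
            if len(stack) == 1 or stack[-1][0] != label:
                raise ValueError(f"unexpected closing label {label!r}")
            stack.pop()
        else:
            # Split on the last underscore so the tag is always the final part.
            word, _, tag = match.group("leaf").rpartition("_")
            stack[-1][1].append((word, tag))
    if len(stack) != 1:
        raise ValueError(f"unclosed constituent {stack[-1][0]!r}")
    return root[1]


def tagged_words(nodes):
    """Yield the (word, tag) leaves of a parsed tree in sentence order."""
    for first, second in nodes:
        if isinstance(second, list):
            yield from tagged_words(second)
        else:
            yield (first, second)


# A clause from sentence A07 418, joined onto one line.
SAMPLE = (
    "[Tb[V pay_VB V] [N a_AT player_NN N]"
    "[N[D so_QL much_AP D][N a_AT week_NN N]N] Tb]"
)

if __name__ == "__main__":
    for word, tag in tagged_words(parse_bracketing(SAMPLE)):
        print(word, tag)
```

Because the closing label is checked against the innermost open constituent, a sketch like this also flags bracketing that does not balance, which can help when checking transcribed examples.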


http://icame.uib.no/lanpeks.html

  • Creative Commons

  • International Computer Archive of Modern and Medieval English (ICAME)