Spanish Treebank (UAM)
The project started in December 1997, and as of September 1999 the corpus consists of 1,500 syntactically annotated sentences extracted from newspapers (El País Digital and Compra Maestra).
In this period we developed the annotation guidelines and the tools for annotation and debugging. In the current phase, we are continuing the manual annotation with the help of more human annotators and improved tools. The goal for this phase is to reach 5,000 annotated sentences. We have also started some experiments on the corpus. Future work is oriented toward semi-automatic corpus construction, based on a grammar inferred from the treebank.
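
As an illustration of what inferring a grammar from a treebank can look like, the sketch below reads context-free rules off annotated trees and estimates rule probabilities by relative frequency. It is only a sketch under stated assumptions: the bracketed tree format, the category labels, the example sentences, and the function names (parse_bracketed, productions, induce_grammar) are all hypothetical and are not taken from the UAM annotation scheme or tools.

```python
from collections import Counter


def parse_bracketed(text):
    """Parse one bracketed tree, e.g. "(S (NP Juan) (VP duerme))".

    Returns nested (label, children) tuples; leaf words stay plain strings.
    The bracketed format is an assumption, not the UAM treebank's format.
    """
    tokens = text.replace("(", " ( ").replace(")", " ) ").split()
    pos = 0

    def read():
        nonlocal pos
        if tokens[pos] != "(":
            raise ValueError("expected '(' at token %d" % pos)
        pos += 1
        label = tokens[pos]
        pos += 1
        children = []
        while tokens[pos] != ")":
            if tokens[pos] == "(":
                children.append(read())
            else:
                children.append(tokens[pos])
                pos += 1
        pos += 1  # consume the closing ")"
        return (label, children)

    tree = read()
    if pos != len(tokens):
        raise ValueError("trailing tokens after the tree")
    return tree


def productions(tree):
    """Yield (lhs, rhs) rules for every node of the tree, top-down."""
    label, children = tree
    rhs = tuple(c[0] if isinstance(c, tuple) else c for c in children)
    yield (label, rhs)
    for child in children:
        if isinstance(child, tuple):
            yield from productions(child)


def induce_grammar(bracketed_trees):
    """Estimate rule probabilities as count(lhs -> rhs) / count(lhs)."""
    counts = Counter()
    for text in bracketed_trees:
        counts.update(productions(parse_bracketed(text)))
    totals = Counter()
    for (lhs, _), n in counts.items():
        totals[lhs] += n
    return {rule: n / totals[rule[0]] for rule, n in counts.items()}


if __name__ == "__main__":
    # Two invented example trees, for illustration only.
    trees = [
        "(S (NP Juan) (VP duerme))",
        "(S (NP María) (VP (V lee) (NP libros)))",
    ]
    for (lhs, rhs), p in sorted(induce_grammar(trees).items()):
        print(lhs, "->", " ".join(rhs), round(p, 3))
```

A grammar obtained this way could then drive a parser that proposes analyses for new sentences, leaving human annotators to correct rather than build each tree, which is one possible reading of the semi-automatic construction mentioned above.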