Universiteit van Amsterdam

Events

Institute for Logic, Language and Computation

Please note that this newsitem has been archived, and may contain outdated information or links.

19 December 2001, Computing with LLI Seminar, Yoad Winter and Khalil Sima'an

Speaker: Yoad Winter (Technion) and Khalil Sima'an
(Amsterdam and Tilburg)
Title: Building a Corpus of Modern Hebrew Text
Date: Wednesday 19 December 2001
Time: 13:00
Location: Room B2.35, Gebouw B, Nieuwe Achtergracht 166, Amsterdam

ABSTRACT: We describe the process of building the first tree-bank for Modern Hebrew texts. A major concern in this process is the need for reducing the cost of manual annotation by the use of automatic means. To this end, the joint utility of an automatic morphological analyzer, a probabilistic parser and a small manually annotated tree-bank was explored. An initial tree-bank that consists of 500 annotated sentences from a daily newspaper is described. The annotation scheme that underlies the tree-bank analyses integrates morphology and syntax. An existing morphological analyzer and a language-independent probabilistic parser were applied to this tree-bank. Based on the results of some experiments with these tools, a semi-automatic procedure for future enlargement of the tree-bank is outlined.

For abstracts and more information, see http://lit.science.uva.nl/News/seminar01-2.html#December19

Please note that this newsitem has been archived, and may contain outdated information or links.