Computer Science – Computation and Language
Scientific paper
2000-09-08
Proceedings of the 18th International Conference on Computational Linguistics (Coling 2000), Universit
7 pages. Another version under the name "Learning Verb Subcategorization from Corpora: Counting Frame Subsets", authors: Zeman
We present novel machine learning techniques for identifying subcategorization information for verbs in Czech. We compare three different statistical techniques applied to this problem. We show how the learning algorithm can be used to discover previously unknown subcategorization frames from the Czech Prague Dependency Treebank. The algorithm can then be used to label dependents of a verb in the Czech treebank as either arguments or adjuncts. Using our techniques, we are able to achieve 88% precision on unseen parsed text.
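The abstract does not name the three statistical techniques it compares. As a rough illustration only, the sketch below uses a binomial hypothesis test, a filter commonly used in subcategorization acquisition, to decide which observed frames to keep, and then labels a verb's dependents as arguments or adjuncts against an accepted frame. The function names, the `noise_rate` and `alpha` parameters, and the dependent tags `N4` and `R2` are all assumptions made for this example and are not taken from the paper.

```python
from math import comb


def binomial_tail(n, m, p):
    """Return P(X >= m) for X ~ Binomial(n, p)."""
    return sum(comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(m, n + 1))


def accept_frames(frame_counts, verb_total, noise_rate=0.05, alpha=0.05):
    """Keep frames seen too often with the verb to be explained by noise.

    frame_counts: maps a candidate frame (tuple of dependent tags) to the
                  number of times it was observed with the verb.
    verb_total:   total number of occurrences of the verb.
    noise_rate:   assumed probability that a frame shows up by chance
                  (hypothetical value, not from the paper).
    alpha:        significance threshold (hypothetical value).
    """
    return {
        frame
        for frame, count in frame_counts.items()
        if binomial_tail(verb_total, count, noise_rate) < alpha
    }


def label_dependents(dependents, frame):
    """Label each dependent as an argument if the frame lists it, else adjunct."""
    return [
        (dep, "argument" if dep in frame else "adjunct") for dep in dependents
    ]


# Hypothetical counts for one verb observed 100 times.
counts = {("N4",): 40, ("N4", "R2"): 3}
accepted = accept_frames(counts, verb_total=100)
labels = label_dependents(["N4", "R2"], ("N4",))
```

With these made-up counts, the frame observed 40 times passes the test while the one observed 3 times is treated as noise, so the `R2` dependent is labeled an adjunct.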
Sarkar Anoop
Zeman Daniel
Automatic Extraction of Subcategorization Frames for Czech