Computer Science – Computation and Language
Scientific paper
2009-07-04
ACL 2002
Computer Science
Computation and Language
Scientific paper
We present a document compression system that uses a hierarchical noisy-channel model of text production. Our compression system first automatically derives the syntactic structure of each sentence and the overall discourse structure of the text given as input. The system then uses a statistical hierarchical model of text production in order to drop non-important syntactic and discourse constituents so as to generate coherent, grammatical document compressions of arbitrary length. The system outperforms both a baseline and a sentence-based compression system that operates by simplifying sequentially all sentences in a text. Our results support the claim that discourse knowledge plays an important role in document summarization.
III Hal Daume
Marcu Daniel
No associations
LandOfFree
A Noisy-Channel Model for Document Compression does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with A Noisy-Channel Model for Document Compression, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and A Noisy-Channel Model for Document Compression will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-187249