The theory and practice of discourse parsing and summarization

Until now, most discourse researchers have assumed that full semantic understanding is necessary to derive the discourse structure of texts. This book documents the first serious attempt to construct automatically and use nonsemantic computational structures for text summarization. Daniel Marcu deve...

Mô tả đầy đủ

Đã lưu trong:
Chi tiết về thư mục
Tác giả chính: Marcu, Daniel
Định dạng: Sách
Ngôn ngữ:Undetermined
Được phát hành: Cambridge, Mass. MIT Press c2000
Những chủ đề:
Các nhãn: Thêm thẻ
Không có thẻ, Là người đầu tiên thẻ bản ghi này!
Thư viện lưu trữ: Trung tâm Học liệu Trường Đại học Cần Thơ
LEADER 02060nam a2200217Ia 4500
001 CTU_153231
008 210402s9999 xx 000 0 und d
020 |c 45.24 
082 |a 401.410285 
082 |b M322 
100 |a Marcu, Daniel 
245 4 |a The theory and practice of discourse parsing and summarization 
245 0 |c Daniel Marcu 
260 |a Cambridge, Mass. 
260 |b MIT Press 
260 |c c2000 
520 |a Until now, most discourse researchers have assumed that full semantic understanding is necessary to derive the discourse structure of texts. This book documents the first serious attempt to construct automatically and use nonsemantic computational structures for text summarization. Daniel Marcu develops a semantics-free theoretical framework that is both general enough to be applicable to naturally occurring texts and concise enough to facilitate an algorithmic approach to discourse analysis. He presents and evaluates two discourse parsing methods: one uses manually written rules that reflect common patterns of usage of cue phrases such as "however" and "in addition to"; the other uses rules that are learned automatically from a corpus of discourse structures. By means of a psycholinguistic experiment, Marcu demonstrates how a discourse-based summarizer identifies the most important parts of texts at levels of performance that are close to those of humans. Marcu also discusses how the automatic derivation of discourse structures may be used to improve the performance of current natural language generation, machine translation, summarization, question answering, and information retrieval systems. 
650 |a Discourse analysis,Parsing (Computer grammar),Abstracts,Automatic abstracting,Phân tích ngôn ngữ,Phân tích cú pháp ( ngữ pháp máy tính),Lý thuyết,Lý thuyết tự động 
650 |x Data processing,Data processing,Xử lý dữ kiện,Xử lý dữ liệu 
904 |i QHieu 
980 |a Trung tâm Học liệu Trường Đại học Cần Thơ