Khalifé, Michelle (2004) Examining orthogonal concepts-based micro-classifiers and their correlations with noun-phrase coreference chains. Masters thesis, Concordia University.
- Accepted Version
The classification of a text into a category that corresponds with its content is an important task in Natural Language Processing. As we want to support shallow syntactic techniques, it must be possible to establish this category without a semantic text analysis. We therefore propose a statistical approach featuring a set of categories that cover orthogonal concepts: micro-classifiers. They allow for a robust multi-dimensional and multi-label text classification, which reveals to be beneficial in the context of automatic document summarization. The performance of these micro-classifiers is evaluated for four different cut-off thresholds using measures of precision and recall. Furthermore, we examine noun-phrase coreference chains within documents and attempt to find correlations with single and multi-label categorization. The presence of patterns could suggest better ways to enhance automatic document summarization.
|Divisions:||Concordia University > Faculty of Engineering and Computer Science > Computer Science and Software Engineering|
|Item Type:||Thesis (Masters)|
|Pagination:||xii, 108 leaves : ill. ; 29 cm.|
|Degree Name:||M. Comp. Sc.|
|Program:||Computer Science and Software Engineering|
|Thesis Supervisor(s):||Bergler, Sabine|
|Deposited By:||Concordia University Libraries|
|Deposited On:||18 Aug 2011 18:25|
|Last Modified:||18 Aug 2011 19:30|
Repository Staff Only: item control page