Extraction of semantic header from RTF documents

Ali, Abdelbaset (1999) Extraction of semantic header from RTF documents. [Graduate Projects (Non-thesis)] (Unpublished)

Text (application/pdf): MQ43664.pdf (1MB)

Abstract

The problem of indexing and retrieving electronic information resources becomes more critical as the amount of information and the number of Internet users continue to grow. The Semantic Header, proposed by Desai [3], is a portion of each document that contains the meta-information for a publicly accessible resource on the Internet. For document-like Internet resources, the Semantic Header is a powerful means of helping users locate documents and other types of data in large repositories. In environments that contain many different types of data, content indexing requires type-specific processing to extract information effectively. In this project, which is part of the ASHG (Automatic Semantic Header Generator) system, we present a model for type-specific information extraction that automatically extracts the meta-information from RTF (Rich Text Format) documents and stores it in a Semantic Header, which is then used as an index for the document. This provides a useful tool for searching for a document based on a number of commonly used criteria. The search system can use the information in the Semantic Header to help locate appropriate documents with minimum effort.

Divisions:	Concordia University > Gina Cody School of Engineering and Computer Science > Computer Science and Software Engineering
Item Type:	Graduate Projects (Non-thesis)
Authors:	Ali, Abdelbaset
Pagination:	ix, 43 leaves : ill. ; 29 cm.
Institution:	Concordia University
Degree Name:	M. Comp. Sc.
Program:	Computer Science
Department (as was):	Department of Computer Science
Date:	1999
Thesis Supervisor(s):	Desai, Bipin C.
Identification Number:	QA 76 M26+ 1999 no.7
ID Code:	873
Deposited By:	lib-batchimporter
Deposited On:	27 Aug 2009 17:14
Last Modified:	20 Oct 2022 20:44
Related URLs:	https://concordiauniversity.on.worldcat....

Spectrum Research Repository
