عنوان

A rules based system for named entity recognition in modern standard Arabic

پدید آورنده

Elsebai, A.

موضوع

Media, Digital Technology and the Creative Economy

رده

کتابخانه

Center and Library of Islamic Studies in European Languages

محل استقرار

استان: Qom ـ شهر: Qom

تماس با کتابخانه : 32910706-025

NATIONAL BIBLIOGRAPHY NUMBER

Number

TLets521502

TITLE AND STATEMENT OF RESPONSIBILITY

Title Proper

A rules based system for named entity recognition in modern standard Arabic

General Material Designation

[Thesis]

First Statement of Responsibility

Elsebai, A.

.PUBLICATION, DISTRIBUTION, ETC

Name of Publisher, Distributor, etc.

University of Salford

Date of Publication, Distribution, etc.

2009

DISSERTATION (THESIS) NOTE

Dissertation or thesis details and type of degree

Ph.D.

Body granting the degree

University of Salford

Text preceding or following the note

2009

SUMMARY OR ABSTRACT

Text of Note

The amount of textual information available electronically has made it difficult for many users to find and access the right information within acceptable time. Research communities in the natural language processing (NLP) field are developing tools and techniques to alleviate these problems and help users in exploiting these vast resources. These techniques include Information Retrieval (IR) and Information Extraction (IE). The work described in this thesis concerns IE and more specifically, named entity extraction in Arabic. The Arabic language is of significant interest to the NLP community mainly due to its political and economic significance, but also due to its interesting characteristics. Text usually contains all kinds of names such as person names, company names, city and country names, sports teams, chemicals and lots of other names from specific domains. These names are called Named Entities (NE) and Named Entity Recognition (NER), one of the main tasks of IE systems, seeks to locate and classify automatically these names into predefined categories. NER systems are developed for different applications and can be beneficial to other information management technologies as it can be built over an IR system or can be used as the base module of a Data Mining application. In this thesis we propose an efficient and effective framework for extracting Arabic NEs from text using a rule based approach. Our approach makes use of Arabic contextual and morphological information to extract named entities. The context is represented by means of words that are used as clues for each named entity type. Morphological information is used to detect the part of speech of each word given to the morphological analyzer. Subsequently we developed and implemented our rules in order to recognise each position of the named entity. Finally, our system implementation, evaluation metrics and experimental results are presented.

TOPICAL NAME USED AS SUBJECT

Media, Digital Technology and the Creative Economy

PERSONAL NAME - PRIMARY RESPONSIBILITY

Elsebai, A.

CORPORATE BODY NAME - SECONDARY RESPONSIBILITY

University of Salford

ELECTRONIC LOCATION AND ACCESS

Electronic name

[Thesis]

276903

عنوان A rules based system for named entity recognition in modern standard Arabic

پدید آورنده Elsebai, A.

موضوع Media, Digital Technology and the Creative Economy

رده

کتابخانه Center and Library of Islamic Studies in European Languages

محل استقرار استان: Qom ـ شهر: Qom

NATIONAL BIBLIOGRAPHY NUMBER

TITLE AND STATEMENT OF RESPONSIBILITY

.PUBLICATION, DISTRIBUTION, ETC

DISSERTATION (THESIS) NOTE

SUMMARY OR ABSTRACT

TOPICAL NAME USED AS SUBJECT

PERSONAL NAME - PRIMARY RESPONSIBILITY

CORPORATE BODY NAME - SECONDARY RESPONSIBILITY

ELECTRONIC LOCATION AND ACCESS

عنوان

A rules based system for named entity recognition in modern standard Arabic

پدید آورنده

Elsebai, A.

موضوع

Media, Digital Technology and the Creative Economy

کتابخانه

Center and Library of Islamic Studies in European Languages

محل استقرار

استان: Qom ـ شهر: Qom