عنوان

Annotation of conceptual co-reference and text mining the Qur'an

پدید آورنده

Muhammad, Abdul Baquee

موضوع

رده

کتابخانه

مرکز و کتابخانه مطالعات اسلامی به زبان‌های اروپایی

محل استقرار

استان: قم ـ شهر: قم

تماس با کتابخانه : 32910706-025

شماره کتابشناسی ملی

شماره

TLets577366

عنوان و نام پديدآور

عنوان اصلي

Annotation of conceptual co-reference and text mining the Qur'an

نام عام مواد

[Thesis]

نام نخستين پديدآور

Muhammad, Abdul Baquee

نام ساير پديدآوران

Atwell, E.

وضعیت نشر و پخش و غیره

نام ناشر، پخش کننده و غيره

University of Leeds

تاریخ نشرو بخش و غیره

2012

یادداشتهای مربوط به پایان نامه ها

جزئيات پايان نامه و نوع درجه آن

Thesis (Ph.D.)

امتياز متن

2012

یادداشتهای مربوط به خلاصه یا چکیده

متن يادداشت

This research contributes to the area of corpus annotation and text mining by developing novel domain specific language resources. Most practical text mining applications restrict their domain. This research restricts the domain to the Qur'anic Text. In this thesis, a number of pre-processing steps were undertaken and annotation information were added to the Qur'an. The raw Arabic Qur'an was pre-processed into morphological units using the Qur'anic Arabic Corpus (QAC). Qur'anic terms were indexed and converted into a vector space model using techniques in Information Retrieval (IR). In parallel, nearly 24,000 Qur'anic personal pronouns were annotated with information on their referents. These referents are consolidated and organized into a total of over 1,000 ontological concepts. Moreover, a dataset of nearly 8,000 pairs of related Qur'anic verses are compiled from books of scholarly commentary on the Qur'an. This vector space model, the pronoun tagging, the verse relatedness dataset, and the part-of-speech tags available in QAC all together served for a number of Qur'anic text mining applications which were rendered online for public use. Among these applications: lemma concordance, collocation, POS search of the Qur'an, verse similarity measures, concept clouds of a given verse, pronominal anaphora and Qur'anic chapter similarity. Furthermore, machine learning experiments were conducted on automatic detection of verse similarity/relatedness as well as categorization of Qur'anic chapters based on their chronology of revelation. Domain specific linguistic features were investigated to induct learning algorithms. Results show that deep linguistic and world knowledge is needed to reach the human upper bound in certain computational tasks such as detecting text relatedness, question answering and textual entailment. However, many useful queries can be addressed using text mining techniques and layers of annotations made available through this research. The works presented here can be extended to include other similar texts like Hadith (i.e., saying of Prophet Muhammad), or other scriptures like the Gospels.

نام شخص به منزله سر شناسه - (مسئولیت معنوی درجه اول )

مستند نام اشخاص تاييد نشده

Muhammad, Abdul Baquee

نام شخص - ( مسئولیت معنوی درجه دوم )

مستند نام اشخاص تاييد نشده

Atwell, E.

شناسه افزوده (تنالگان)

مستند نام تنالگان تاييد نشده

University of Leeds

دسترسی و محل الکترونیکی

نام الکترونيکي

وضعیت انتشار

فرمت انتشار

اطلاعات رکورد کتابشناسی

نوع ماده

[Thesis]

کد کاربرگه

276903

اطلاعات دسترسی رکورد

سطح دسترسي

تكميل شده

عنوان Annotation of conceptual co-reference and text mining the Qur'an

پدید آورنده Muhammad, Abdul Baquee