Shifta Ansari

Degree Name

Master of Science


Computer Science


Dr. R. E. Mercer


Biomedical researchers present their experimental findings in tabular form, making tables one of the most important information sources in a scientific paper. The aim of this thesis is to develop and implement an intelligent tool that extracts and interprets the information in phenotype-genotype tables found in scholarly biomedical papers. To extract the information effectively, a domain-based table ontology was developed to describe the relations among the fields in these tables. The thesis concentrates on biomedical papers specifically selected by the keywords mutation, gene, phenotype, genotype, disease, and syndrome. The tool consists of an automated system that extracts information from tables and stores it in the ontology. The tool is verified in two ways: first, by populating the ontology with some table information taken from the set of tables from which the ontology was designed; second, by populating the ontology with a new set of tables to identify the problems that remain for the continuation of this work.


