dc.contributor.authorLim, Lionel Guan Chuan.
dc.date.accessioned2013-04-19T02:42:20Z
dc.date.available2013-04-19T02:42:20Z
dc.date.copyright2013en_US
dc.date.issued2013
dc.identifier.urihttp://hdl.handle.net/10356/51991
dc.description.abstractCurrently, there is a trend where users post questions and edit questions via the use of online websites. These sites are also known as Community Question Answering (CQA) sites. CQA sites are beneficial to the web users because of the valuable knowledge accumulated from everybody around the world. However, as beneficial as CQA sites may be, there comes a complexity of how to extract only relevant information which is beneficial to the web user. The goal of this project aims to consolidate healthcare information and allow web users to extract information which is beneficial to them. To do so, java-programmed web crawlers are programmed to retrieve the URL, category, question answer from the CQA health category. The question answer pairs crawled are then saved into an XML format. Lucene, a java IR java library, is used for speed indexing of the various XML documents.Another goal is to design a centralised search engine that can retrieve relevant healthcare information from CQA data. As this project will be a continuation from Senior Lee Qian Hui’s progress, i am tasked to utilise Information Retrieval Models to data crawl from more CQA sites that resemble WikiAnswers, which was previously implemented by Senior Lee.en_US
dc.format.extent43 p.en_US
dc.language.isoenen_US
dc.rightsNanyang Technological University
dc.subjectDRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrievalen_US
dc.subjectDRNTU::Engineering::Computer science and engineering::Computing methodologies::Document and text processingen_US
dc.titleExtracting integrate and search healthcare knowledge from the web (III)en_US
dc.typeFinal Year Project (FYP)en_US
dc.contributor.schoolSchool of Computer Engineeringen_US
dc.description.degreeBachelor of Engineering (Computer Science)en_US
dc.contributor.supervisor2Gao Congen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record