Influenza genome database integration with statistic visualisation
Tan Jian An
Date of Issue2017-04-17
School of Computer Science and Engineering
As the studies on influenza viruses progress, more influenza sequence database emerges online for public access, with each serving their individual objectives. However, this progress has made it difficult for researchers to study the different virus sequences as the formats differs from database to database. Researchers are required to spend more time and effort to format each file even before they can study the sequences. In addition, as the cost for genome sequencing became more affordable, most of these databases contain huge amount of data. Researchers do not know what they have in store for them in those databases and would usually spend a lot of time trying to understand the data before even researching on the sequences. In this project, the data from the different database will be studied and integrated to become one common influenza virus database allowing users to have a one stop influenza virus database. This project will also provide free-text search where researchers and users will be able to search words from the abstract to retrieve the virus sequences which will allow researchers and users to perform a more detailed search as compared to the traditional field search which the traditional influenza sequence database does not provide. Finally, data visualizations of the database will be provided to allow users to understand the data available in store for them in just one glance without wasting much time to analyse it.
DRNTU::Engineering::Computer science and engineering
Final Year Project (FYP)
Nanyang Technological University