      Collection and analysis on data from Drugs.com

      FYP Report submission for SCE 16-0379, Collection and Analysis on Data from Drugs.com (2.499Mb)
      Author
      Aw, Teng Teng
      Date of Issue
      2017-04-18
      School
      School of Computer Science and Engineering
      Abstract
      Internet users rely on the Internet for its convenience and efficiency, and search engines make finding information quick. Depending on the source of the results, a search can return a large amount of accurate information. For example, this report treats medical reference websites such as Drugs.com, as well as Wikipedia, as reliable on the grounds that their authors are professionals with medical knowledge. Members of the public with no medical knowledge can access this information and learn more about the drugs they are prescribed. Web scrapers are also available to help researchers extract data much faster within a given time frame. In this report, Scrapy, a web scraping framework written in Python, is used to extract data from Drugs.com, and the output is saved in JSON files. Scrapy adapts to webpages with different structures by using XPath selectors. The findings are presented in this report. The aim of this project is to use web scraping tools to collect data from Drugs.com for further analysis. The collected data can be reused in the future, saving time for researchers who intend to do the same. The analysis of the collected data covers aspects of the website such as its structure and the accuracy of its information. In addition, this report compares different web scrapers in terms of their cost, complexity, and the accuracy of the data extracted, and concludes by recommending one of them.
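The idea of using different XPath expressions for different page structures and writing the results as JSON can be illustrated with a minimal sketch. This is not the report's code and does not use Scrapy itself: it swaps in Python's standard `xml.etree.ElementTree`, which supports only a limited XPath subset, so that the example runs without extra dependencies. The layout names, class names, sample markup, and the `extract` helper are all hypothetical and are not taken from Drugs.com.

```python
import json
import xml.etree.ElementTree as ET

# Hypothetical page layouts: each maps a field name to an XPath expression.
# The element and class names are invented for illustration only.
LAYOUTS = {
    "monograph": {
        "name": ".//h1",
        "drug_class": ".//p[@class='drug-class']",
    },
    "summary": {
        "name": ".//div[@class='title']/span",
        "drug_class": ".//li[@class='class']",
    },
}


def extract(markup: str, layout: str) -> dict:
    """Apply the XPath expressions of one layout to a well-formed page."""
    root = ET.fromstring(markup)
    record = {}
    for field, xpath in LAYOUTS[layout].items():
        node = root.find(xpath)
        # Missing nodes become None so that every record has the same keys.
        has_text = node is not None and node.text
        record[field] = node.text.strip() if has_text else None
    return record


# Made-up sample pages with two different structures (placeholder values).
PAGE_A = (
    "<html><body><h1>Example Drug A</h1>"
    "<p class='drug-class'>Example Class A</p></body></html>"
)
PAGE_B = (
    "<html><body><div class='title'><span>Example Drug B</span></div>"
    "<ul><li class='class'>Example Class B</li></ul></body></html>"
)

records = [extract(PAGE_A, "monograph"), extract(PAGE_B, "summary")]

# Serialize the records as JSON, the output format the abstract mentions.
print(json.dumps(records, indent=2))
```

Keeping the XPath expressions in a per-layout table means a new page structure only needs a new table entry rather than new extraction code. Real HTML is often not well-formed XML, so a real scraper would rely on an HTML-aware selector library, such as the one Scrapy provides, rather than on `ElementTree`.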
      Subject
      DRNTU::Engineering::Computer science and engineering::Computing methodologies::Document and text processing
      Type
      Final Year Project (FYP)
      Rights
      Nanyang Technological University
      Collections
      • SCSE Student Reports (FYP/IA/PA/PI)



      NTU Library, Nanyang Avenue, Singapore 639798 © 2011 Nanyang Technological University. All rights reserved.
      DSpace software copyright © 2002-2015  DuraSpace