dc.contributor.authorPeh, Wei Leng
dc.date.accessioned2014-04-17T08:20:10Z
dc.date.available2014-04-17T08:20:10Z
dc.date.copyright2014en_US
dc.date.issued2014
dc.identifier.urihttp://hdl.handle.net/10356/58980
dc.description.abstractPeople nowadays are strongly influenced to using acronym and not full names when referring to Smartphone or other products on forums. This leads to an increasing number of different naming convention (e.g., full names, acronym names or certain names referring to more than one product) used when referring to a certain product. Therefore, this report presents the proposed techniques to automatically identify the product names from online forums. The first proposed technique combines the usage of natural language processing tools, standard matching of noun phrases with a list of phone database and acronyms together with rule based method to further filter the output list of phone names after extraction. The second technique uses the users’ pattern analysis model to extract the possible phone names from forum. From the results, more than 75% of the phone names are extracted for rule-based approach. However, the drawback is that there are too many unnecessary nouns being extracted as mobile names. There are too many false positives in the result. For pattern-based approach, lesser mobile names are being detected and extracted out. Further research on users’ patterns analysis needs to be done for pattern-based approach. Therefore, further improvement needs to be done. Firstly, more rules needs to be defined to further filter the unnecessary words. Secondly, those special words that do not appear for more than 15 times for each thread can be extracted. Thirdly, to add on to the users’ pattern analysis model, a list of categories of words that are hardly used for naming product names can be defined. Lastly, manual annotation on product names can be done in one XML thread and then extract them to train the rest of the data. As more refinements are continuously made, it is believed that the proposed techniques will achieve better performance in identifying the product names automatically.en_US
dc.format.extent65 p.en_US
dc.language.isoenen_US
dc.rightsNanyang Technological University
dc.subjectDRNTU::Engineeringen_US
dc.titleProduct name detection from user forumsen_US
dc.typeFinal Year Project (FYP)en_US
dc.contributor.supervisorSun Aixin (SCE)en_US
dc.contributor.schoolSchool of Computer Engineeringen_US
dc.description.degreeCOMPUTER SCIENCEen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record