Deep learning for image classification
Ku, Wee Kiat
Date of Issue2017
School of Computer Science and Engineering
Recent advancements in Convolutional Neural Networks (CNN) has been highly successful in more tricky and complex image processing tasks like object detection and classification. Research has also been conducted on classifying painting and photographic aesthetic styles (Karayev et al., 2014) with CNNs. However, such work on classifying photographic aesthetic styles are often limited as datasets used are multi-class, single label datasets but in reality multiple aesthetic styles can co-exist together in a single photograph. Therefore, this project aims to provide a classification pipeline that is able to provide multi-label results from a multi-class, single-label dataset and to build a photographic aesthetic style tool on top of the pipeline for photographers to improve on the aesthetic styles of their photographs. This project consists of 2 main parts, photographic aesthetic style classification pipeline and aesthetic style settings recommendation system. A CNN architec- ture, AlexNet (Krizhevsky, Sutskever, & Hinton, 2012), was chosen to be trained with the AVA Dataset (Murray, Marchesotti, & Perronnin, 2012). Through a series of experiments it was determined that the AVA Dataset has a lot of mis- labelled images, a new dataset was then collected to train the CNN with. The new dataset includes photograph meta data, EXIF, which is used to train binary Random Forest classifiers as part of the classification pipeline. A camera settings recommender system was then built on top of the classification pipeline. Accessed through a web API, the system is able to classify photographs to aesthetic styles as well as recommend camera settings given a photograph or an aesthetic style chosen by the user.
DRNTU::Engineering::Computer science and engineering
Final Year Project (FYP)
Nanyang Technological University