TY - JOUR
T1 - Beauty is in the eye of the beholder
T2 - Demographically oriented analysis of aesthetics in photographs
AU - Kairanbay, Magzhan
AU - See, John
AU - Wong, Laikuan
N1 - Funding Information:
This work is supported by Multimedia University, Malaysia under MMU-GRA Scheme MMUI/160085 and Malaysia MOHE Grant FRGS/1/2018/ICT02/MMU/02/2. Authors’ addresses: M. Kairanbay, Multimedia University, Persiaran Multimedia, Cyberjaya, Selangor, 63100, Malaysia; email: [email protected]; J. See (corresponding author), Multimedia University, Persiaran Multimedia, Cy-berjaya, Malaysia; email: [email protected]; L.-K. Wong, Multimedia University, Persiaran Multimedia, Cyberjaya, Selangor, Malaysia; email: [email protected]. Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]. © 2019 Association for Computing Machinery. 1551-6857/2019/07-ART63 $15.00 https://doi.org/10.1145/3328993
Publisher Copyright:
© 2019 Association for Computing Machinery.
Copyright:
Copyright 2019 Elsevier B.V., All rights reserved.
PY - 2019/8
Y1 - 2019/8
N2 - Aesthetics is a subjective concept that is likely to be perceived differently among people of different ages, genders, and cultural backgrounds. While techniques that directly compute this concept in images has seen increasing attention by the multimedia and machine-learning community, there are very few attempts at encoding the influences from the photographer's viewpoint. This work demonstrates how the aesthetic quality of photos can be better learned by accounting for the demographic background of a photographer. A new AVA-PD (Photographer Demographic) dataset is created to supplement the AVA dataset by providing photographers' age, gender and location attributes. Two deep convolutional neural network (CNN) architectures are proposed to utilize demographic information for aesthetic prediction of photos; both are shown to yield better prediction capabilities compared to most existing approaches. By leveraging on AVA-PD meta-data, we also present some additional machine-learnable tasks such as identifying the photographer and predicting photography styles from a person's gallery of photos.
AB - Aesthetics is a subjective concept that is likely to be perceived differently among people of different ages, genders, and cultural backgrounds. While techniques that directly compute this concept in images has seen increasing attention by the multimedia and machine-learning community, there are very few attempts at encoding the influences from the photographer's viewpoint. This work demonstrates how the aesthetic quality of photos can be better learned by accounting for the demographic background of a photographer. A new AVA-PD (Photographer Demographic) dataset is created to supplement the AVA dataset by providing photographers' age, gender and location attributes. Two deep convolutional neural network (CNN) architectures are proposed to utilize demographic information for aesthetic prediction of photos; both are shown to yield better prediction capabilities compared to most existing approaches. By leveraging on AVA-PD meta-data, we also present some additional machine-learnable tasks such as identifying the photographer and predicting photography styles from a person's gallery of photos.
KW - AVA
KW - Convolutional neural networks
KW - Demographic attributes
KW - Image aesthetic evaluation
KW - Photographer demographics
UR - http://www.scopus.com/inward/record.url?scp=85071121710&partnerID=8YFLogxK
U2 - 10.1145/3328993
DO - 10.1145/3328993
M3 - Article
AN - SCOPUS:85071121710
SN - 1551-6857
VL - 15
JO - ACM Transactions on Multimedia Computing, Communications and Applications
JF - ACM Transactions on Multimedia Computing, Communications and Applications
IS - 2s
M1 - 63
ER -