Abstract
This work presents an analysis of the discriminators used in Generative Adversarial Networks (GANs) for Video. We show that unconstrained video discriminator architectures induce a loss surface with high curvature which make optimization difficult. We also show that this curvature becomes more extreme as the maximal kernel dimension of video discriminators increases. With these observations in hand, we propose a methodology for the design of a family of efficient Lower-Dimensional Video Discriminators for GANs (LDVD-GANs). The proposed methodology improves the performance and efficiency of video GAN models it is applied to and demonstrates good performance on complex and diverse datasets such as UCF-101. In particular, we show that LDVDs can double the performance of Temporal-GANs and provide for state-of-the-art performance on a single GPU using the proposed methodology.
Original language | English |
---|---|
Pages (from-to) | 506-520 |
Number of pages | 15 |
Journal | Neural Networks |
Volume | 132 |
Early online date | 26 Sept 2020 |
DOIs | |
Publication status | Published - Dec 2020 |
Keywords
- Discriminator analysis
- Generative Adversarial Networks
- Video generation
ASJC Scopus subject areas
- Cognitive Neuroscience
- Artificial Intelligence