DeepBev: A conditional adversarial network for bird's eye view generation

Helmi Fraser, Sen Wang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Obtaining a meaningful, interpretable yet compact representation of the immediate surroundings of an autonomous vehicle is paramount for effective operation as well as safety. This paper proposes a solution to this by representing semantically important objects from a top-down, ego-centric bird's eye view. The novelty in this work is from formulating this problem as an adversarial learning task, tasking a generator model to produce bird's eye view representations which are plausible enough to be mistaken as a ground truth sample. This is achieved by using a Wasserstein Generative Adversarial Network based model conditioned on object detections from monocular RGB images and the corresponding bounding boxes. Extensive experiments show our model is more robust to novel data compared to strictly supervised benchmark models, while being a fraction of the size of the next best.

Original languageEnglish
Title of host publication25th International Conference on Pattern Recognition 2020
PublisherIEEE
Pages5581-5586
Number of pages6
ISBN (Electronic)9781728188089
DOIs
Publication statusPublished - 5 May 2021
Event25th International Conference on Pattern Recognition 2020 - Virtual, Milan, Italy
Duration: 10 Jan 202115 Jan 2021

Conference

Conference25th International Conference on Pattern Recognition 2020
Abbreviated titleICPR 2020
Country/TerritoryItaly
CityVirtual, Milan
Period10/01/2115/01/21

ASJC Scopus subject areas

  • Computer Vision and Pattern Recognition

Fingerprint

Dive into the research topics of 'DeepBev: A conditional adversarial network for bird's eye view generation'. Together they form a unique fingerprint.

Cite this