Abstract
TransUNet is a hybrid architecture that combines a transformer-based encoder with a CNN-based UNet. Originally introduced for semantic segmentation of medical images, we show in our work that TransUNet can be successfully applied to urban scenery datasets commonly used for developing autonomous driving systems. We also explore the performance characteristics of training on multi-domain data from the real world and a simulator, and show that using simulated images to augment a live dataset can improve segmentation performance. Code will be made available at https://github.com/weiyuen.
Original language | English |
---|---|
Title of host publication | 2022 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS) |
Publisher | IEEE |
ISBN (Electronic) | 9798350332421 |
DOIs | |
Publication status | Published - 31 Mar 2023 |
Keywords
- Autonomous Driving
- Domain Adaptation
- Semantic Segmentation
- Urban Scenery
- Vision Transformer
ASJC Scopus subject areas
- Artificial Intelligence
- Computer Networks and Communications
- Computer Science Applications
- Computer Vision and Pattern Recognition
- Signal Processing