A combined GIS and stereo vision approach to identifying building pixels in images to determine appropriate colour terms

Phil Bartie, Femke Reitsma, Steven Mills

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)
46 Downloads (Pure)

Abstract

Color information is a useful attribute to include in a building’s description to assist the listener in identifying the intended target. Often this information is only available as image data, and not readily accessible for use in constructing referring expressions for verbal communication. The method presented uses a GIS building polygon layer in conjunction with street-level captured imagery to provide a method to automatically filter foreground objects and select pixels which correspond to building fac¸ades. These selected pixels are then used to define the most appropriate color term for the building, and corresponding fuzzy color term histogram. The technique uses a single camera capturing images at a high frame rate, with the baseline distance between frames calculated from a GPS speed log. The expected distance from the camera to the building is measured from the polygon layer and refined from the calculated depth map, after which building pixels are selected. In addition significant foreground planar surfaces between the known road edge and building fac¸ade are identified as possible boundarywalls and hedges. The output is a dataset of the most appropriate color terms for both the building and boundary walls. Initial trials demonstrate the usefulness of the technique in automatically capturing color terms for buildings in urban regions.
Original languageEnglish
Pages (from-to)59-83
Number of pages25
JournalJournal of Spatial Information Science
Issue number2
DOIs
Publication statusPublished - 2011

Keywords

  • GIS
  • computer vision
  • stereo depth mapping
  • color terms
  • referring expressions
  • building facade
  • structure from motion
  • wayfinding instructions
  • color entropy

Fingerprint

Dive into the research topics of 'A combined GIS and stereo vision approach to identifying building pixels in images to determine appropriate colour terms'. Together they form a unique fingerprint.

Cite this