A unified approach to segmentation and categorization of dynamic textures

Avinash Ravichandran, Paolo Favaro, René Vidal

Research output: Chapter in Book/Report/Conference proceedingConference contribution

10 Citations (Scopus)

Abstract

Dynamic textures (DT) are videos of non-rigid dynamical objects, such as fire and waves, which constantly change their shape and appearance over time. Most of the prior work on DT analysis dealt with the classification of videos of a single DT or the segmentation of videos containing multiple DTs. In this paper, we consider the problem of joint segmentation and categorization of videos of multiple DTs under varying viewpoint, scale, and illumination conditions. We formulate this problem of assigning a class label to each pixel in the video as the minimization of an energy functional composed of two terms. The first term measures the cost of assigning a DT category to each pixel. For this purpose, we introduce a bag of dynamic appearance features (BoDAF) approach, in which we fit each video with a linear dynamical system (LDS) and use features extracted from the parameters of the LDS for classification. This BoDAF approach can be applied to the whole video, thus providing a framework for classifying videos of a single DT, or to image patches (superpixels), thus providing the cost of assigning a DT category to each pixel. The second term is a spatial regularization cost that encourages nearby pixels to have the same label. The minimization of this energy functional is carried out using the random walker algorithm. Experiments on existing databases of a single DT demonstrate the superiority of our BoDAF approach with respect to state-of-the art methods. To the best of our knowledge, the problem of joint segmentation and categorization of videos of multiple DTs has not been addressed before, hence there is no standard database to test our method. We therefore introduce a new database of videos annotated at the pixel level and evaluate our approach on this database with promising results. © 2011 Springer-Verlag Berlin Heidelberg.

Original languageEnglish
Title of host publicationComputer Vision, ACCV 2010 - 10th Asian Conference on Computer Vision, Revised Selected Papers
Pages425-438
Number of pages14
Volume6492 LNCS
EditionPART 1
DOIs
Publication statusPublished - 2011
Event10th Asian Conference on Computer Vision - Queenstown, New Zealand
Duration: 8 Nov 201012 Nov 2010

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 1
Volume6492 LNCS
ISSN (Print)0302-9743

Conference

Conference10th Asian Conference on Computer Vision
Abbreviated titleACCV 2010
CountryNew Zealand
CityQueenstown
Period8/11/1012/11/10

Fingerprint Dive into the research topics of 'A unified approach to segmentation and categorization of dynamic textures'. Together they form a unique fingerprint.

Cite this