Exploiting Multimodal Interaction Techniques for Video-Surveillance

Castelló, Marc and Gonzàlez, Jordi and Amato, Ariel and Baiget, Pau and Fernández, Carles and Gonfaus, Josep M. and Mollineda, Ramón A. and Pedersoli, Marco and de la Blanca, Nicolás Pérez and Roca, F. Xavier

Intelligent Systems Reference Library 2013

Abstract: In this paper we present an example of a video surveillance application that exploits Multimodal Interactive (MI) technologies. The main objective of the so-called VID-Hum prototype was to develop a cognitive artificial system for both the detection and description of a particular set of human behaviours arising from real-world events. The main procedure of the prototype described in this chapter entails: (i) adaptation, since the system adapts itself to the most common behaviours (qualitative data) inferred from tracking (quantitative data), and is thus able to recognize abnormal behaviours; (ii) feedback, since an advanced interface based on Natural Language understanding allows end-users to communicate with the prototype by means of conceptual sentences; and (iii) multimodality, since a virtual avatar has been designed to describe what is happening in the scene, based on the textual interpretations generated by the prototype. Thus, the MI methodology has provided an adequate framework for all these cooperating processes. © Springer-Verlag Berlin Heidelberg 2013.