Automatic detection of violent actions in public places through video analysis is difficult because the employed Artificial Intelligence-based techniques often suffer from generalization problems. Indeed, these algorithms hinge on large quantities of annotated data and usually experience a drastic drop in performance when used in scenarios never seen during the supervised learning phase. In this paper, we introduce and publicly release the Bus Violence benchmark, the first large-scale collection of video clips for violence detection in public transport, where some actors simulated violent actions inside a moving bus in changing conditions such as background or light. Moreover, we conduct a performance analysis of several state-of-the-art video violence detectors pre-trained with general violence detection databases on this newly established use case. The achieved moderate performances reveal the difficulties in generalizing from these popular methods, indicating the need to have this new collection of labeled data beneficial to specialize them in this new scenario.
We published the Bus Violence benchmark in the Zenodo repository. Check it out here, it is freely available!
The code for the experiments described in the paper is available in our GitHub repository.
@article{Ciampi_2022,
doi = {10.3390/s22218345},
url = {https://doi.org/10.3390%2Fs22218345},
year = 2022,
month = {oct},
publisher = {{MDPI}},
volume = {22},
number = {21},
pages = {8345},
author = {Luca Ciampi and Pawel Foszner and Nicola Messina and Michal Staniszewski and Claudio Gennaro and Fabrizio Falchi and Gianluca Serao and Michal Cogiel and Dominik Golba and Agnieszka Szczesna and Giuseppe Amato},
title = {Bus Violence: An Open Benchmark for Video Violence Detection on Public Transport},
journal = {Sensors}
}
This work was supported by: European Union funds awarded to Blees Sp. z o.o. under grant POIR.01.01.01-00-0952/20-00 “Development of a system for analysing vision data captured by public transport vehicles interior monitoring, aimed at detecting undesirable situations/behaviours and passenger counting (including their classification by age group) and the objects they carry”); EC H2020 project ``AI4media: a Centre of Excellence delivering next generation AI Research and Training at the service of Media, Society and Democracy'' under GA 951911; research project (RAU-6, 2020) and projects for young scientists of the Silesian University of Technology (Gliwice, Poland); research project INAROS (INtelligenza ARtificiale per il mOnitoraggio e Supporto agli anziani), Tuscany POR FSE CUP B53D21008060008.