Détail du document
Identifiant

oai:arXiv.org:2410.23092

Sujet
Computer Science - Computer Vision...
Auteur
Li, Ruyang Zhang, Tengfei Zhang, Heng Liu, Tiejun Wang, Yanwei Li, Xuelei
Catégorie

Computer Science

Année

2024

Date de référencement

06/11/2024

Mots clés
challenge model track object 2024 road++ recognition
Métrique

Résumé

This report presents our team's technical solution for participating in Track 3 of the 2024 ECCV ROAD++ Challenge.

The task of Track 3 is atomic activity recognition, which aims to identify 64 types of atomic activities in road scenes based on video content.

Our approach primarily addresses the challenges of small objects, discriminating between single object and a group of objects, as well as model overfitting in this task.

Firstly, we construct a multi-branch activity recognition framework that not only separates different object categories but also the tasks of single object and object group recognition, thereby enhancing recognition accuracy.

Subsequently, we develop various model ensembling strategies, including integrations of multiple frame sampling sequences, different frame sampling sequence lengths, multiple training epochs, and different backbone networks.

Furthermore, we propose an atomic activity recognition data augmentation method, which greatly expands the sample space by flipping video frames and road topology, effectively mitigating model overfitting.

Our methods rank first in the test set of Track 3 for the ROAD++ Challenge 2024, and achieve 69% mAP.

Li, Ruyang,Zhang, Tengfei,Zhang, Heng,Liu, Tiejun,Wang, Yanwei,Li, Xuelei, 2024, First Place Solution to the ECCV 2024 ROAD++ Challenge @ ROAD++ Atomic Activity Recognition 2024

Document

Ouvrir

Partager

Source

Articles recommandés par ES/IODE IA

Clinical Relevance of Plaque Distribution for Basilar Artery Stenosis
study endovascular imaging wall basilar complications plaque postoperative artery plaques stenosis