See the next THUMOS Challenge, 2015!
The ground truth labels of the test data is available HERE!
THUMOS 2014 Challenge results are available HERE!
Action Recognition in Temporally Untrimmed Videos!
Automatically recognizing and localizing a large number of action categories from videos in the wild of significant importance for video understanding and multimedia event detection. THUMOS workshop and challenge aims at exploring new challenges and approaches for large-scale action recognition with large number of classes from open source videos in a realistic setting.
Most of the existing action recognition datasets are composed of videos that have been manually trimmed to bound the action of interest. This has been identified to be a considerable limitation as it poorly matches how action recognition is applied in practical settings. Therefore, THUMOS 2014 will conduct the challenge on temporally untrimmed videos. The participants may train their methods using trimmed clips but will be required to test their systems on untrimmed data.
A new forward-looking dataset containing over 254 hours of video data and 25 million frames with the following components is made available under this challenge:
- Training Set: over 13,000 temporally trimmed videos from 101 action classes.
- Validation Set: Over 1000 temporally untrimmed videos with temporal annotations of actions.
- Background Set: Over 2500 relevant videos guaranteed to not include any instance of the 101 actions.
- Test Set: Over 1500 temporally untrimmed videos with withheld ground truth.
- Spatio-temporal Annotations: Bounding box annotation for 24 action classes.
All videos are collected from YouTube, and their pre-extracted low-level features (Improved Dense Trajectory Features) are made available.
The entries to the challenge will be evaluated using the new THUMOS 2014 Dataset in two tasks:
- Action Recognition: accepts submissions for whole-clip action recognition over 101 classes.
- Temporal Action Detection: accepts submissions on action recognition and temporal localization on 20 action classes.
Participants may either submit a notebook paper (due September 4th) that briefly describes their system, or a research paper detailing their approach. All of the submission results will be summarized during the workshop and included in the workshop\conference proceedings. Additionally, the top performers will be invited to give oral presentations, and the remaining participants are encouraged to present their work in the poster session.