Center for Research in Comptuer Vision
Center for Research in Comptuer Vision



DaMN-Discriminative and Mutually Nearest



What is DaMN

demo

DaMN-discriminative and mutually nearest is a set of automatically generated category pairs, which augment one-vs-rest classifiers with a judicious selection of “two-vs-rest" classifier outputs. By combining one-vs-rest and two-vs-rest features in a principled probabilistic manner, we achieve state-of-the-art results on the UCF101 and HMDB51 datasets. More importantly, the same DaMN features, when treated as a mid-level representation also outperform existing methods in knowledge transfer experiments, both cross-dataset from UCF101 to HMDB51 and to new categories with limited training data (one-shot and few-shot learning). Finally, we study the generality of the proposed approach by applying DaMN to other classification tasks; our experiments show that DaMN outperforms related approaches in direct comparisons, not only on video action recognition but also on their original image dataset tasks.


Our Approach

framework

Our goal is to automatically identify the set of suitable features F, train their associated classifiers f(:) using the available training data and use the instance-level predictions to classify novel videos. We take one-vs-one SVM margin for every pair of categories to compute the category level similarity. Mutual k-Nearest neighbor can be obtained based on the distances. DaMN category- level feature matrix is composed of two parts: 1) features to separate each category individually (identical to one-vs-rest); 2) pair classifiers designed to separate mutually proximate pairs of categories from the remaining classes. Each colum of this matrix define a SVM classifier, and we use a principled probabilistic manner to combine the results from these classifiers together.


Knowledge transfer to novel category

For each new label y', we determine its similarity to the known categories Y using the small amount of new data. Note that while the data from novel categories may be insufficient to enable accurate training of one-vs-rest classiffiers, it is sufficient to provide a rough estimate of similarity between the new category and existing categories. At test time, we obtain scores from the bank of existing DaMN feature classifiers and determine the most likely category through equations states in our paper


Results

The following figure compare the category-level distance metrics:

distance

The follwing table shows action recognition accuracy on UCF 101

ucf101

The follwing table shows action recognition accuracy on HMDB51

hmdb51

The following compares DaMN and THUMOS semantic attributes.

thumos

The following shows the result on Animal with Attributes dataset.

awa

Code

Our code for text detection can be downloaded HERE.

Related Publications

Rui Hou, Amir Roshan Zamir, Rahul Sukthankar and Mubarak Shah, "DaMN - Discriminative and Mutually Nearest: Exploiting Pairwise Category Proximity for Video Action Recognition", European Conference on Computer Vision (ECCV), 2014 [PDF | BibTeX]

Back to Human Action and Activity Recognition Projects