CAP6412 – Spring 2013


Advanced Computer Vision (3 Credit Hours)

Course Description:

Review recent advances in computer vision.

Course Goals:

To prepare students for graduate research in computer vision.

Exam and Grading Policy:

Reports: 30%
Paper Presentations: 10%
Discussion and Attendance: 20%
Programming Projects + Presentation (30% + 10%): 40%
No exam!


Useful Links:

CAP 5415 Fall 2005

How to read a research paper (by Dr. Shah)


List of Papers to choose from:

This list will be updated throughout the semester. Email me to sign up for a paper.

Motion Patterns Modeling and Estimation:

Xuemei Zhao and Gerard Medioni, “Robust Unsupervised Motion Pattern Inference from Video and Applications”, ICCV 2011

Visual Saliency:

Ali Borji, “Boosting bottom-up and top-down visual features for saliency estimation”, CVPR 2012.


Object Detection and Search:

Yuning Jiang, Jingjing Meng, Junsong Yuan, “Randomized visual phrases for object search”, CVPR 2012
N. Payet, S. Todorovic, “From contours to 3D object detection and pose estimation”, ICCV 2011

Scene classification/segmentation, Image Retrieval:

Liujuan Cao, Rongrong Ji, Yue Gao, Yi Yang, Qi Tian, “Weakly Supervised Sparse Coding with Geometric Consistency Pooling”, CVPR 2012
Lingqiao Liu, Lei Wang, “What has my classifier learned? Visualizing the classification rules of bag-of-feature model by support region detection”, CVPR 2012
F. Yu, Rongrong Ji, Ming-Hen Tsai, Guangnan Ye, Shih-Fu Chang, “Weak attributes for large-scale image retrieval”, CVPR 2012
D. Parikh, K. Grauman, “Relative attributes”, ICCV 2011


Tracking:

Zheng Wu, A. Thangali, Stan Sclaroff, M. Betke, “Coupling detection and data association for multiple object tracking”, CVPR 2012
L. Leal-Taixe, G. Pons-Moll, B. Rosenhahn, “Branch-and-price global optimization for multi-view multi-target tracking”, CVPR 2012
One of the following two papers:
H. Pirsiavash, D. Ramanan, C. Fowlkes, “Globally-optimal greedy algorithms for tracking a variable number of objects”, CVPR 2011
J. Berclaz, F. Fleuret, E. Turetken, P. Fua, “Multiple Object Tracking Using K-Shortest Paths Optimization”, PAMI 2011

Action, Activity, Event Recognition:

Lixin Duan, Dong Xu, I. Tsang, Jiebo Luo, “Visual Event Recognition in Videos by Learning from Web Data”, PAMI 2012
Wen Li, Lixin Duan, Dong Xu, I. Tsang, “Text-based image retrieval using progressive multi-instance learning”, ICCV 2011
Lixin Duan, Dong Xu, Shih-Fu Chang, “Exploiting web images for event recognition in consumer videos: A multiple source domain adaptation approach”, CVPR 2012
W. Brendel, S. Todorovic, “Learning spatiotemporal graphs of human activities”, ICCV 2011
Bangpeng Yao, Xiaoye Jiang, A. Khosla, A. Lin, L. Guibas, Li Fei-Fei, “Human action recognition by learning bases of action attributes and parts”, ICCV 2011