February 7, 2025
Ms. Akshita Gupta TU Darmstadt Friday, February 7, 2025 2:00PM – 3:00PM R1 101 | Zoom Abstract In this talk, I will be discussing advancements in Temporal Action Localization (TAL) with a focus on two key innovations: Efficient Large Model Adaptation and Open-Vocabulary Recognition in Videos. The first part of the talk introduces the Long-Short-range […]
February 3, 2025
Dr. Ser-Nam Lim UCF Department of Computer Science Monday, February 3, 2025 12:00PM – 1:00PM HEC 101 Abstract I will first go through the work on generative AI that my students and collaborators have been working on. In general, these works answer the question “What can Generative AI do for you?” I will then follow […]
January 28, 2025
The open-source AI model analyzes medical images, generates detailed reports, answers clinical questions and integrates multimodal data to streamline diagnostics and improve accuracy. By Eddy Duryea ’13 | January 28, 2025 Original Article As the fields of healthcare and technology increasingly evolve and intersect, researchers are collaborating on the best ways to use emerging technologies […]
January 23, 2025
The Workshop on Video Large Language Models (VidLLMs), sponsored by Amazon Science, has been accepted to CVPR 2025, which will take place in Nashville, TN. The VidLLMs Workshop focuses on the latest advancements and challenges of Video Large Language Models. VidLLMs find applications across various fields where video content plays a crucial role. In educational […]
January 21, 2025
In a collaborative effort with Microsoft Research and SRI International, our team has developed and released a series of innovative benchmarks to evaluate and improve the robustness of visual perception models. These benchmarks focus on critical challenges such as distribution shifts, perturbations, occlusions, and low-resolution conditions, offering insights into the performance of video action recognition […]
January 13, 2025
Dr. Christian Kümmerle University of North Carolina Monday, January 13, 2025 11:00AM – 12:00PM MSB 318 | Zoom Abstract For a machine learning model to generalize well to unseen data, it is understood that the model needs to capture a hidden, low-dimensional data distribution in the high-dimensional feature space. In this […]
December 11, 2024
The British Machine Vision Conference (BMVC) is the British Machine Vision Association’s (BMVA) annual conference on machine vision, image processing, and pattern recognition. It is one of the major international conferences on computer vision and related areas held in the UK. With increasing popularity and quality, it has established itself as a prestigious event on […]
December 4, 2024
On November 12, 2024, the United States Patent and Trademark Office granted patent number US 12,142,053, titled “Self-Supervised Privacy Preservation Action Recognition System.” Title: Self-Supervised Privacy Preservation Action Recognition System UCF Inventor(s): Mubarak Shah, Chen Chen, Ishan Rajendrakumar Dave UCF REF ID#: 2023-019-02
November 26, 2024
Introducing All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages (ALM-Bench): A culturally diverse multilingual and multimodal VQA benchmark covering 100 languages with 22.7K question-answer pairs. ALM-Bench encompasses 19 generic and culture-specific domains for each language, enriched with four diverse question types. With over 800 hours of human annotations, ALM-Bench is meticulously curated and verified […]
November 21, 2024
Dr. Kevin Bello Soroco Thursday, November 21, 2024 11:00AM – 12:00PM HEC 101A | Zoom Abstract Interpretability and causality are key desiderata in modern machine learning systems. Graphical models, and more specifically directed acyclic graphs (DAGs, a.k.a. Bayesian networks), serve as a well-established tool for expressing interpretable causal relationships. However, the task of estimating DAG […]