The First Workshop on

Short-Form Video Understanding:

The Next Frontier in Video Intelligence

ICCV 2025

Exploring the challenges and opportunities in understanding short-form videos

Ocotber 19/20, 2025, Honolulu, Hawai'i, USA

About the Workshop

Our workshop welcomes contributions on exploring the challenges of developing benchmark datasets for short-form videos, findings from solving existing video understanding problems, and explorations of novel research problems and application areas.


What are Short-Form Videos?

Short-form videos (SVs) are videos that are typically 5 minutes or less in length. They are often characterized by editing and composition techniques geared towards heightening viewer engagement. This can include, but is not limited to, looped narratives (the end of the video matches seamlessly with the begninning), two or more juxtaposed videos (e.g., one video in the top-half and another in the bottom-half), and semi-independent submodalities (e.g., disseminating audio information via an ASMR video).


Census data on website visits consistently show that social media platforms are among the top-visited websites on the Internet, with major platforms such as YouTube, Facebook, Instagram, and X (formerly Twitter) ranking in or around the top 10. The pan-global adoption of mobile devices has been one of the major contributing factors to this trend, which, in turn, has led to the emergence of this new class of videos – SVs – tailored for rapid consumption, primarily on small screens.


Why Focus on Short-Form Videos?

SVs are becoming increasingly embedded in our daily lives, whether as a source of entertainment, information, advertisement, or even social communications. Recent estimates show that 73% of consumers prefer SVs to get information about products and services, and more than 50% of marketers have turned to SVs to reach their customers, with over 90% set to increase or maintain their investments. Creative artists view SVs as a separate form of art and media, and, given the novelty and popularity of SVs in the modern world, they are increasingly designing content specifically for and with SVs. By current estimates, SVs account for 90% of internet traffic and can have as much as 2.5 times the engagement factor of longer videos. This only leads to more proliferation of SVs in the wild and more diversity in their content.


To this end, our workshop aims to bring together the ongoing efforts on SV understanding, bring out the specific challenges of handling SVs, scope out the research landscape, and lay the foundation for future research and development in the space of SVs.

Topics of Interest

SV Data Collection and Benchmarking

Exploring challenges related to collecting and benchmarking SV data that ideally highlight the fundamental differences of SVs from other forms of video, for example:

  • Assembling high-quality, ethically sourced SV data
  • Capturing the diversity in the themes, contents, editing, and composition styles of SVs
  • Developing benchmarks catering to the different editing and composition styles of SVs

SV Analysis and Understanding

Establishing performance baselines on SVs by building on top of existing techniques for solving video understanding problems, including, but not limited to:

  • Object and scene segmentations
  • Action recognition
  • Multi-concept recognition
  • Human-object and human-scene interactions
  • Content captioning
  • Content forecasting
  • Language models for SVs
  • Ethics and safety of using SVs, including social and cultural issues

New Research Frontiers in SV

Rigorously investigating research problems specific to SVs and their usage in common media. Sample problems include, but are not limited to:

  • Detection, such as whether an SV contains looped narratives, juxtaposed videos, or semi-independent modalities
  • Generation, especially focusing on topic- and viewership-specific engagement
  • Provenance, including recognizing the sources of composite and synthesized SVs
  • Evaluation of the quality of SVs, particlarly at scale, on factors that drive viewership and engagement
  • Exploring how SV can impact social, cultural, and professional communications, interactions, and workflows

Organizers

Uttaran Bhattacharya

Uttaran Bhattacharya

Adobe Research

Ishita Dasgupta

Ishita Dasgupta

Adobe Research

Mehrab Tanjim

Mehrab Tanjim

Adobe Research

Chen-Yi Lu

Chen-Yi Lu

Purdue University

Kunjal Panchal

Kunjal Panchal

University of Massachusetts, Amherst

Dinesh Manocha

Dinesh Manocha

University of Maryland, College Park

Invited Speakers

TBD

Workshop Schedule

Half-Day
Schedule TBD

Paper Submission

Important Dates

Full Papers

Submissions Open May 12, 2025
Submissions Due June 17 June 27, 2025, 11:59 PM AoE
Notification to Authors June 24 July 08, 2025
Camera-Ready Due July 18 August 11, 2025

Short Papers and Extended Abstracts

Submissions Open July 28, 2025
Submissions Due August 20, 2025, 11:59 PM AoE
Notification to Authors September 18, 2025
Camera-Ready Due September 30, 2025

Submission Guidelines

We welcome submissions of extended abstracts (1-2 pages), short papers (2-4 pages), and full papers (5-8 pages). Full papers will be published with ICCV 2025 proceedings, extended abstracts and short papers will be archived as part of the workshop.

  • Full papers are intended to demonstrate original research ideas and their impacts (theoretical or empirical), original research findings and analysis, or industrial applications of various research domains. Full papers should provide comprehensive background, clear motivation, rigorous methodology, and thorough experimental validation to demonstrate significant contributions to the field.
  • Short papers are intended for reporting promising early-stage research, novel ideas, or results that may not yet be fully developed for a full paper. Short papers should provide sufficient background, motivation, and preliminary results to demonstrate the potential impact and value to the community.
  • Extended abstracts are intended for sharing innovative concepts, position statements, or works-in-progress that would benefit from community feedback. They are ideal for presenting emerging ideas, pilot studies, or visionary perspectives that spark discussion and inspire future research directions.

All submissions should follow the ICCV 2025 author guidelines.


Submission Portal: OpenReview


Review Guidelines

Our review process follows that of ICCV 2025, and reviewers should adhere to the same ICCV 2025 reviewer guidelines. We thank all reviewers for maintaining the technical standards of ICCV 2025!