Topic: Video Analysis and Summarization

This cluster of papers focuses on the automatic analysis and summarization of video content, covering topics such as shot boundary detection, user attention models, semantic analysis, key frame extraction, event detection, and the application of these techniques to soccer videos. It also explores the use of the MPEG-7 standard and content-based retrieval methods in video summarization.
Latest Publications
CoTracker3: Simpler and Better Point Tracking by Pseudo-Labeling Real Videos

Multi-Modal Few-Shot Temporal Action Segmentation

StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding

COUNTER-CAPTIONS: κ_O (Operative Captioning) Applied to Operation Epic Fury — Two-Tier Architecture: Operative Compression + Witness Restoration

CustomVideo: Customizing Text-to-Video Generation With Multiple Subjects

Local2Global Query Alignment for Video Instance Segmentation

AIM 2025 Challenge on Screen-Content Video Quality Assessment: Methods and Results

ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling

MCBLT: Multi-Camera Multi-Object 3D Tracking in Long Videos

Blast from The Past: MOQ Live Streaming with AI-Generated Event Timeline

Highly Cited Publications (Past 5 Years)
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation

Citations: 1917, FWCI: 219.8983

Imagic: Text-Based Real Image Editing with Diffusion Models

Citations: 682, FWCI: 78.5434

Blended Diffusion for Text-driven Editing of Natural Images

Citations: 678, FWCI: 37.7275

ImageBind: One Embedding Space to Bind Them All

Citations: 678, FWCI: 77.9022

CLIP4Clip: An empirical study of CLIP for end to end video clip retrieval and captioning

Citations: 664, FWCI: 63.2724

Vector Quantized Diffusion Model for Text-to-Image Synthesis

Citations: 615, FWCI: 34.6573

4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

Citations: 520, FWCI: 291.7616

Generating Diverse and Natural 3D Human Motions from Text

Citations: 470, FWCI: 25.7474

Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Citations: 468, FWCI: 53.9091

Metaverse: Perspectives from graphics, interactions and visualization

Citations: 381, FWCI: 41.6748