Topic: Video Analysis and Summarization

This cluster of papers focuses on the automatic analysis and summarization of video content, covering topics such as shot boundary detection, user attention models, semantic analysis, key frame extraction, event detection, and the application of these techniques to soccer videos. It also explores the use of the MPEG-7 standard and content-based retrieval methods in video summarization.
Latest publications
FuzzySeek: Multimodal Refinement of Imprecise Video Queries for Moment Retrieval

RankCut: A Ranking-Based LLM Approach to Extractive Summarization for Transcript-Based Video Editing

MCBLT: Multi-Camera Multi-Object 3D Tracking in Long Videos

AIM 2025 Challenge on Screen-Content Video Quality Assessment: Methods and Results

Task-Specific Dual-Model Framework for Comprehensive Traffic Safety Video Description and Analysis

ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling

Blast from The Past: MOQ Live Streaming with AI-Generated Event Timeline

MI-Cap: A Multi-Modal Interpretable Model for Video Captioning

Fast Track Anything With Sparse Spatio-Temporal Propagation for Unified Video Segmentation

Deep Learning for Video Anomaly Detection: A Review

Highly cited publications from the past 5 years
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation

Citations: 1786; FWCI: 216.9884

Blended Diffusion for Text-driven Editing of Natural Images

Citations: 651; FWCI: 37.5976

Imagic: Text-Based Real Image Editing with Diffusion Models

Citations: 645; FWCI: 78.5059

CLIP4Clip: An empirical study of CLIP for end to end video clip retrieval and captioning

Citations: 638; FWCI: 62.8936

ImageBind: One Embedding Space to Bind Them All

Citations: 633; FWCI: 76.9414

Vector Quantized Diffusion Model for Text-to-Image Synthesis

Citations: 582; FWCI: 34.1506

4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

Citations: 440; FWCI: 283.2564

Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Citations: 431; FWCI: 52.7675

Generating Diverse and Natural 3D Human Motions from Text

Citations: 422; FWCI: 24.5462

Metaverse: Perspectives from graphics, interactions and visualization

Citations: 376; FWCI: 42.0703