专题:Video Analysis and Summarization

This cluster of papers focuses on the automatic analysis and summarization of video content, covering topics such as shot boundary detection, user attention models, semantic analysis, key frame extraction, event detection, and the application of these techniques to soccer videos. It also explores the use of the MPEG-7 standard and content-based retrieval methods in video summarization.
最新文献
A survey of visual insight mining: Connecting data and insights via visualization

article Full Text OpenAlex

T2VEval: Benchmark dataset and objective evaluation method for T2V-generated videos

article Full Text OpenAlex

Optimization of maximum parallel K-means algorithm based on GPU cluster

article Full Text OpenAlex

Adapting Text-to-Image Generation with Feature Difference Instruction for Generic Image Restoration

article Full Text OpenAlex

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

article Full Text OpenAlex

Rethinking Training for De-biasing Text-to-Image Generation: Unlocking the Potential of Stable Diffusion

article Full Text OpenAlex

BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations

article Full Text OpenAlex

Relation-Rich Visual Document Generator for Visual Information Extraction

article Full Text OpenAlex

AIGV-Assessor: Benchmarking and Evaluating the Perceptual Quality of Text-to-Video Generation with LMM

article Full Text OpenAlex

Koala-36M : A Large-Scale Video Dataset Improving Consistency between Fine-Grained Conditions and Video Content

article Full Text OpenAlex

近5年高被引文献
SDP: Session Description Protocol

report Full Text OpenAlex 1337 FWCI0

DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation

article Full Text OpenAlex 1184 FWCI207.866

End-to-End Video Instance Segmentation with Transformers

article Full Text OpenAlex 576 FWCI32.294

Imagic: Text-Based Real Image Editing with Diffusion Models

article Full Text OpenAlex 469 FWCI83.099

Blended Diffusion for Text-driven Editing of Natural Images

article Full Text OpenAlex 463 FWCI38.048

CLIP4Clip: An empirical study of CLIP for end to end video clip retrieval and captioning

article Full Text OpenAlex 454 FWCI52.01

Vector Quantized Diffusion Model for Text-to-Image Synthesis

article Full Text OpenAlex 428 FWCI35.076

ImageBind One Embedding Space to Bind Them All

article Full Text OpenAlex 371 FWCI65.289

AI Choreographer: Music Conditioned 3D Dance Generation with AIST++

article Full Text OpenAlex 339 FWCI112.406

Improving Tag-Clouds as Visual Information Retrieval Interfaces

preprint Full Text OpenAlex 324 FWCI0