专题:Visual Attention and Saliency Detection

This cluster of papers focuses on computational modeling and detection of visual saliency, including topics such as saliency detection, visual attention, deep learning for salient object detection, analysis of eye movements, image and video segmentation, and the interplay between bottom-up and top-down attention mechanisms.
最新文献
CoTracker3: Simpler and Better Point Tracking by Pseudo-Labeling Real Videos

article Full Text OpenAlex

Early Depth Engagement in Art Perception: visual dynamics and aesthetic experience

article Full Text OpenAlex

WeakTr: Exploring Plain Vision Transformer for Weakly-Supervised Semantic Segmentation

article Full Text OpenAlex

Audio-visual saliency prediction based on joint adversarial learning and Co-Attention mechanism

article Full Text OpenAlex

A Data-Driven RetinaNet Model for Small Object Detection in Aerial Images

article Full Text OpenAlex

Diff-MEF: Cross-Modal Diffusion Framework With Text Prompts and Semantic Perception for Multi-Exposure Image Fusion

article Full Text OpenAlex

Cross-Modal Fusion with Mixture-of-Experts for Efficient RGB-D Salient Object Detection

article Full Text OpenAlex

Fine-grained Image Quality Assessment for Perceptual Image Restoration

article Full Text OpenAlex

Rethinking Saliency Maps: A Cognitive Human Aligned Taxonomy and Evaluation Framework for Explanations

article Full Text OpenAlex

WaveFormer: Frequency-Time Decoupled Vision Modeling with Wave Equation

article Full Text OpenAlex

近5年高被引文献
Segment Anything

article Full Text OpenAlex 8614 FWCI997.4269

A Comprehensive Review of YOLO Architectures in Computer Vision: From YOLOv1 to YOLOv8 and YOLO-NAS

review Full Text OpenAlex 2421 FWCI280.396

Attention mechanisms in computer vision: A survey

article Full Text OpenAlex 2301 FWCI217.751

Image fusion in the loop of high-level vision tasks: A semantic-aware real-time infrared and visible image fusion network

article Full Text OpenAlex 941 FWCI97.1057

Visual attention network

article Full Text OpenAlex 940 FWCI105.163

PIAFusion: A progressive infrared and visible image fusion network based on illumination aware

article Full Text OpenAlex 920 FWCI92.2448

InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

article Full Text OpenAlex 872 FWCI100.8777

Vision Transformer with Deformable Attention

article Full Text OpenAlex 835 FWCI45.891

What is XR? Towards a Framework for Augmented and Virtual Reality

article Full Text OpenAlex 819 FWCI83.3267

MaxViT: Multi-axis Vision Transformer

book-chapter Full Text OpenAlex 769 FWCI98.7217