专题:Advanced Image and Video Retrieval Techniques

This cluster of papers focuses on the development and evaluation of techniques for extracting, matching, and utilizing local image features for tasks such as object recognition, image retrieval, and scene classification. It covers a wide range of methods including local descriptors, deep learning approaches, binary codes, and cross-modal retrieval techniques.
最新文献
Decoupling foreground and background with Siamese ViT networks for weakly-supervised semantic segmentation

article Full Text OpenAlex

DeepCluE: Enhanced Deep Clustering via Multi-Layer Ensembles in Neural Networks

article Full Text OpenAlex

Few-Shot Panoptic Segmentation With Foundation Models

article Full Text OpenAlex

Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation

article Full Text OpenAlex

MCG-RTDETR: Multi-Convolution and Context-Guided Network with Cascaded Group Attention for Object Detection in Unmanned Aerial Vehicle Imagery

article Full Text OpenAlex

RecDiffusion: Rectangling for Image Stitching with Diffusion Models

article Full Text OpenAlex

How to Handle Sketch-Abstraction in Sketch-Based Image Retrieval?

article Full Text OpenAlex

Learning Occupancy for Monocular 3D Object Detection

article Full Text OpenAlex

Transcending Fusion: A Multiscale Alignment Method for Remote Sensing Image–Text Retrieval

article Full Text OpenAlex

Mask-Guided Local–Global Attentive Network for Change Detection in Remote Sensing Images

article Full Text OpenAlex

近5年高被引文献
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

article Full Text OpenAlex 25813 FWCI1151.17162172

Learning Multiple Layers of Features from Tiny Images

dissertation Full Text OpenAlex 25434 FWCI0

A Multi-Modal Distributed Real-Time IoT System for Urban Traffic Control (Invited Paper)

preprint Full Text OpenAlex 14153 FWCI1663.10426411

YOLOv7: Trainable Bag-of-Freebies Sets New State-of-the-Art for Real-Time Object Detectors

article Full Text OpenAlex 9475 FWCI1692.305563

A ConvNet for the 2020s

article Full Text OpenAlex 5683 FWCI392.2404136

Reading digits in natural images with unsupervised feature learning

article Full Text OpenAlex 4548 FWCI0

Emerging Properties in Self-Supervised Vision Transformers

article Full Text OpenAlex 4220 FWCI365.40672601

ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual–Inertial, and Multimap SLAM

article Full Text OpenAlex 3236 FWCI710.93853377

SegFormer: Simple and Efficient Design for Semantic Segmentation with\n Transformers

preprint Full Text OpenAlex 3103 FWCI226.51836541

Object Detection in 20 Years: A Survey

article Full Text OpenAlex 2427 FWCI408.3369552