Topic: Human Pose and Action Recognition

This cluster of papers covers deep learning techniques for human action recognition and pose estimation, including spatiotemporal feature learning, convolutional networks, 3D human pose estimation, skeleton-based recognition, and video classification. The research aims to improve the detection and understanding of human actions in varied environments.
Latest Publications
Improved Baselines with Visual Instruction Tuning

article

AttX: Attentive Cross-Connections for Fusion of Wearable Signals in Emotion Recognition

article

Learning in Deep Radial Basis Function Networks

article

MLMSign: Multi-lingual multi-modal illumination-invariant sign language recognition

article

IME: Integrating Multi-curvature Shared and Specific Embedding for Temporal Knowledge Graph Completion

article

Stealthy Targeted Backdoor Attacks Against Image Captioning

article

MADE: Multicurvature Adaptive Embedding for Temporal Knowledge Graph Completion

article

HDA-pose: a real-time 2D human pose estimation method based on modified YOLOv8

article

Image Caption Generation using Vision Transformer and GPT Architecture

article

POCO: 3D Pose and Shape Estimation with Confidence

article

Highly Cited Publications from the Past 5 Years
Learning Transferable Visual Models From Natural Language Supervision

preprint, 5296 citations, FWCI 0

UCF101: A Dataset of 101 Human Actions Classes from Videos in the Wild

preprint, 4433 citations, FWCI 0

Deep Learning for Person Re-Identification: A Survey and Outlook

review, 1871 citations, FWCI 142.39173421

Point Transformer

article, 1820 citations, FWCI 192.7589499

Video Swin Transformer

article, 1641 citations, FWCI 113.02544571

Sparse R-CNN: End-to-End Object Detection with Learnable Proposals

article, 1291 citations, FWCI 103.13945428

Is Space-Time Attention All You Need for Video Understanding?

preprint, 1211 citations, FWCI 0

TrackFormer: Multi-Object Tracking with Transformers

article, 829 citations, FWCI 53.71643052

Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition

article, 793 citations, FWCI 36.86688405

Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking

article, 768 citations, FWCI 64.60271071