专题:Human Pose and Action Recognition

This cluster of papers focuses on the development and application of deep learning techniques for human action recognition and pose estimation. It covers topics such as spatiotemporal feature learning, convolutional networks, 3D human pose estimation, skeleton-based recognition, and video classification. The research aims to advance the understanding and accurate detection of human actions in various environments.
最新文献
VQualA 2025 Challenge on GenAI-Bench AIGC Video Quality Assessment: Methods and Results

article Full Text OpenAlex

Video Decoupling Networks for Accurate, Efficient, Generalizable, and Robust Video Object Segmentation

article Full Text OpenAlex

A Decade of Action Quality Assessment: Largest Systematic Survey of Trends, Challenges, and Future Directions

article Full Text OpenAlex

Fast Track Anything With Sparse Spatio-Temporal Propagation for Unified Video Segmentation

article Full Text OpenAlex

Human Motion Prediction via Continual Prior Compensation

article Full Text OpenAlex

SMOTE-Enhanced CNN-Bi-LSTM for wearable sensor-based human activity recognition

article Full Text OpenAlex

Deep Learning for Video Anomaly Detection: A Review

article Full Text OpenAlex

Towards Nation-Wide Analytical Healthcare Infrastructures: A Privacy-Preserving Augmented Knee Rehabilitation Case Study

book-chapter Full Text OpenAlex

Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding

article Full Text OpenAlex

A Survey on Deep Learning for Group-Level Emotion Recognition

article Full Text OpenAlex

近5年高被引文献
Video Swin Transformer

article Full Text OpenAlex 1789 FWCI103.7709

Improved Baselines with Visual Instruction Tuning

article Full Text OpenAlex 965 FWCI245.934

TrackFormer: Multi-Object Tracking with Transformers

article Full Text OpenAlex 900 FWCI49.0727

SLEAP: A deep learning system for multi-animal pose tracking

article Full Text OpenAlex 833 FWCI79.6448

Revisiting Skeleton-based Action Recognition

article Full Text OpenAlex 729 FWCI41.7717

Frozen in time: A joint video and image encoder for end-to-end retrieval

article Full Text OpenAlex 707 FWCI0

Learning robust perceptive locomotion for quadrupedal robots in the wild

article Full Text OpenAlex 684 FWCI54.341

LightGlue: Local Feature Matching at Light Speed

article Full Text OpenAlex 674 FWCI82.0546

SMPL: A Skinned Multi-Person Linear Model

book-chapter Full Text OpenAlex 666 FWCI1008.1432

Human Action Recognition and Prediction: A Survey

article Full Text OpenAlex 650 FWCI50.9799