专题:Human Pose and Action Recognition

This cluster of papers focuses on the development and application of deep learning techniques for human action recognition and pose estimation. It covers topics such as spatiotemporal feature learning, convolutional networks, 3D human pose estimation, skeleton-based recognition, and video classification. The research aims to advance the understanding and accurate detection of human actions in various environments.
最新文献
Adv-Cpg: A Customized Portrait Generation Framework with Facial Adversarial Attacks

article Full Text OpenAlex

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

article Full Text OpenAlex

CNN2GNN: How to Bridge CNN with GNN

article Full Text OpenAlex

Momentum Contrast for Unsupervised Visual Representation Learning

article Full Text OpenAlex

Computer-vision research powers surveillance technology

article Full Text OpenAlex

Health Gaming based Activity Recognition Using Body-Worn Sensors via Artificial Neural Network

article Full Text OpenAlex

BaboonLand Dataset: Tracking Primates in the Wild and Automating Behaviour Recognition from Drone Videos

article Full Text OpenAlex

Towards Cultural Preservation of Traditional Motion Knowledge through Automated Annotations with MoRTELaban

article Full Text OpenAlex

Reinforced Intelligence Through Active Interaction in Real World: A Survey on Embodied AI

article Full Text OpenAlex

When language and vision meet road safety: Leveraging multimodal large language models for video-based traffic accident analysis

article Full Text OpenAlex

近5年高被引文献
Masked Autoencoders Are Scalable Vision Learners

article Full Text OpenAlex 4564 FWCI595.996

Learning Transferable Visual Models From Natural Language Supervision

preprint Full Text OpenAlex 4258 FWCI0

ViViT: A Video Vision Transformer

article Full Text OpenAlex 1656 FWCI87.049

Point Transformer

article Full Text OpenAlex 1388 FWCI156.27

Video Swin Transformer

article Full Text OpenAlex 1283 FWCI106.645

Sparse R-CNN: End-to-End Object Detection with Learnable Proposals

article Full Text OpenAlex 1071 FWCI56.122

Transformer Tracking

article Full Text OpenAlex 985 FWCI51.465

Is Space-Time Attention All You Need for Video Understanding?

preprint Full Text OpenAlex 890 FWCI0

Learning Spatio-Temporal Transformer for Visual Tracking

article Full Text OpenAlex 762 FWCI39.047

TrackFormer: Multi-Object Tracking with Transformers

article Full Text OpenAlex 686 FWCI52.943