Topic: Music and Audio Processing

This cluster of papers focuses on the classification and analysis of audio signals, including music genre classification, environmental sound recognition, melody extraction, and acoustic scene classification. It explores techniques such as deep learning, convolutional neural networks, and feature extraction for music information retrieval.
Latest Papers
CDCGM: Composition-specified Dance Choreography Generation from Music

article

Non-invasive acoustic classification of adult asthma using an XGBoost model with vocal biomarkers

article

Advancing Radar Echo Extrapolation with Hypergraph-enhanced Latent Diffusion Model

article

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching

article

A pop music intelligent composition method based on adaptive multimodal particle swarm optimization algorithm

article

CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages

article

Predictive processes shape individual musical preferences

article

Listener Acoustic Personalisation Challenge - LAP24: Head-Related Transfer Function Upsampling

article

High-Resolution Time-Frequency Feature Selection and EEG Augmented Deep Learning for Motor Imagery Recognition

article

Style transfer with diffusion models for synthetic-to-real domain adaptation

article

Highly Cited Papers from the Past 5 Years
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units

article, 1300 citations, FWCI 124.633

WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing

article, 856 citations, FWCI 105.711

A survey of transformers

article, 765 citations, FWCI 252.286

Robust Speech Recognition via Large-Scale Weak Supervision

preprint, 735 citations, FWCI 0

A Transformer-based Framework for Multivariate Time Series Representation Learning

article, 692 citations, FWCI 75.373

AST: Audio Spectrogram Transformer

article, 582 citations, FWCI 65.155

BirdNET: A deep learning solution for avian diversity monitoring

article, 540 citations, FWCI 54.921

Scalable Diffusion Models with Transformers

article, 448 citations, FWCI 135.616

Attention Is All You Need In Speech Separation

article, 431 citations, FWCI 49.364

Unsupervised Cross-Lingual Representation Learning for Speech Recognition

article, 424 citations, FWCI 38.123