Topic: Speech and Audio Processing

This cluster of papers covers advances in speech enhancement and related areas, including audio-visual speech recognition, deep learning methods, noise reduction, source separation, reverberation handling, objective quality measures, beamforming, and lipreading. Together, the papers address improving the quality and intelligibility of speech signals in challenging acoustic environments.
Latest Publications
Frequency-Direction Aware Multichannel Selective Fixed-Filter Active Noise Control Based on Multi-Task Learning

article Full Text OpenAlex

Identifying the Desired Word Suggestion in Simultaneous Audio

article Full Text OpenAlex

Advancing Radar Echo Extrapolation with Hypergraph-enhanced Latent Diffusion Model

article Full Text OpenAlex

Frequency-Based Decoupling and Modeling of BLE RSS Measurements for Indoor Positioning

article Full Text OpenAlex

Band gap analysis and prediction for phononic metamaterials with different spiral shapes based on transfer learning

article Full Text OpenAlex

Single-Sided Deafness and Cochlear Implants: Performance in a Novel Combined Speech-in-Noise and Localization Task

article Full Text OpenAlex

Underwater acoustic signal recognition system with multi-scale hybrid cepstral feature strategy and joint deep network

article Full Text OpenAlex

Hierarchical Channel Estimation for Near-field Spatial Non-stationary Channels: A Pre-selection and Multi-level Dynamic Threshold Strategy

article Full Text OpenAlex

Broadband Passive Sonar Track-Before-Detect Using Raw Acoustic Data

article Full Text OpenAlex

Wavenet-Volterra Neural Network for Active Noise Control: A Fully Causal Approach

preprint Full Text OpenAlex

Highly Cited Publications (Past 5 Years)
Enhancements in Immediate Speech Emotion Detection: Harnessing Prosodic and Spectral Characteristics

article Full Text OpenAlex 1662 FWCI 1375.44

HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units

article Full Text OpenAlex 1300 FWCI 124.633

WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing

article Full Text OpenAlex 856 FWCI 105.711

Robust Speech Recognition via Large-Scale Weak Supervision

preprint Full Text OpenAlex 735 FWCI 0

Reduction in Transmission Power of Base Transceiver Station

article Full Text OpenAlex 717 FWCI 106.255

AST: Audio Spectrogram Transformer

article Full Text OpenAlex 582 FWCI 65.155

Attention Is All You Need In Speech Separation

article Full Text OpenAlex 431 FWCI 49.364

Unsupervised Cross-Lingual Representation Learning for Speech Recognition

article Full Text OpenAlex 424 FWCI 38.123

SpeechBrain: A General-Purpose Speech Toolkit

preprint Full Text OpenAlex 424 FWCI 0

Semantic Communication Systems for Speech Transmission

article Full Text OpenAlex 351 FWCI 35.07