no |
title |
author |
magazine |
year |
volume |
issue |
page(s) |
type |
1 |
Advanced acoustic modelling techniques in MP3 speech recognition
|
Borsky, Michal |
|
2015 |
2015 |
1 |
p. 1-7 |
article |
2 |
Albayzín-2014 evaluation: audio segmentation and classification in broadcast news domains
|
Castán, Diego |
|
2015 |
2015 |
1 |
p. 1-9 |
article |
3 |
A multichannel diffuse power estimator for dereverberation in the presence of multiple sources
|
Braun, Sebastian |
|
2015 |
2015 |
1 |
p. 1-14 |
article |
4 |
An acoustic data transmission system based on audio data hiding: method and performance evaluation
|
Cho, Kiho |
|
2015 |
2015 |
1 |
p. 1-14 |
article |
5 |
An improved i-vector extraction algorithm for speaker verification
|
Li, Wei |
|
2015 |
2015 |
1 |
p. 1-9 |
article |
6 |
An investigation of supervector regression for forensic voice comparison on small data
|
Huang, Chee Cheun |
|
2015 |
2015 |
1 |
p. 1-15 |
article |
7 |
A novel hybrid of genetic algorithm and ANN for developing a high efficient method for vocal fold pathology diagnosis
|
Majidnezhad, Vahid |
|
2015 |
2015 |
1 |
p. 1-11 |
article |
8 |
A signal subspace approach to spatio-temporal prediction for multichannel speech enhancement
|
Borowicz, Adam |
|
2015 |
2015 |
1 |
p. 1-12 |
article |
9 |
Biomimetic spectro-temporal features for music instrument recognition in isolated notes and solo phrases
|
Patil, Kailash |
|
2015 |
2015 |
1 |
p. 1-13 |
article |
10 |
Deep neural network-based bottleneck feature and denoising autoencoder-based dereverberation for distant-talking speaker identification
|
Zhang, Zhaofeng |
|
2015 |
2015 |
1 |
p. 1-13 |
article |
11 |
Emotion in the singing voice—a deeperlook at acoustic features in the light ofautomatic classification
|
Eyben, Florian |
|
2015 |
2015 |
1 |
p. 1-9 |
article |
12 |
Erratum to: Efficient voice activity detection algorithm using long-term spectral flatness measure
|
Ma, Yanna |
|
2015 |
2015 |
1 |
p. 1 |
article |
13 |
Evaluation of linguistic and prosodic features for detection of Alzheimer’s disease in Turkish conversational speech
|
Khodabakhsh, Ali |
|
2015 |
2015 |
1 |
p. 1-15 |
article |
14 |
Exploiting foreign resources for DNN-based ASR
|
Motlicek, Petr |
|
2015 |
2015 |
1 |
p. 1-10 |
article |
15 |
Exploiting spectro-temporal locality in deep learning based acoustic event detection
|
Espi, Miquel |
|
2015 |
2015 |
1 |
p. 1-12 |
article |
16 |
Lightweight multi-DOA tracking of mobile speech sources
|
Rascon, Caleb |
|
2015 |
2015 |
1 |
p. 1-16 |
article |
17 |
Multimodal voice conversion based on non-negative matrix factorization
|
Masaka, Kenta |
|
2015 |
2015 |
1 |
p. 1-9 |
article |
18 |
Noisy training for deep neural networks in speech recognition
|
Yin, Shi |
|
2015 |
2015 |
1 |
p. 1-14 |
article |
19 |
Phone recognition with hierarchical convolutional deep maxout networks
|
Tóth, László |
|
2015 |
2015 |
1 |
p. 1-13 |
article |
20 |
Physical task stress and speaker variability in voice quality
|
Godin, Keith W. |
|
2015 |
2015 |
1 |
p. 1-13 |
article |
21 |
Regularized minimum class variance extreme learning machine for language recognition
|
Xu, Jiaming |
|
2015 |
2015 |
1 |
p. 1-10 |
article |
22 |
Robust design of Farrow-structure-based steerable broadband beamformers with sparse tap weights via convex optimization
|
Wang, Tiannan |
|
2015 |
2015 |
1 |
p. 1-17 |
article |
23 |
Semi-fragile digital speech watermarking for online speaker recognition
|
Nematollahi, Mohammad Ali |
|
2015 |
2015 |
1 |
p. 1-15 |
article |
24 |
SIFT-based local spectrogram image descriptor: a novel feature for robust music identification
|
Zhang, Xiu |
|
2015 |
2015 |
1 |
p. 1-15 |
article |
25 |
Simulation of tremulous voices using a biomechanical model
|
Fraile, Rubén |
|
2015 |
2015 |
1 |
p. 1-12 |
article |
26 |
Singer identification using perceptual features and cepstral coefficients of an audio signal from Indian video songs
|
Ratanpara, Tushar |
|
2015 |
2015 |
1 |
p. 1-12 |
article |
27 |
Small-parallel exemplar-based voice conversion in noisy environments using affine non-negative matrix factorization
|
Aihara, Ryo |
|
2015 |
2015 |
1 |
p. 1-9 |
article |
28 |
Speech enhancement based on Bayesian decision and spectral amplitude estimation
|
Deng, Feng |
|
2015 |
2015 |
1 |
p. 1-18 |
article |
29 |
Speech signal modeling using multivariate distributions
|
Aroudi, Ali |
|
2015 |
2015 |
1 |
p. 1-14 |
article |
30 |
Spoken term detection ALBAYZIN 2014 evaluation: overview, systems, results, and discussion
|
Tejedor, Javier |
|
2015 |
2015 |
1 |
p. 1-27 |
article |
31 |
Stereo-based histogram equalization for robust speech recognition
|
Al-Wakeel, Randa |
|
2015 |
2015 |
1 |
p. 1-10 |
article |
32 |
The Latin Music Mood Database
|
dos Santos, Carolina L. |
|
2015 |
2015 |
1 |
p. 1-11 |
article |
33 |
ViSQOL: an objective speech quality model
|
Hines, Andrew |
|
2015 |
2015 |
1 |
p. 1-18 |
article |
34 |
Voice conversion using speaker-dependent conditional restricted Boltzmann machine
|
Nakashika, Toru |
|
2015 |
2015 |
1 |
p. 1-12 |
article |
35 |
Within and cross-corpus speech emotion recognition using latent topic model-based features
|
Shah, Mohit |
|
2015 |
2015 |
1 |
p. 1-17 |
article |