You are here

Publications

Export 5 results:
Author Title Type [ Year(Asc)]
2016
T. Merritt, Clark, R. A. J., Wu, Z., Yamagishi, J., and King, S., Deep neural network-guided unit selection synthesis, in Proc. ICASSP, 2016.
C. Zhang and Woodland, P. C., DNN Speaker Adaptation using Parameterised Sigmoid and ReLU Hidden Activation Functions, in Proc. ICASSP'16, Shanghai, China, 2016.
M. Nicolao, Christensen, H., Cunningham, S., Green, P., and Hain, T., A framework for collecting realistic recordings of dysarthric speech - the homeService corpus, in The International Conference on Language Resources and Evaluation - LREC 2016, Portorož, SLO, 2016.
O. Watts, Henter, G. Eje, Merritt, T., Wu, Z., and King, S., From HMMs to DNNs: where do the improvements come from?, in Proc. ICASSP, Shanghai, China, 2016, vol. 41.
L. Wang, Zhang, C., Woodland, P. C., Gales, M. J. F., Karanasou, P., Lanchantin, P., Liu, X., and Qian, Y., Improved DNN-based Segmentation for Multi-genre Broadcast Audio, in Proc. ICASSP'16, Shanghai, China, 2016.
G. Eje Henter, Ronanki, S., Watts, O., Wester, M., Wu, Z., and King, S., Robust TTS duration modelling using DNNs, in Proc. ICASSP, Shanghai, China, 2016, vol. 41.
P. Swietojanski and Renals, S., SAT-LHUC: Speaker Adaptive Training for Learning Hidden Unit Contributions, in Proc. IEEE ICASSP, Shanghai, China, 2016.
R. W. M. Ng, Nicolao, M., Saz, O., Hasan, M., Chettri, B., Doulaty, M., Lee, T., and Hain, T., Sheffield LRE 2015 System Description}, in {Odyssey: The Speaker and Language Recognition Workshop (Submitted)}, 2016.
J. Yang, Zhang, C., Ragni, A., Gales, M. J. F., and Woodland, P. C., System Combiantion with Log-linear Models, in Proc. ICASSP'16, Shanghai, China, 2016.
R. Dall, Brognaux, S., Richmond, K., Valentini-Botinhao, C., Henter, G. Eje, Hirschberg, J., Yamagishi, J., and King, S., Testing the consistency assumption: Pronunciation variant forced alignment in read and spontaneous speech synthesis, in Proc. ICASSP, Shanghai, China, 2016, vol. 41.
2015
R. Milner, Saz, O., Deena, S., Doulaty, M., Ng, R., and Hain, T., The 2015 Sheffield System for Longitudinal Diarisation of Broadcast Media, in {Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)}, Scottsdale, AZ, 2015.
O. Saz, Doulaty, M., Deena, S., Milner, R., Ng, R., Hasan, M., Liu, Y., and Hain, T., The 2015 Sheffield System for Transcription of Multi–Genre Broadcast Media, in {Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)}, Scottsdale, AZ, 2015.
M. Wester, Valentini-Botinhao, C., and Henter, G. Eje, Are we using enough listeners? No! An empirically-supported critique of Interspeech 2014 TTS evaluations, in Proc. of Interspeech, Dresden, 2015.
M. Wester, Aylett, M., Tomalin, M., and Dall, R., Artificial Personality and Disfluency, in Proc. of Interspeech, Dresden, 2015.
T. Merritt, Latorre, J., and King, S., Attributing modelling errors in HMM synthesis by stepping gradually from natural to modelled speech, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brisbane, 2015.
P. C. Woodland, Liu, X., Qian, Y., Zhang, C., Gales, M. J. F., Karanasou, P., Lanchantin, P., and Wang, L., Cambridge University Transcription Systems for the Multi-Genre Broadcast Challenge, in Proc. of ASRU, Scottsdale, USA, 2015.
M. Doulaty, Saz, O., and Hain, T., Data-selective Transfer Learning for Multi-Domain Speech Recognition, in Proceedings of the 16th Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 2015.
L. - H. Chen, Raitio, T., Valentini-Botinhao, C., Ling, Z., and Yamagishi, J., A Deep Generative Architecture for Postfiltering in Statistical Parametric Speech Synthesis, Audio, Speech, and Language Processing, IEEE/ACM Transactions on, vol. 23, pp. 2003-2014, 2015.
T. Merritt, Yamagishi, J., Wu, Z., Watts, O., and King, S., Deep neural network context embeddings for model selection in rich-context HMM synthesis, in Proc. Interspeech, Dresden, Germany, 2015, pp. 2207–2211.
Z. Wu, Valentini-Botinhao, C., Watts, O., and King, S., Deep neural networks employing multi-task learning and stacked bottleneck features for speech synthesis, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2015.
P. Swietojanski and Renals, S., Differentiable Pooling for Unsupervised Speaker Adaptation, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2015.
R. Dall, Wester, M., and Corley, M., Disfluencies in change detection in natural, vocoded and synthetic speech, in Proc. of DiSS 2015, Edinburgh, 2015.
N. Obin, Veaux, C., and Lanchantin, P., Exploiting Alternatives for Text-To-Speech Synthesis: From Machine to Human. Springer Verlag, 2015, pp. 189-202.
L. Lu and Renals, S., Feature-space Speaker Adaptation for Probabilistic Linear Discriminant Analysis Acoustic Models, in Proc. INTERSPEECH, 2015.
C. Zhang and Woodland, P. C., A General Artificial Neural Network Extension for HTK, in Proc. Interspeech'15, Dresden, Germany, 2015.

Pages