You are here

Publications

Export 5 results:
Author Title [ Type(Desc)] Year
Conference Paper
R. Milner, Saz, O., Deena, S., Doulaty, M., Ng, R., and Hain, T., The 2015 Sheffield System for Longitudinal Diarisation of Broadcast Media, in {Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)}, Scottsdale, AZ, 2015.
O. Saz, Doulaty, M., Deena, S., Milner, R., Ng, R., Hasan, M., Liu, Y., and Hain, T., The 2015 Sheffield System for Transcription of Multi–Genre Broadcast Media, in {Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)}, Scottsdale, AZ, 2015.
L. Lu, Ghoshal, A., and Renals, S., Acoustic Data-driven Pronunciation Lexicon for Large Vocabulary Speech Recognition, in Proc. IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 2013.
, A., G., and S., R., Acoustic Data-driven Pronunciation Lexicon for Large Vocabulary Speech Recognition, in Proc. ASRU, 2013.
P. Karanasou, Wang, Y., Gales, M., and Woodland, P., Adaptation of Deep Neural Network Acoustic Models Using Factorised I-vectors, in Proceedings of Interspeech’14, 2014.
I. Casanueva, Christensen, H., Hain, T., and Green, P., "Adaptive speech recognition and dialogue management for users with speech disorders, in Proceedings of Interspeech'14, 2014.
M. Wester, Valentini-Botinhao, C., and Henter, G. Eje, Are we using enough listeners? No! An empirically-supported critique of Interspeech 2014 TTS evaluations, in Proc. of Interspeech, Dresden, 2015.
M. Wester, Aylett, M., Tomalin, M., and Dall, R., Artificial Personality and Disfluency, in Proc. of Interspeech, Dresden, 2015.
O. Saz and Hain, T., Asynchronous Factorisation of Speaker and Background with Feature Transforms in Speech Recognition, in {Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech)}, Lyon, France, 2013, pp. 1238–1242.
O. Saz and Hain, T., Asynchronous factorisation of speaker and background with feature transforms in speech recognition, in Proceedings of Interspeech 2013, Lyon, France, 2013.
T. Merritt, Latorre, J., and King, S., Attributing modelling errors in HMM synthesis by stepping gradually from natural to modelled speech, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brisbane, 2015.
M. Doulaty, Saz, O., Ng, R. W. M., and Hain, T., Automatic Genre and Show Identification of Broadcast Media, in Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech), San Francisco, California, USA, 2016.
H. Christensen, Casanueva, I., Cunningham, S., Green, P., and Hain, T., Automatic Selection of Speakers for Improved Acoustic Modelling : Recognition of Disordered Speech with Sparse Data, in Spoken Language Technology Workshop, SLT'14, Lake Tahoe, 2014.
P. Lanchantin, Bell, P. - J., Gales, M. - J. - F., Hain, T., Liu, X., Long, Y., Quinnell, J., Renals, S., Saz, O., Seigel, M. - S., Swietojanski, P., and Woodland, P. - C., Automatic Transcription of Multi-genre Media Archives, in Proceedings of SLAM Workshop, Marseille, France, 2013.
P. Lanchantin, Bell, P. J., Gales, M. J. F., Hain, T., Liu, X., Long, Y., Quinnell, J., Renals, S., Saz, O., Seigel, M. S., Swietojanski, P., and Woodland, P. C., Automatic Transcription of Multi-Genre Media Archives, in {Proceedings of the First Workshop on Speech, Language and Audio in Multimedia}, Marseille, France, 2013, pp. 26–31.
O. Saz, Doulaty, M., and Hain, T., Background-Tracking Acoustic Features for Genre Identification of Broadcast Shows, in Proceedings of the 2014 Spoken Language Technology (SLT) Workshop, South Lake Tahoe NV, USA, 2014, pp. 118–123.
O. Saz, Doulaty, M., and Hain, T., Background-tracking acoustic features for genre identification of broadcast shows, in Proceedings of the 2014 IEEE Spoken Language Technology Workshop (SLT), South Lake Tahoe, NV, 2014, pp. 118–123.
P. C. Woodland, Liu, X., Qian, Y., Zhang, C., Gales, M. J. F., Karanasou, P., Lanchantin, P., and Wang, L., Cambridge University Transcription Systems for the Multi-Genre Broadcast Challenge, in Proc. of ASRU, Scottsdale, USA, 2015.
H. Lu, King, S., and Watts, O., Combining a Vector Space Representation of Linguistic Context with a Deep Neural Network for Text-To-Speech Synthesis, in 8th ISCA Workshop on Speech Synthesis, Barcelona, Spain, 2013, pp. 281–285.
and Hain, T., Combining Feature and Model-Based Adaptation of RNNLMs for Multi-Genre Broadcast Speech Recognition, in Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech), San Francisco, California, USA, 2016.
S. Deena, Hasan, M., Doulaty, M., Saz, O., and Hain, T., Combining Feature and Model-Based Adaptation of RNNLMs for Multi-Genre Broadcast Speech Recognition, in Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech), San Francisco, California, USA, 2016.
H. Christensen, Aniol, M. B., Bell, P., Green, P., Hain, T., King, S., and Swietojanski, P., Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech, in Interspeech'13, 2013.
R. W. M. Ng, Chettri, B., and Hain, T., Combining weak tokenisers for phonotactic language recognition in a resource-constrained setting, in Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech), San Francisco, CA, 2016.
H. Christensen, Cunningham, S., Fox, C., Green, P., and Hain, T., A comparative study of adaptive, automatic recognition of disordered speech, in Proc Interspeech 2012, Portland, Oregon, US, 2012.

Pages