CSE 794L WI 05 Reading List
The following list includes both required and optional readings for
CSE 794L. We will use the Holmes & Holmes book as our primary
text, but if you want more background on some of these topics, you
should consult Huang et al., Gold and Morgan, or Jurafsky and Martin,
depending on the topic.
We will choose papers to discuss on Thursdays from the
optional list, which may grow as people (including me) make
suggestions. Please let me know if there are other papers or topics
you'd like to see.
Required readings
- [HH] Holmes & Holmes, Speech Synthesis and Recognition,
Taylor and Francis, London, 2001.
- [HLT] Cole et al., Survey of the State of the Art in Human
Language Technology, 1996.
- [PR] Pereira & Riley, Speech recognition by composition of weighted finite automata. In Emmanuel Roche and Yves Schabes, editors, Finite-State Language Processing, pages 431-453. MIT Press, Cambridge, Massachusetts, 1997.
- [HTK] Young et al., The HTK Book, version 3.2.1, December
2002. (Available as: full book; Chapters 1-3; 2-up.)
- [SONIC] Pellom & Hacioglu, ``Sonic: The University of Colorado Continuous Speech Recognizer,'' Center for Spoken Language Research
Technical Report TR-CSLR-2001-01, U. Colorado, 2003 (revised).
- [Rab] Rabiner, L. ``A tutorial on Hidden Markov Models and
Selected Applications in Speech Recognition,'' Proceedings of the
IEEE, 1989.
- [Yng] Young, et al. ``Token passing: a simple conceptual model for
connected speech recognition systems.'' Cambridge University
Engineering Department Technical Report TR-38.
(http://mi.eng.cam.ac.uk/reports/abstracts/speech/young_tr38.html).
Reference Texts
- [HAH] Huang, Acero and Hon, Spoken Language Processing: A Guide to Theory, Algorithm, and System Development, Prentice Hall, 2001.
- [JM] Jurafsky & Martin, Speech and Language Processing,
Prentice Hall, 2000.
- [RJ] Rabiner & Juang, Fundamentals of Speech Recognition,
Prentice Hall, Englewood Cliffs, NJ, 1993.
- [GM] Gold & Morgan, Speech and Audio Signal Processing,
Wiley and Sons, New York, 2000.
- [Jel] F. Jelinek, Statistical Methods for Speech Recognition, MIT Press, Cambridge, MA, 1999.
Optional Readings
- [AT&T] Mohri & Riley, ``Weighted Finite-State Transducers in
Speech Recognition (Tutorial),'' International Conference on Spoken
Language Processing 2002 (ICSLP '02). Denver, Colorado, September
2002. (Part 1, Part 2)
- [GMTK] Bilmes, ``GMTK: The Graphical Models Toolkit Manual,'' October 2002.
- [YngLVR] Young, S. ``Large vocabulary continuous speech
recognition: A review,'' In Proceedings of the IEEE Workshop on
Automatic Speech Recognition and Understanding, pages 3--28, Snowbird,
Utah, December 1995. IEEE. Reprinted in Signal Processing Magazine, 13(5), September 1996.
- [PR] Pereira & Riley, Speech Recognition by Composition of Weighted Finite Automata, in E. Roche and Y. Schabes, eds., Finite-state language processing, MIT Press, Cambridge, MA, 1997.
- [Fos] Fosler-Lussier, E. ``A tutorial on pronunciation modeling
for large vocabulary speech recognition,'' in Text and Speech
Triggered Information Access, Springer-Verlag, 2003.
- [Brn] Brown, P., et al., ``Class-based n-gram models of natural
language,'' Computational Linguistics, 18(4), 467-479, December
1992.
- [Jur] Jurafsky, D., et al., ``Using a stochastic context-free grammar as a language model for speech recognition,'' Proceedings of ICASSP, 1995.
- [Myr] Myrvoll, Tor Andre, ``Adaptation Techniques in Automatic Speech
Recognition,'' Telektronikk 99(2), Issue on Spoken Language Technology in
Telecommunications, 2003.
- [Ros] Rosenfeld, R. ``Two decades of statistical language modeling:
Where do we go from here?'' Proceedings of the IEEE, 88(8),
2000.
- [VOICEXML] VoiceXML Tutorial
- [CG] Chen, S. and Gopalakrishnan, P. ``Clustering via the Bayesian Information Criterion with applications in speech recognition,'' Proceedings of ICASSP, pp 645-648, 1998.
- [HM] Hermansky, H. and Morgan, N. ``RASTA processing of speech,'' IEEE Transactions on Speech and Audio Processing, 2(4):578-589, Oct. 1994.
- PLACEHOLDER: Tandem Systems
- PLACEHOLDER: Chelba paper