Publications at MSP lab
Journal Articles
- S. Mariooryad and C. Busso, "Generating human-like behaviors using joint,
speech-driven models for conversational agents," IEEE Transactions on Audio, Speech and Language
Processing, accepted, 2012.
[soon pdf][soon cited][soon bib]
- C.-C. Lee, E. Mower, C. Busso, S. Lee, and S.S. Narayanan, "Emotion
recognition using a hierarchical binary decision tree approach," Speech Communication, vol. 53, no. 9-10, pp. 1162-1171,
November-December 2011.
[pdf]
[cited]
[bib]
- C. Busso, S. Lee, and S. Narayanan, "Analysis of emotionally salient
aspects of fundamental frequency for emotion detection," IEEE Transactions
on Audio, Speech and Language Processing, vol. 17, no. 4, pp. 582-596, May
2009.
[pdf]
[cited]
[bib]
- C. Busso, M. Bulut, C. Lee, A. Kazemzadeh, E. Mower, S. Kim, J. Chang, S.
Lee, and S. Narayanan, "IEMOCAP: Interactive emotional dyadic motion
capture database," Journal of Language Resources and Evaluation, vol.
42, no. 4, pp. 335-359, December 2008.
[pdf]
[cited]
[bib]
- C. Busso and S. Narayanan, "Interrelation between speech and facial
gestures in emotional utterances: a single subject study," IEEE Transactions
on Audio, Speech and Language Processing, vol. 15, no. 8, pp. 2331-2347, November
2007.
[pdf]
[cited]
[bib]
- C. Busso, Z. Deng, M. Grimm, U. Neumann, and S. Narayanan, "Rigid head
motion in expressive speech animation: Analysis and synthesis," IEEE
Transactions on Audio, Speech and Language Processing, vol. 15, no. 3, pp.
1075-1086, March 2007.
[pdf]
[cited]
[bib]
- N. Yoma, C. Molina, J. Silva, and C. Busso, "Modeling, estimating,
and compensating low-bit rate coding distortion in speech recognition,"
IEEE Transactions on Audio, Speech and Language Processing, vol. 14, no. 1,
pp. 246-255, January 2006.
[pdf]
[cited]
[bib]
- C. Busso, Z. Deng, U. Neumann, and S. Narayanan, "Natural head motion
synthesis driven by acoustic prosodic features," Computer Animation and
Virtual Worlds, vol. 16, no. 3-4, pp. 283-290, July 2005.
[pdf]
[cited]
[bib]
[slides]
- N. Yoma, C. Busso, and I. Soto, "Packet-loss modelling in IP networks
with state-duration constraints," IEE Proceedings - Communications, vol.
152, no. 1, pp. 1-5, Feb 2005.
[pdf]
[cited]
[bib]
- N. Yoma, J. Hood, and C. Busso, "A real-time protocol for the internet
based on the least mean square algorithm," IEEE Transactions on Multimedia,
vol. 6, no. 1, pp. 174-184, Feb 2004.
[pdf]
[cited]
[bib]
- N. Yoma, J. Silva, C. Busso, and I. Brito, "Compensating additive noise
and CS-CELP distortion in speech recognition using stochastic weighted Viterbi
algorithm," IEE Electronics Letters, vol. 39, no. 4, pp. 409-411, Feb
2003.
[pdf]
[cited]
[bib]
Book chapters
- C. Busso and J. Jain, "Advances in multimodal tracking of driver
distraction," in DSP for In-Vehicle Systems & Safety, J. Hansen, P. Boyraz, K. Takeda,
and H. Abut, Eds. Springer, New York, NY, USA, 2011, in press.
[soon link][soon cited]
[bib]
- C. Busso, M. Bulut, and S.S. Narayanan, "Toward effective automatic
recognition systems of emotion in speech," in Social emotions in nature and artifact:
emotions in human and human-computer interaction, S. Marsella and J. Gratch, Eds. Oxford University
Press, New York, NY, USA, 2011.
[soon link][soon cited][soon bib]
- C. Busso, M. Bulut, S. Lee, and S.S. Narayanan, "Fundamental
frequency analysis for speech emotion processing," in The Role of Prosody in
Affective Speech, Sylvie Hancil, Ed., pp. 309-337. Peter Lang Publishing Group,
Berlin, Germany, 2009.
[link-to-pdf]
[cited]
[bib]
- C. Busso, Z. Deng, U. Neumann, and S. Narayanan, "Learning expressive
human-like head motion sequences from speech," in Data-Driven 3D Facial
Animations, Z. Deng and U. Neumann, Eds. Surrey, United Kingdom: Springer-Verlag
London Ltd, 2007, pp. 113-131.
[pdf]
[cited]
[bib]
Conference Proceedings
- S. Mariooryad and C. Busso, "Factorizing speaker, lexical and emotional variabilities
observed in facial expressions," in IEEE International Conference on Image Processing (ICIP 2012), Orlando,
FL, USA, September-October 2012.
[soon pdf][soon cited]
[bib]
- D. Tick, T. Rahman, C. Busso, and N. Gans, "Indoor robotic terrain classification via angular
velocity based hierarchical classifier selection," in IEEE International Conference on Robotics and Automation
(ICRA 2012), St. Paul, MN, USA, May 2012, to appear.
[soon pdf][soon cited]
[bib]
- T. Rahman and C. Busso, "A personalized emotion recognition system
using an unsupervised feature adaptation scheme," in International Conference on Acoustics, Speech, and Signal
Processing (ICASSP 2012), Kyoto, Japan, March 2012, pp. 5117-5120.
[pdf]
[soon cited]
[bib]
[poster]
- J.J. Jain and C. Busso, "Assessment of driver's distraction using
perceptual evaluations, self assessments and multimodal feature analysis," in 5th Biennial
Workshop on DSP for In-Vehicle Systems, Kiel, Germany, September 2011.
[pdf]
[soon cited]
[bib]
[slides]
- T. Rahman, S. Mariooryad, S. Keshavamurthy, G. Liu, J.H.L. Hansen, and C. Busso,
"Detecting sleepiness by fusing classifiers trained with
novel acoustic features," in 12th Annual Conference of the International
Speech Communication Association (Interspeech-2011), Florence, Italy, August 2011, pp. 3285-3288.
[pdf]
[cited]
[bib]
[slides]
- X. Fan, C. Busso, and J.H.L. Hansen, "Audio-visual isolated digit
recognition for whispered speech," in European Signal Processing Conference (EUSIPCO-2011),
Barcelona, Spain, August-September 2011.
[pdf]
[soon cited]
[bib]
[slides]
- J. Jain and C. Busso, "Analysis of driver behaviors during common tasks
using frontal video camera and CAN-Bus information," in IEEE International Conference
on Multimedia and Expo (ICME 2011), Barcelona, Spain, July 2011.
[pdf]
[cited]
[bib]
[poster]
[slides]
[Youtube]
Hewlett Packard Best Paper Award at ICME2011!
- C. Busso, A. Metallinou, and S. Narayanan, "Iterative feature
normalization for emotional speech detection," in International Conference on
Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, May 2011, pp. 5692-5695.
[pdf]
[cited]
[bib]
[poster]
- A. Metallinou, C.-C. Lee, C. Busso, S. Carnicke, and S. Narayanan, "The USC
CreativeIT database: A multimodal database of theatrical improvisation," in Workshop on
Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality (MMC 2010), Valletta,
Malta, May 2010.
[pdf]
[cited]
[bib]
- A. Metallinou, C. Busso, S. Lee, and S. Narayanan, "Visual emotion
recognition using compact facial representations and viseme information," in International
Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, TX, USA,
March 2010, pp. 2474-2477.
[pdf]
[cited]
[bib]
[poster]
- E. Mower, A. Metallinou, C.-C. Lee, A. Kazemzadeh, C. Busso, S. Lee, and
S. Narayanan, "Interpreting ambiguous emotional expressions," in
International Conference on Affective Computing and Intelligent Interaction
(ACII 2009), Amsterdam, The Netherlands, September 2009.
[pdf]
[cited]
[bib]
- C.-C. Lee, E. Mower, C. Busso, S. Lee, and S. Narayanan, "Emotion recognition
using a hierarchical binary decision tree approach," in Interspeech 2009,
Brighton, UK, September 2009, pp. 320-323.
[pdf]
[cited]
[bib]
[slides]
- C.-C. Lee, C. Busso, S. Lee, and S. Narayanan, "Modeling mutual influence
of interlocutor emotion states in dyadic spoken interactions," in Interspeech
2009, Brighton, UK, September 2009, pp. 1983-1986.
[pdf]
[cited]
[bib]
[poster]
- C. Busso and S. Narayanan, "The expression and perception of emotions:
Comparing assessments of self versus others," in Interspeech 2008 - Eurospeech,
Brisbane, Australia, September 2008, pp. 257-260.
[pdf]
[cited]
[bib]
[poster]
- C. Busso and S. Narayanan, "Scripted dialogs versus improvisation:
Lessons learned about emotional elicitation techniques from the IEMOCAP database,"
in Interspeech 2008 - Eurospeech, Brisbane, Australia, September 2008, pp.
1670-1673.
[pdf]
[cited]
[bib]
[poster]
- C. Busso and S. Narayanan, "Recording audio-visual emotional databases
from actors: a closer look," in Second International Workshop on Emotion:
Corpora for Research on Emotion and Affect, International Conference on Language
Resources and Evaluation (LREC 2008), Marrakech, Morocco, May 2008, pp. 17-22.
[pdf]
[cited]
[bib]
[slides]
- C. Busso and S. Narayanan, "Joint analysis of the emotional fingerprint
in the face and speech: A single subject study," in International Workshop
on Multimedia Signal Processing (MMSP 2007), Chania, Crete, Greece, October
2007, pp. 43-47.
[pdf]
[cited]
[bib]
[poster]
- V. Rozgic, C. Busso, P. Georgiou, and S. Narayanan, "Multimodal meeting
monitoring: Improvements on speaker tracking and segmentation through a modified
mixture particle filter," in International Workshop on Multimedia Signal
Processing (MMSP 2007), Chania, Crete, Greece, October 2007, pp. 60-65.
[pdf]
[cited]
[bib]
- C. Busso, S. Lee, and S. Narayanan, "Using neutral speech models for
emotional speech analysis," in Interspeech 2007 - Eurospeech, Antwerp,
Belgium, August 2007, pp. 2225-2228.
[pdf]
[cited]
[bib]
[poster]
- C. Busso, P. Georgiou, and S. Narayanan, "Real-time monitoring of participants
interaction in a meeting using audio-visual sensors," in International
Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), vol.
2, Honolulu, HI, USA, April 2007, pp. 685-688.
[pdf]
[cited]
[bib]
[slides]
- C. Busso and S. Narayanan, "Interplay between linguistic and affective
goals in facial expression during emotional utterances," in 7th International
Seminar on Speech Production (ISSP 2006), Ubatuba-SP, Brazil, December 2006,
pp. 549-556.
[pdf]
[cited]
[bib]
[poster]
- M. Bulut, C. Busso, S. Yildirim, A. Kazemzadeh, C. Lee, S. Lee, and S. Narayanan,
"Investigating the role of phoneme-level modifications in emotional speech
resynthesis," in 9th European Conference on Speech Communication and
Technology (Interspeech-2005 - Eurospeech), Lisbon, Portugal, September
2005, pp. 801-804.
[pdf]
[cited]
[bib]
- C. Busso, S. Hernanz, C. Chu, S. Kwon, S. Lee, P. Georgiou, I. Cohen, and
S. Narayanan, "Smart Room: Participant and speaker localization and identification,"
in International Conference on Acoustics, Speech, and Signal Processing (ICASSP
2005), vol. 2, Philadelphia, PA, USA, March 2005, pp. 1117-1120.
[pdf]
[cited]
[bib]
[poster]
- N. Yoma, C. Busso, J. Inzunza, and F. Huenupan, "Packet-loss modeling
with state duration constraints and VoIP based on perceptual quality maximization,"
in 10th International Conference on Speech and Computer (SPECOM 2005), Patras,
Greece, October 2005, pp. 757-760.
[pdf]
[soon cited]
[bib]
- C. Busso, Z. Deng, S. Yildirim, M. Bulut, C. Lee, A. Kazemzadeh, S. Lee,
U. Neumann, and S. Narayanan, "Analysis of emotion recognition using
facial expressions, speech and multimodal information," in Sixth International
Conference on Multimodal Interfaces ICMI 2004. State College, PA: ACM Press,
October 2004, pp. 205-211.
[pdf]
[cited]
[bib]
[poster]
[slides]
- Z. Deng, C. Busso, S. Narayanan, and U. Neumann, "Audio-based head
motion synthesis for avatar-based telepresence systems," in ACM SIGMM
2004 Workshop on Effective Telepresence (ETP 2004). New York, NY: ACM Press,
October 2004, pp. 24-30.
[pdf]
[cited]
[bib]
- C. Lee, S. Yildirim, M. Bulut, A. Kazemzadeh, C. Busso, Z. Deng, S. Lee,
and S. Narayanan, "Emotion recognition based on phoneme classes,"
in 8th International Conference on Spoken Language Processing (ICSLP 04),
Jeju Island, Korea, October 2004, pp. 889-892.
[pdf]
[cited]
[bib]
- S. Yildirim, M. Bulut, C. Lee, A. Kazemzadeh, C. Busso, Z. Deng, S. Lee,
and S. Narayanan, "An acoustic study of emotions expressed in speech,"
in 8th International Conference on Spoken Language Processing (ICSLP 04),
Jeju Island, Korea, October 2004, pp. 2193-2196.
[pdf]
[cited]
[bib]
- N. B. Yoma, J. Hood, and C. Busso, "An UDP-based real time protocol
for the Internet," in International Telecommunications Symposium (ITS
2002), Natal, Brazil, September 2002.
[pdf]
[soon cited]
[bib]
Abstracts
- S. Yildirim, M. Bulut, C. Busso, C. Lee, A. Kazemzadeh, S. Lee, and S. Narayanan,
"Study of acoustic correlates associate with emotional speech,"
J. Acoust. Soc. Am., vol. 116, p. 2481, 2004.
[pdf]
[bib]
- C. Lee, S. Yildirim, M. Bulut, C. Busso, A. Kazemzadeh, S. Lee, and S. Narayanan,
"Effects of emotion on different phoneme classes," J. Acoust. Soc.
Am., vol. 116, p. 2481, 2004.
[pdf]
[bib]
- M. Bulut, S. Yildirim, S. Lee, C. Lee, C. Busso, A. Kazemzadeh, and S. Narayanan,
"Emotion to emotion speech conversion in phoneme level," J. Acoust.
Soc. Am., vol. 116, p. 2481, 2004.
[pdf]
[bib]
Copyright Notice: This material is presented to ensure timely dissemination
of scholarly and technical work. Copyright and all rights therein are retained
by authors or by other copyright holders. All persons copying this information
are expected to adhere to the terms and constraints invoked by each author's
copyright. In most cases, these works may not be reposted without the explicit
permission of the copyright holder.