Xuedong Huang
Xuedong Huang | |
---|---|
Born |
Hunan, China | October 20, 1962
Citizenship | American (since 1995) |
Fields |
Speech Recognition VOIP Natural Language Processing Software Development |
Institutions |
Microsoft Carnegie Mellon University |
Alma mater |
Hunan University Tsinghua University Edinburgh University |
Doctoral advisor | Mervyn Jack |
Doctoral students |
Mei-Yuh Hwang Roni Rosenfeld |
Notable awards |
Wired Magazine Next List 2016 2011 Asian American Engineer of the Year IEEE 1993 Paper Award Allen Newell Research Excellence Medal |
Xuedong Huang (Simplified Chinese: 黄学东, b. October 20, 1962) is a Chinese-American computer scientist and the key person behind Microsoft's spoken language processing technologies. He is a Distinguished Engineer and company's Chief Speech Scientist. Wired (magazine) named him one of 25 Geniuses in Next List 2016. [1]
Background
Huang grew up in Hunan, China and became a US citizen in 1995. He received his B.S. degree in computer science from Hunan University in 1982, his MS in computer science from Tsinghua University in 1984, and his PhD in Electrical Engineering from the University of Edinburgh in 1989.
Career
Academic research
He joined Carnegie Mellon University in 1989 and worked with Raj Reddy and Kai-Fu Lee on speech recognition. At CMU, Huang directed Sphinx-II speech system research and had the best overall performance in every category of DARPA's 1992 benchmarking. He received the Allen Newell research excellence medal for his leadership in speech recognition in 1992, and IEEE Speech Processing Best Paper Award in 1993. He became an IEEE Fellow in 2000.
Huang has co-authored over 100 papers and two books: Hidden Markov Models for Speech Recognition, (1987) and Spoken Language Processing, Prentice Hall](2000). In 2014 he coauthored a historical speech recognition review with Raj Reddy and James K. Baker for Communications of the ACM that reflected several generations of speech research. [2]
Huang is the Honorary Dean and Professor of School of Software Engineering at his alma mater Hunan University.
Microsoft
Huang has spent his career helping to advance spoken language and web search technologies in a variety of capacities. He is best known for founding and leading Microsoft's speech recognition initiatives. He is also known for his pioneering work on Microsoft's multimodal interactive MiPad prototype [3] as Bill Gates demonstrated at the Consumer Electronics Show in his keynote speech in 2001.
Huang was instrumental in introducing Microsoft's Speech Application Programming Interface (SAPI) and speech recognition/TTS technologies to the public. From 2000 to 2004, Huang served as the general manager of Microsoft's Speech Platforms Group and shipped Microsoft Speech Server and other voice technologies used in Microsoft Windows, Microsoft Office, Windows Mobile and Microsoft Exchange Server. Microsoft Response Point received 2009's Technology of the Year Awards as the best VOIP phone system from the InfoWorld Magazine.[4] From 2009 to 2014, he served as the Architect for Bing and worked on massive scale machine learning on intent understanding, mobile, and ranking that enabled Bing's significantly improved search quality.
He is currently leading the company's advanced technology group. His team helped to ship Bing Torque, CNTK, and Microsoft Cognitive Services. The Speech and Dialog research group continues to advance core speech technologies used in Microsoft's products such as Microsoft Cortana and Skype Translator.
TV and books
- Robert MacNeil, William Cran, Robert McCrum (2005). Do You Speak American? page 191-197, Harcourt Trade
- PBS TV: Do You Speak American? 2005
- Xuedong Huang, Alex Acero, Hsiao-Wuen Hon (2001). Spoken Language Processing: a guide to theory, algorithm, and system development, page 1-980. Prentice Hall
- Xuedong D Huang, Yasuo Ariki, Mervyn A Jack (1990). Hidden Markov Models for Speech Recognition, Edinburgh University Press
References
- ↑ http://www.wired.com/2016/04/wired-nextlist-2016/
- ↑ http://cacm.acm.org/magazines/2014/1/170863-a-historical-perspective-of-speech-recognition/fulltext
- ↑ http://www.microsoft.com/en-us/news/features/2000/05-22mipad.aspx
- ↑ "Archived copy". Archived from the original on February 20, 2009. Retrieved February 9, 2009.
External links
- Thoughts on Microsoft's Response Point, "ZDNet", Sept. 24, 2008
- Microsoft's Xuedong Huang Delivers Keynote at ITEXPO West, Emphasizes Telephony Ease-of-Use and Describes Future Natural User Interface, Sept. 17, 2008
- Microsoft Bolsters Its VoIP Strategy With Response Point Service Pack, "Information Week", March 24, 2008
- Innovation Spotlight: Q&A With Xuedong Huang On Microsoft Response Point 2007
- Microsoft Launches Breakthrough PBX for SMBs, "EWEEK", March, 2007
- Huang appointed as Hunan University's Honorary Dean of Software School "Hunan University News", Nov 5, 2005 (Chinese)
- Microsoft Gets Futuristic With Speech Server 2004 March, 2004
- Microsoft Peppers the Market with Integrated Voice Solutions March, 2004
- Why it's getting easier to talk to your PC "ZDNET", Nov 14, 2002
- The Year Ahead for Microsoft, Microsoft PressPass, Dec 18, 2001
- Speech Recognition in Microsoft Office XP Increases Productivity, Efficiency for Chinese and Japanese Users, Microsoft PressPass, Aug. 16, 2001
- MiPad: Speech Powered Prototype Listens and Learns Microsoft PressPass, May 22, 2000
- Gates demos Microsoft MiPad "ZDNET", Oct 13, 2000
- Microsoft's speech guru Xuedong Huang says that a "conversational interface" will make computing dramatically easier early next century. Newsweek, March 9, 1998
- BEYOND TALKING BARNEY BusinessWeek, Feb 23, 1998
- Talking Face To Interface. Seattle Times, June 1, 1997