Speech & Language Grp
Pictures of Past Gatherings
and Interests
Blk N4, 02c-96, Nanyang Avenue.
School of Computer Science and
Engineering, Nanyang Technological
University, Singapore 639798
Q: how to get to NTU?
Email: aseschng at ntu dot edu dot sg
Tel: +65-6790-6200
Fax: +65-6792-6559
CHNG Eng Siong
I am currently an Associate Professor in the School of
Computer Science and Engineering (SCSE), Nanyang
Technological University (NTU), Singapore.
Prior to joining NTU in 2003, I worked at Knowles
Electronics (2001-2002), Lernout and Hauspie
(1999-2000, Belgium), Institute of Infocomm Research
(1996-1999, I2R, Singapore), and RIKEN
I received both my BEng (Hons) and PhD from Edinburgh
University, U.K., in 1991 and 1996 respectively.
My PhD was supervised by Bernard Mulgrew, Peter
Grant and Chen Sheng.
My area of focus is speech research and signal
processing. To date, I have been Principal Investigator
of several research grants awarded by NTU-Rolls Royce, Mindef, MOE and
AStar, with total funding of over S$5 million, under the Speech and
Language Research Group at SCSE. I have supervised 10 PhD students and 5
Master of Engineering students. My publications include 2 edited books and
over 100 journal/conference papers. What Google and Microsoft say about
I have served as the publication chair for 5 international conferences (Human
Agent Interaction 2016, INTERSPEECH 2014, APSIPA-2010, APSIPA-2011,
ISCSLP-2006), and have been an associate editor for IEICE (special issue 2012),
a reviewer for Speech Communication, EUSIPCO, IEEE Trans. Systems, Man, and
Cybernetics Part B, Journal of Signal Processing Systems, ACM Multimedia
Systems, IEEE Trans. Neural Networks, IEEE Trans. CAS-II, and Signal Process-
I was the recipient of the Tan Chin Tuan fellowship (2007) to visit Tsinghua
University, the JSPS travel grant award (2008) to visit Tokyo Institute of
Technology, and the Merlion Singapore-France research collaboration award
in 2009.
1. Google scholar
2. ResearcherID
3. Microsoft Academic
last updated: May 2017
Speech & Language Research Group
The Speech and Language Research Group in SCSE was founded in 2007 by
Chng Eng Siong and Prof Li Haizhou (now at NUS, Singapore). Our group is
now situated within the MML Lab in SCSE.
The focus of our research group is in the area of speech and language research.
We are currently working in the following areas:
1. Robust large vocabulary continuous speech recognition (LVCSR) and keyword spotting
2. Speech and feature enhancement
3. Speaker identification
4. Voice conversion (morphing)
5. Towards speech understanding: some aspects of NLP such as topic
detection and named entity recognition
1. Prof Xie Lei, Northwestern Polytechnical University, Xi'an, China
2. Dr Ma Bin, AStar I2R - robust ASR
3. Dr Raphael Banchs, AStar, I2R - dialogue ChatBot
Current Staff/Students
Current staff members:
1. Dr Xu Haihua: robust LVCSR, keyword spotting
2. Dr Rao Wei: speaker verification
3. Ho Thi Nga: speech indexing and SUD, part-time MEng.
4. Xu Chenglin: robust LVCSR and far field, part-time PhD.
5. Lim Zhi Hao: speaker verification, part-time PhD.
6. Tian XiaoHai: TTS and voice morphing, part-time PhD.
7. Kyaw Zin Tun: speech indexing and chatBot
8. Ly Vu Thi: speech indexing and chatBot
9. Chong Tze Yuang: Language Model Adaptation, part-time PhD.
The current full-time PhD students of our team are:
1. Phan Van Tung: keyword spotting
2. Khassan Yerbolat: language model adaptation
3. Hou Nana: robust LVCSR for air traffic control speech
The current part-time graduate students of our team are:
Paul Chan Yaozhu: Synthesis of the human singing voice (part-time PhD)
Leow Sujun: Word and Anti-word Discriminative Training for Improving
Large Vocabulary Continuous Speech Recognition (LVCSR) (part-time
Li ZhongWei: Name and Digit Entity Recognition for simultaneous
translation (part-time PhD)
Past PhD Students
Nguyen Duc Hoang Ha, PhD (2017), PhD-Slides, Feature based robust
techniques for speech recognition. Now in Vietnam.
Nguyen Trung Hieu, PhD (2015), Speaker Diarization in Meeting room
domain. Now at AStar.
Do Van Hai, PhD (2015), Acoustic modelling of speech under limited
training data condition. Now at AStar.
Wu Zhizheng, PhD (2015), Spectral Mapping for Voice Conversion. Now
at Apple.
Jonathan Dennis, PhD (2014), PhD Slides. Now in Green Running-Data
Wang Lei, PhD (2013), Audio Pattern Discovery and retrieval. Now at
Tong Rong, PhD (2012), Towards high performance phonotactic features
for spoken language recognition. Now at AStar.
Omid Dehzanghi, PhD (2012), Discriminative Learning for speech
recognition. Now in U Michigan.
Xiao Xiong, PhD (2010), Robust speech features and acoustic models
for speech recognition. PhD QE (2006). Now in Microsoft, Redmond -
since Apr 2017
Wang Jinjun, PhD (2008), Content based sports video analysis and
composition. Now in Xi'an Jiaotong University.
Past MEng Students
1. Nguyen Quy Hy, MEng (2017), Voice conversion using DNN
2. Steven Du, MEng (2015), Robust Front End for Speaker Verification
3. Terrence Ng Wen Zheng, MEng (2014), Sound Event recognition in
home environment, now in AStar.
4. Chen Wenda (2014), Computer Assisted Language Learning, now PhD
student in UIUC.
5. Ben Pham Chau Khoa, MEng (2012), now in Microsoft
6. Eugene Koh, MEng (2009), Speaker Diarization of News Broadcasts and
Meeting Recordings.
Past Staff
1. Xiao Xiong, PhD student 2004-2008, staff 2008-2017, now in Microsoft,
Redmond - since Apr 2017
2. Benjamin Bigot, 2014-2016 - LVCSR for RR
3. Huang Guangpu, Staff 2012-2015 - Articulatory Phonetics Features
4. Lyu Daucheng, 2009-2013 - Code switch LVCSR
5. Zhao Shengkui, 2009-2010 - Microphone array and beamforming.
Past Interns (incomplete)
Gao Shengheng, Undergraduate from University Paul Sabatier (Toulouse
III), Mar-Jun 2016, NLP-topic detection.
Gangeshwar Krishnamurthy, Undergraduate from Bangalore Institute of
Technology, Bengaluru, India, Jan - April 2017, sentence unit detection
Interns from India and SPMS (2017 May-July) : Picture of the student
Note: if you are a past intern and wish to be included in the above list, do
send me your LinkedIn account ID, the dates you were here, and the college
you were attending at the time. It will be great to have you in the list.
Total Funding: > S$5 million
1. Nov 2016 - Nov 2017, Project Acumon, Mindef Project ID: 9016102410, Amount: S$560K, PI
2. Jan 2016, 1 RSS from ATMRI-NTU for PhD student Hou Nana, Robust ASR for very noisy speech in air-traffic
control domain, ATMRI-NTU, PI
3. Jun 2014 - Jun 2017, Project Maison: Robust Speaker Verification And Keyword Spotting, DSO M4061477,
Amount: S$2.2 million, PI.
4. Mar 2014 - Feb 2016, RT1.2: Ontology Based Text Mining from Speech Data, NRF & Rolls Royce Project, Amount:
S$203,371, PI: Kim Jung Jae, Co-PI: Chng Eng Siong
5. Mar 2014 - Feb 2017, RT1.1: Robust Large Vocabulary Continuous Speech Recognition (LVCSR) for far field
recordings, NRF & Rolls Royce, Project ID: M4061299, PI: Chng Eng Siong, Amount: S$527,693 + 1 PhD RSS
(Jan 2015)
6. Apr 2013 - Apr 2015, Development of Linguistic Resources and LVCSR for Southeast Asia Languages on KALDI
platform (DeKALDI), AStar, S$124,800, PI
7. Sep 2011 - Aug 2014, Audio Mining Broadcast News, DSTA: M4060890.683, S$500K, PI
8. Jan 2010 - Dec 2011, Merlion 2009, MultiLing, French Embassy (Singapore), S$30K, PI
9. July 2010 - Jun 2012, Speech Recognition for Code-switch Conversational Speech, Temasek Lab@NTU, ProjectID:
M48680100, Amount: S$200K, PI
10. Mar 2010 - Feb 2012, A pilot study of a Computer Assisted Pronunciation Evaluation (CAPE) system for English
learners in Singapore, MOE Tier 1 ProjectID: M52020089, Amount: S$190K, PI: Chng Eng Siong, Co-PI: Tan
Ying Ying
11. Nov 2009-Oct 2010, Malay Text-to-Speech Synthesis Astar ProjectID: M48020073, Amount: S$90K
12. Nov 2009 - Oct 2011, A Microphone Array with a 3-dimensional configuration for the I2R social robot, AStar
ProjectID: M48020074, Amount: S$200,400
13. Dec 2008, Attachment to Tokyo Institute of Technology, Japan to visit Prof S. Furui Lab for exchange in speech
research, NTU/NUS-JSPS New Scientific Exchange Programme (NSEP). Amount: S$3K, PI
14. Sep 2008 - Mar 2010, Statistical Language Modeling for spoken document retrieval, AStar ProjectID: M48020061,
Amount: S$240K, PI
15. Apr 2008 - Oct 2008, Speech Channel Modelling & Classification, AStar ProjectID: M48020050, Amount: S$108K,
16. Mar 2008 - Feb 2013, Advanced Research in Automatic Speech Recognition, Temasek Lab@NTU, ProjectID:
M48680101, Amount: S$1 million, PI: Li Haizhou, Co-PI: Chng Eng Siong
17. Jan 2008 - Jun 2008, Robust Speech Signal Acquisition and Enhancement, AStar ProjectID: M48020049, Amount:
S$72,000, PI.
18. Dec 2007 - Dec 2009, Micro-eBlock: A scalable microcomputer peripheral system for tertiary level micro-controller
education, MOE & NTU & Renesas, Amount: S$313,572, ProjectID: M20440011, Co-Principal Investigators:
Chng Eng Siong & Tan Su Lim
19. Jun - July 2007, Tan Chin Tuan Fellowship. Attachment to Tsinghua University, Beijing, China, Amount: S$6000
20. Jun 2007 - Jun 2008, Speech Data Collection Technology for Robotic Dialog Application, AStar ProjectID:
M48020040, Amount: S$69,920, PI.
21. July 2006, ROAR 2006 Award, 1 PhD studentship, NTU Award: Approx S$100K, PI.
22. Feb 2007 - Feb 2008, Supplementary Equipment Purchase, Digital Signal Processing for Voice Enhancement
Recognition and Search, NTU Award: ProjectID: M52020057, Project Ref number: RG129/06, Amount:
S$39,440, PI.
23. Mar 2006 - Mar 2007, Development of Speaker turn library for I2R AStar, Project ID: M48020025, Amount:
S$70,000, PI.
24. Sep 2004 - July 2010, Digital Signal Processing for Voice Enhancement, Recognition & Search: AStar G/L Acct:
500360, Amount: S$138,000, PI.
25. Oct 2004 - Mar 2005, Summarization of proceedings in a Smart Meeting Rooms for AStar Thematic Pilot project,
AStar M47020015, SERC grant number: 0421110063, Amount: S$21,000, Chng Eng Siong & Deepu Rajan
26. Jun 2003 - Jun 2004, Collaborative acoustic sensor for a smart meeting room application. NTU, SCE
CE-SUG-01/03, Amount: S$14,600, PI
27. Dec 2003 - Jan 2004, 2 months, Overseas Attachment programme to University of Southampton, Dept of ECE.
Attachment to Prof Sheng Chen, AStar Award, Amount: S$3600, PI
Advice on Writing
This section describes my take on how you can organize your thesis, final year
report, etc.
For PhD students, my suggestion is that the structure is the most important
aspect, and you should aim to get that right first. E.g., you can write a series
of questions to drive the report, then prepare PowerPoint slides (which provide
more content) that answer those questions, and
finally write the report. You can have a look at
1. Chong’s question and power point slide as an example.
2. Khassan’s writing and his question.
I have other tips on thesis writing: Word document (last updated: March
2016) and the mindmap
Undergraduate FYP Writing Advice/Example: An example here
Tools for Writing
Please use LaTeX for your writing - it will help a lot! On Windows you have
MiKTeX. You can get a host of templates here. My editor of choice is TeXstudio
- but ask Quora for the latest answer.
Please see what Jonathan Dennis used to write his thesis: tips on the tools he used
for thesis writing.
You can start by using Xiao Xiong’s LaTeX PhD thesis template.
Writing Advice from Others
1. Professor Simon Peyton Jones (Cambridge) - How to Write a Great
Research Paper (YouTube)
2. Judy Swan (Princeton) - Scientific Writing (YouTube)
3. Kristin Sainani - Writing in the Sciences (YouTube)
4. Steve Easterbrook - How Theses Get Written.
Taught Courses
1. CPE3007 - Digital Signal Processing (since 2013), CPE414/ES6105 DSP
2. CPE3006 - Digital Communications (since Jan 2016)
3. CPE206 - Micro-controller Systems Design (2004-2009)