Section 2.4 Current PhD and Graduate Students
¶-
Chao Yi-Wen
, Paralinguistic Enhanced LLM for Speech Input (reg Aug 2024, MEng, Supervisor: Chng Eng Siong) -
Wu Yihao
, Robust online speaker diarization under far-talk noisy conditions (reg Aug 2024, MEng, Supervisor: Chng Eng Siong) -
Liu Songting
, Enhancing Speech Generation with Multi-Modal Prompts and Contextual Information (reg Aug 2024, MEng, Supervisor: Chng Eng Siong) -
Yeo Yue Heng
, LLM for speech recognition (reg Jan 2024, Supervisor: Chng Eng Siong, AStar Co-Sup: Tran Huy Dat), AStar Scholar -
Li Haoyang
, Neural Text to Speech (reg Aug 2023, Alibaba-PhD) -
Tuan Truong Duc
, Robust Speaker verification under noisy and short duration scenario (reg Jan 2023, PhD) -
Fabian Ritter Gutierrez
, End2End ASR for Language Learning (reg Aug 2022, AStar scholar) (co-supervised with AStar: Nancy Chen) -
Nikita Kuzmin
, Disentanglement for Speaker verification and privacy (reg Aug 2022, AStar scholar) (co-supervised with AStar: Lee Kong Aik) -
Kwok Chin Yuen
, Acoustic modelling of targeted domain speech (Children's speech acoustic modelling) (reg Aug 2021, MEng, converted to PhD program, Aug 2022) -
Hu Yuchen 胡宇晨
, Robust End-to-End ASR (reg Aug 2021, MEng, converted to PhD program, Aug 2022). QE Slides (2023) -
Yip Jia Qi
, Neural Networks for Speaker Extraction and its interdisciplinary applications (reg Aug 2021, Alibaba scholar PhD) -
Ng Dian Wen
, Domain adaptation for End-to-End ASR (reg Jan 2021, Alibaba scholar PhD) -
Chen Chen
, End-to-End ASR (reg Jan 2021, PhD)
Subsection 2.4.1 Current Co-Sup: PhD Students
¶-
Ashutosh Anshul
, Multi-modal Deep Fake classification and detection (reg Jan 2024, Supervisor: Deepu Rajan, Co-Sup: Chng Eng Siong) -
Shreyas Gopas
, Speech Recognition using LLM for under-resourced languages (reg Jan 2024, Supervisor: Quek Hiok Chai, Co-Sup: Chng Eng Siong) -
Rae Koh Jia Xin
, Singapore English (reg Aug 2019, PhD) (Sup: Tan Ying Ying (HSS), Co-Sup: Chng Eng Siong) -
Zou HeQing
, Multimodal Machine Learning (reg Jan 2021, PhD) (Sup: Deepu Rajan, Co-Sup: Chng Eng Siong)
Subsection 2.4.2 Current MEng and Masters Program Students
¶-
Ni Yunyi
(2024 Aug~), Masters Data Science, TTS speaker characterisation via NLP text -
Chen Yanru
(2024 Jan~), Masters Data Science, Classification of Depression Syndrome by Deep Learning -
Qin Xiaokai
(2023 Aug~), MSAI student, Deepfake audio generation using voice conversion
Subsection 2.4.3 Collaborating Graduate Students (China)
¶Every year, we host graduate students from China. We have hosted students from the China Scholarship Council (CSC) program, Peking University, Xinjiang University, Hunan University, Tianjin University and Northwestern Polytechnical University. The visits have been very rewarding, and many publications have come out of them. We hope to see more such outstanding students, so do apply!
-
Yang Yuhang
(2023 June~), PhD student (Hunan University, China), LLM ASR -
Zhang Xueyi
(2024 Aug~), NUDT, Changsha, CSC Scholar, Robust online Diarization -
Zheng Haorui
(2024 June~, AISG remote), Masters student (Peking University, China), Speaker Diarization with Speech Separation -
Zhang Xiangyu
(2023 June~), PhD student (UNSW, Australia), Depression classification -
Le Yuquan
(2023 Oct~ Oct 2024, CSC visitor), PhD student (Hunan University, China), LLM for Legal -
Luo Juan
(2023 Oct~ Oct 2024, CSC visitor), PhD student (Hunan University, China), Audio event detection and classification
Subsection 2.4.4 Past PhD Students
¶-
Andrew Koh Jin Jie
, Sequence to Sequence Machine Learning (reg Aug 2019, PhD, submitted Thesis) -
Zhao Yingzhu
, End-to-End speech recognition (reg Jan 2019, PhD, graduated May 2023). Oral Defence rehearsal, PhD Slides, PhD report, PhD latex folder -
Hou Nana
, Robust LVCSR for air traffic control speech (reg Jan 2017, PhD, submitted Jan 2022) -
Xu Chenglin
, PhD Slides, PhD Thesis (2020), Single Channel Multi-talker Speech Separation with Deep Learning -
Paul Chan
, Synthesis of the human singing voice (2020) -
Khassan Yerbolat
, PhD Slides, Online Presentation (April 2020) and final PhD thesis (2020), Language Model Domain Adaptation for Automatic Speech Recognition Systems. -
Pham Van Tung
,PhD Thesis
(2019) Robust Spoken Term Detection using partial search and re-scoring hypothesized detections techniques. Now in NTU. -
Tian Xiaohai
,PhD Thesis
(2019) Voice Conversion with Parallel/Non-Parallel Data and Synthetic Speech Detection. Now in NUS. -
Chong Tze Yuang
,PhD Thesis
,Slides
,Thesis organization
, (2018) Exploiting Long Context Using Joint Distance and Occurrence Information for Language Modeling. -
Nguyen Duc Hoang Ha
, PhD Thesis (2017), Slides, Feature based robust techniques for speech recognition. -
Nguyen Trung Hieu
,PhD Thesis
(2015). Speaker Diarization in Meeting room domain. Now at Alibaba. -
Do Van Hai
,PhD Thesis
(2015). Acoustic modelling of speech under limited training data condition. Now in Vietnam Telecoms. -
Wu Zhizheng
,PhD Thesis
(2015). Spectral Mapping for Voice Conversion. -
Jonathan Dennis
,PhD Thesis
(2014). Slides
, Sound Event Recognition in Unstructured Environments using Spectrogram Image Processing. -
Wang Lei
, (2013). Audio Pattern Discovery and retrieval. -
Tong Rong
,PhD Thesis
(2012). Towards high performance phonotactic features for spoken language recognition. Now at Alibaba. -
Omid Dehzanghi
, (2012). Discriminative Learning for speech recognition, U of Michigan -
Xiao Xiong
, PhD Thesis (2009): Robust speech features and acoustic models for speech recognition. QE (2006): Speech Enhancement with Applications in speech recognition. Now in Microsoft, US since Apr 2017. -
Wang Jinjun
, PhD (2008), Content based sports video analysis and composition. Now in Xian Jiaotong.
Subsection 2.4.5 Past MEng Students
¶-
Tanmay Surana
,Deep Learning-based Text Augmentation for Named Entity Recognition
(reg Aug 2021, MEng, completed Oct 2023) -
Prachaseree Chaiyasait
, Adaptation of Language Models via Text Augmentation (reg Aug 2021, MEng, submitted Jul 2023, completed Oct 2023) -
Kyaw Zin Tun
, Named entity recognition for chatbot applications (MEng, started Aug 2020, submitted thesis Aug 2022) -
Xue Fuzhao
, Information extraction from text (MEng 2020) -
Lim Zhi Hao
, (MEng 2020), Anti-Spoofing Techniques for Robust Speaker Verification, Thesis (2020)
-
Ho Thi Nga
, (MEng 2019), Sentence unit detection for automatic speech transcripts using lexical information -
Leow Sujun
, (MEng 2018), Image Processing Technique for Speech Signal Processing -
Nguyen Quy Hy
, (MEng 2017), Voice conversion using DNN -
Steven Du
, (MEng 2015), Robust Front End for Speaker Verification -
Terrence Ng Wen Zheng
,Thesis
,(MEng 2014), Sound Event recognition in home environment -
Chen Wenda
, (MEng 2014), Computer Assisted Language Learning -
Ben Pham Chau Khoa
,Thesis
,(MEng 2012), Robust VAD -
Eugene Koh
, (MEng 2009), Speaker Diarization
Subsection 2.4.6 Past MSAI Students and Other collaboration (graduate) students
¶-
Bo Han
(2023 Oct ~Oct 2024, CSC visitor), PhD student (Zhejiang Uni, China), Deep Fake TTS audio generation -
Zhang Zizheng
(2023 June~ Jun 2024, AISG remote), Masters student (Peking University, China), Speech Separation, github
-
Azmat Adnan
(Reg 2023 Aug, completed Aug 2024), MSAI student, DNN Approaches for noisy speech diarization -
Zhuo Ning
(Reg 2023 Aug, completed Aug 2024), Masters Cybersecurity student, Deep Fake Corpus development and detection -
Chen Weiguang
(2022 June~), PhD student (Hunan University), Diarization using multi-channel approaches -
Yao Jixun
(2023 Oct~ Oct 2024, AISG NUS visitor), PhD student (Northwestern Poly, China), Speech Enhancement and TTS -
Jiang Yufei
(2022 Aug~ 2023 Aug), MSAI (NTU), Adopting Neural Translation Model in Data Generation for Inverse Text Normalization -
Liu Jiaxing
(2021 Oct ~2023 Oct, CSC visitor, PhD student Tianjin University, China), Multi-modal emotion recognition -
Cheng Qi
(2021 Oct ~2022 Oct, CSC visitor, PhD student Harbin Engineering Uni, China), Graph Neural Network for Lattice rescoring -
Samuel Samsudin Ng
, (MSAI 2020-S1), Speech emotion recognition with AlexNet and Fully convolutional network, Sam's MSAI Thesis
, github repository
, kaggle iEmoCap
-
Cheung Chin Ka
, (MSAI 2020-S1), Acoustic Scene Classification with cutting edge hyperparameter tuning tool, Andy's MSAI Thesis
-
Liu Bozhong
, (MSAI 2020-S2), Wakeup keyword detection for far-field microphone array using end to end framework
Subsection 2.4.7 Past MSAI Students and Other collaboration students
¶-
Zhao Yang
(2021 Oct ~2023 Oct, CSC visitor, PhD student Xian JiaoTong Uni, China), Semi/Self supervised representation for speech recognition -
Peng Yizhou
(2020 June ~2022 June, Masters student Xinjiang, China), ASR development (Kaldi and End2End) -
Yang Yuhang
(2021 June~2023 June, Masters student Xinjiang, China), WeNet ASR, End2End ASR -
Guo Yachao
(2021 June~2023 June, Masters student Xinjiang, China), End2End Hotword LM Adaptation