Photo of Tran Quang Chung

Hi, I am Chung Tran

Chung Tran received his B.Sc. in 2019 and M.Sc. in 2021 in Computer Science from the School of Information and Communication Technology, Hanoi University of Science and Technology (HUST). In 2022, he was awarded a prestigious scholarship by the Japanese Ministry of Education, Culture, Sports, Science, and Technology (MEXT). He began his Ph.D. journey at the Japan Advanced Institute of Science and Technology (JAIST), where he studied for 1.5 years before transferring to the Nara Institute of Science and Technology (NAIST) to complete his doctoral thesis. His research interests focus on the fields of speech processing, speech synthesis, and speech recognition.

More about his Lab: Click here

Education Background

Publications

Journal Papers

  1. Chung Tran, Luong, C. M., & Sakti (2025). Zero-Shot Cross-Lingual Text-to-Speech With Style-Enhanced Normalization and Auditory Feedback Training Mechanism. IEEE/ACM Transactions on Audio, Speech, and Language Processing. DOI New
  2. Na, I. S., Chung Tran, Nguyen, D., & Dinh, S. (2020). Facial UV map completion for pose-invariant face recognition: A novel adversarial approach based on coupled attention residual UNets. Human-centric Computing and Information Sciences. DOI
  3. Ngoc, P. P., Quang, Chung Tran, & Chi, M. L. (n.d.). Improving few-shot multi-speaker text-to-speech adaptive-based with extracting mel-vector (EMV) for Vietnamese. International Journal of Asian Language Processing.
  4. Ngoc, P. P., Quang, Chung Tran, & Chi, M. L. (2023). ADAPT-TTS: High-quality zero-shot multi-speaker text-to-speech adaptive-based for Vietnamese. Journal of Computer Science and Cybernetics, 39(2), 159–173.

Conference Papers

  1. Chung Tran, Sakriani Sakti: From Pixels to Voice: A Simple and Efficient End-to-End Spoken Image Description Approach via Vision Codec Language Models ICASSP 2025. Demo New
  2. Ahmad Alfani Handoyo, Chung Tran, Dessi Puji Lestari, Sakriani Sakti: Indonesian-English Code-Switching Speech Synthesizer Utilizing Multilingual STEN-TTS and BERT LID O-COCOSDA 2024.
  3. Chung Tran, Luong, C. M., & Sakti, S. (2024): Maintaining Personal Styles in Multilingual TTS with STEN Approach in Diffusion Framework. ASJ 2024.
  4. Chung Tran, Luong, C. M., & Sakti, S. (2023): STEN-TTS: Improving Zero-shot Cross-Lingual Transfer for Multi-Lingual TTS with Style-Enhanced Normalization Diffusion Framework. Proc. INTERSPEECH 2023, 4464–4468. DOI
  5. Phuong Pham Ngoc, Chung Tran, & Mai Luong Chi. (2022): Improving a few-shot multi-speaker Text-To-Speech adaptive-based with Extracting Mel-Vector (EMV) for Vietnamese. The 25th Conference of the O-COCOSDA.
  6. Chung Tran, Quang Minh Nguyen, Phuong Pham Ngoc, & Quoc Truong Do. (2021): Improving Speaker Verification in Noisy Environment Using DNN Classifier. The 15th IEEE-RIVF International Conference on Computing and Communication Technologies, 1–6.
  7. Phuong Pham Ngoc, Chung Tran, Truong Do Quoc, & Mai Luong Chi. (2021): A study on neural-network-based Text-to-Speech adaptation techniques for Vietnamese. The 24th Conference of the Oriental COCOSDA.
  8. Ngoc Phuong Pham, Chung Tran, Nguyen Quang Minh, & Do Quoc Truong. (2020): Improving prosodic phrasing of Vietnamese text-to-speech systems. Proceedings of the 7th International Workshop on Vietnamese Language and Speech Processing, 19–23.
  9. Chung Tran, Huyen, H. C., & Sang, D. V. (2020): A Novel Generative Model to Synthesize Face Images for Pose-invariant Face Recognition. 2020 International Conference on Multimedia Analysis and Pattern Recognition (MAPR), 1–6.
  10. Sang, D. V., Chung Tran, Dung, N. D., & Na, I. S. (2020): Attention ResCUNet-GAN: A Novel Facial UV Map Completion for Pose-invariant Face Recognition. HCIS Workshop 2020.

Awards

Contact

Email (Personal): bktranquangchung@gmail.com

Email (University): tran.quang_chung.tq9@naist.ac.jp

Address: Takayama-cho, Ikoma-City, Nara 630-0101, Japan

Phone: +81 80 3570 3887

GitHub: github.com/tranquangchung

LinkedIn: linkedin.com/in/chungtq