Kyeongha Rho

About me

I am a second-year Ph.D. student at KAIST, where I also received my M.S. degree, under the supervision of Professor Joon Son Chung. My research focuses on building efficient and reliable multimodal AI systems that can perceive, reason, and generate across vision, audio, and language.

My recent work explores both multimodal representation learning and the efficiency of multimodal models, from token compression for omni-modal LLMs and parameter-/memory-efficient audio-visual learning to multimodal understanding and generation tasks such as joint audio-video generation, audio-visual captioning, active speaker detection, and talking-face generation. I am particularly interested in bridging fundamental multimodal learning with practical AI applications, including real-time multimodal assistants, video understanding, human-AI interaction, and generative content creation.

Education

Korea Advanced Institute of Science and Technology (KAIST), Republic of Korea

Sep. 2024 - Present

Ph.D. in Electrical Engineering (Advisor: Joon Son Chung)

Korea Advanced Institute of Science and Technology (KAIST), Republic of Korea

Sep. 2022 - Aug. 2024

M.S. in Electrical Engineering (Advisor: Joon Son Chung)

Korea Advanced Institute of Science and Technology (KAIST), Republic of Korea

Mar. 2015 - Feb. 2019

B.S. in Electrical Engineering (Magna Cum Laude)

Work Experience

Naver Webtoon, Republic of Korea

Jul. 2022 - Aug. 2022

AI Research Intern (Advisor: Seungkwon Kim)

Agency for Defense Development (ADD), Republic of Korea

Jun. 2019 - May. 2022

Research Officer (Military Service; Discharged as First Lieutenant, Republic of Korea Army. Advisor: Youngjung Kim)

Data Intelligence Lab in KAIST, Republic of Korea

Mar. 2018 - Dec. 2018

Research Assistant (Advisor: Steven Euijong Whang)

Publications

Keep What Audio Cannot Say: Context-Preserving Token Pruning for Omni-LLMs
Chaeyong Jung^*, Kyeongha Rho^*, Joon Son Chung
Preprint
Paper

MoLT: Mixture of Layer-Wise Tokens for Efficient Audio-Visual Learning
Kyeongha Rho^*, Hyeongkeun Lee^*, Jae Won Cho, Joon Son Chung
Preprint
Paper

Inference-Time Scaling for Joint Audio–Video Generation
Jaemin Jung, Kyeongha Rho, Inkyu Shin, Joon Son Chung
TMLR 2026
Paper Project Page Code

LAVCap: LLM-based Audio-Visual Captioning using Optimal Transport
Kyeongha Rho^*, Hyeongkeun Lee^*, Valentio Iverson, Joon Son Chung
ICASSP 2025
Paper Code

EquiAV: Leveraging Equivariance for Audio-Visual Contrastive Learning
Jongsuk Kim^*, Hyeongkeun Lee^*, Kyeongha Rho^*, Junmo Kim, Joon Son Chung
ICML 2024
Paper Code

TalkNCE: Improving Active Speaker Detection with Talk-Aware Contrastive Learning
Chaeyoung Jung^*, Suyeon Lee^*, Kihyun Nam, Kyeongha Rho, You Jin Kim, Youngjoon Jang, Joon Son Chung
ICASSP 2024
Paper Code

That's What I Said: Fully-Controllable Talking Face Generation
Youngjoon Jang^*, Kyeongha Rho^*, Jongbin Woo, Hyeongkeun Lee, Jihwan Park, Youshin Lim, Byeong-Yeol Kim, Joon Son Chung
ACMMM 2023
Paper Project Page

Guideformer: Transformers for image guided depth completion
Kyeongha Rho^*, Jinsung Ha^*, Youngjung Kim
CVPR 2022
Paper

Action-driven contrastive representation for reinforcement learning
Minbeom Kim, Kyeongha Rho, Yong-duk Kim, Kyomin Jung
Plos one (IF=3.7) 2022
Paper

About me

Education

Korea Advanced Institute of Science and Technology (KAIST), Republic of Korea

Sep. 2024 - Present

Korea Advanced Institute of Science and Technology (KAIST), Republic of Korea

Sep. 2022 - Aug. 2024

Korea Advanced Institute of Science and Technology (KAIST), Republic of Korea

Mar. 2015 - Feb. 2019

Work Experience

Naver Webtoon, Republic of Korea

Jul. 2022 - Aug. 2022

Agency for Defense Development (ADD), Republic of Korea

Jun. 2019 - May. 2022

Data Intelligence Lab in KAIST, Republic of Korea

Mar. 2018 - Dec. 2018

Publications

Honors and Awards

Presidential Science Scholarship

2025

Outstanding Teaching Assistant Award

2024

Kwon Young-Se Scholarship

2022

4th Place, NTIRE 2020 Challenge on Spectral Reconstruction from an RGB Image (CVPR 2020 Workshop)

4th Place, NTIRE 2020 Challenge on Real Image Denoising (CVPR 2020 Workshop)

2020

Contact