I am an AI Researcher at AITRICS, where I specialize in multimodal learning and generative AI. My work focuses on developing advanced artificial intelligence models that integrate various data domains (such as images, text, and audio) to solve real-world challenges. My main goal remains to design AI that assists people in various situations.
Prior to joining AITRICS, I received my M.S. degree in Artificial Intelligence from POSTECH in August 2025. During my time there, I was a member of the POSTECH Efficient Learning Lab (EffL) advised by Prof. Jaeho Lee. I mainly focused on data compression, and my final project was on text-guided image compression. My previous research experience also includes collaborations with Prof. Tae-Hyun Oh on audio captioning and with Prof. Sangpil Kim on event-based photometric stereo.
I earned my B.S. in Electrical and Electronics Engineering from Chung-Ang University (CAU).
(* means ‘equal contribution’)
Hagyeong Lee*, Minkyu Kim*, Jun-Hyuk Kim, Seungeon Kim, Dokwan Oh, Jaeho Lee, “Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity”, ICML, 2024
Minkyu Kim*, Kim Sung-Bin*, Tae-Hyun Oh, “Prefix tuning for automated audio captioning”, ICASSP (Oral), 2023