I’m a second-year M.S. student in GSAI (Graduate School of Artificial Intelligence) at POSTECH Efficient Learning Lab (EffL), advised by Prof. Jaeho Lee. Before joining POSTECH, I completed my B.S. degree in electrical and electronics engineering from Chung-Ang University (CAU).
My research interest lies in multimodal learning, which trains artificial intelligence by using data from various domains (such as image, text, and audio) and uses it to solve multiple tasks. Additionally, my main goal is to design artificial intelligence that assists people in various situations.
In EffL, I’m mainly focusing on data compression. My recent work is text-guided image compression. I worked with Prof. Tae-Hyun Oh on audio captioning, and with Prof. Sangpil Kim on the event-based photometric stereo.
(* means ‘equal contribution’)
Hagyeong Lee*, Minkyu Kim*, Jun-Hyuk Kim, Seungeon Kim, Dokwan Oh, Jaeho Lee, “Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity”, ICML, 2024
Minkyu Kim*, Kim Sung-Bin*, Tae-Hyun Oh, “Prefix tuning for automated audio captioning”, ICASSP (Oral), 2023