우리 학부 휴먼멀티모달연구팀(지도교수 노용만)이 저명한 국제학술지인 IEEE Consumer Technology Society 뉴스 5월호 Featured People에 소개 되었습니다.
휴먼모달 연구팀은 지난 1년간 NeurIPS, AAAI, CVPR, ICCV 등 AI Top tier conference 및 IEEE 저널에 논문들을 다수 발표하였습니다.
게재 내용은 https://ctsoc.ieee.org/images/CTSOC-NCT-2022-05-FP.pdf 에서 확인 가능합니다.
주요연구실적(논문)
- “Distinguishing Homophenes using Multi-head Visual-audio Memory for Lip Reading.” Minsu Kim, Jeong Hun Yeo, and Yong Man Ro. AAAI. 2022.
- “SyncTalkFace: Talking Face Generation with Precise Lip-syncing via Audio-Lip Memory.” Se Jin Park, Minsu Kim, Joanna Hong, Jeongsoo Choi, and Yong Man Ro. AAAI. 2022.
- “Lip to Speech Synthesis with Visual Context Attentional GAN.” Minsu Kim, Joanna Hong, and Yong Man Ro. NeurIPS (2021).
- “Multi-modality associative bridging through memory: Speech sound recollected from face video.” Minsu Kim*, Joanna Hong*, Se Jin Park, and Yong Man Ro. ICCV. 2021.
- Video prediction recalling long-term motion context via memory alignment learning, S Lee, HG Kim, DH Choi, HI Kim, YM Ro, CVPR 2021.
- “Speech Reconstruction with Reminiscent Sound Via Visual Voice Memory.” Joanna Hong, Minsu Kim, Se Jin Park, Yong Man Ro. IEEE Transactions on Audio, Speech, and Language Processing 29 (2021)
- “Cromm-vsr: Cross-modal memory augmented visual speech recognition.” Minsu Kim, Joanna Hong, Sejin Park, Yong Man Ro. IEEE Transactions on Multimedia (2021).