Research

Research Areas

Multimedia Signal Processing

Multimedia Signal Processing

Research Goals and Vision

The KAIST School of Electrical Engineering aims to develop technologies for the effective analysis, processing, and application of multimedia data such as audio, video, images, and other forms of multimedia. This research not only builds on the foundations of digital signal processing but also leverages advanced AI models to enhance real-world applications, significantly improving the performance of multimedia systems and generating realistic multimedia data. Innovative research outcomes in the fields of image and speech processing, multimodal signal processing, and biomedical signal processing are applied across various industries, including smart devices, healthcare, and data security.

Key Research Areas

Image, Vision, and 3D Graphics

  • Image and Video Processing : Developing techniques for the processing and analysis of high-resolution image and video data. Major research topics include image compression, noise reduction, quality enhancement, object recognition and tracking, and deep learning-based image analysis.
  • 3D Graphics : Researching 3D modeling and rendering technologies to improve the immersiveness of virtual reality (VR) and augmented reality (AR) systems.
  • Multimedia Large Language Model : Utilizing large language models to enhance the learning capabilities of AI models with multimedia data and improve language-based understanding.

Audio and Speech Processing

  • Speech Signal Processing : Researching signal processing techniques for speech recognition and speech synthesis. This supports applications such as voice assistants, automatic translation systems, and voice-controlled interfaces.
  • Audio Signal Processing : Developing techniques for the analysis and processing of audio signals, including music information retrieval, acoustic event detection, and audio effects generation. This drives advancements in music recommendation systems and sound detection systems.

Multimodal Signal Processing

  • Multimodal Data Fusion : Researching techniques to integrate and process various forms of data such as text, audio, and video. Key research topics include multimodal data fusion, cross-modal retrieval, and multimodal interface development.
  • Multimodal Representation Learning : Developing technologies to learn and connect information across different modalities to enhance data understanding and applications.

Biomedical Signal Processing

  • Biological Signal Analysis : Analyzing various biological signals such as electrocardiograms (ECG) and electroencephalograms (EEG) to develop disease diagnosis and monitoring technologies.
  • Medical Image Processing : Researching the analysis of medical imaging data from MRI, CT scans, etc., for lesion detection, image segmentation, and 3D reconstruction.
  • Wearable Medical Devices : Developing signal processing technologies for real-time monitoring of biological signals using wearable devices.

Recent related activities in Multimedia Signal Processing

See below for specifc ongoing research topics related to Multimedia Signal Processing of KAIST EE.