Hi! there๐๐ป I am passionate about advancing artificial intelligence to bridge the gap between modalities, enabling machines to perceive, understand, and interact with the world more intuitively.
Recently, I have been exploring Multi-modal Hallucination in Vision-Language Models, focusing on contrastive decoding improvements and enhanced visual perception to reduce uncertainty in real-world applications.
๐ฅ Key Areas :
Multi-modal Learning
LLM
Computer Vision
Vison-Language Models
โ๏ธย E-mail : [email protected] ๐ย Tel : +82-10-9041-2834 ๐ Site : https://pej0918.github.io/
$\color{#43515c}\rule{361px}{1.5px}$
ETRI
ํ๊ตญ์ ์ํต์ ์ฐ๊ตฌ์
Language Intelligence Lab
<aside> ๐ก 2024.07 ~ 2024.08 (2๊ฐ์) | ๋๊ณ ์ฐ๊ตฌ์ฐ์์
</aside>
$\color{#43515c}\rule{361px}{1.5px}$
๐ผ๏ธ + ๐
Robust Audio-Visual Classification End-to-End Framework under Uncertain Missing Modality using Prompt Learning
2024.11 - 2025.01ย | [code]
๐ผ๏ธ + ๐
Multi-modal Template-Based Learning for Few-Shot Visual Grounding
2024.07 - 2024.10ย | [code]
Question-Aware Prompting and Multi-layer Co-Attention for Enhanced Knowledge-Based Visual Question Answering
2024.05 - 2024.08ย | [code]