Project Index

Research Projects

A focused view of my work in computer vision, multimodal understanding, and segmentation, with an emphasis on models that connect visual structure, language, and physically grounded perception.

Google Scholar GitHub

3 Featured works

2 WACV papers

3 Segmentation systems

WACV 2026

Transparent Object Segmentation Transformer

Power of Boundary and Reflection: Semantic Transparent Object Segmentation using Pyramid Vision Transformer with Transparent Cues

Tuan-Anh Vu, Nguyen Truong Hai, Zheng Ziqiang, Binh-Son Hua, Qing Guo, Ivor Tsang, Sai-Kit Yeung

TransCues introduces an efficient transformer-based segmentation architecture capable of handling transparent, reflective, and general objects. By proposing Boundary Feature Enhancement (BFE) and Reflection Feature Enhancement (RFE), we enable the model to better capture subtle details in both glass and non-glass regions, resulting in more accurate and robust segmentation.

Segmentation Transformer Transparent Objects

Project

WACV 2025

Referring Expression Segmentation Vision-Language

Vision-Aware Text Features in Referring Expression Segmentation: From Object Understanding to Context Understanding

Hai Nguyen-Truong, E-Ro Nguyen, Tuan-Anh Vu, Minh-Triet Tran, Binh-Son Hua, Sai-Kit Yeung

VATEX is a novel method for referring image segmentation that leverages vision-aware text features to improve text understanding. By decomposing language cues into object and context understanding, the model can better localize objects and interpret complex sentences, leading to significant performance gains.

Segmentation Referring Expression Multimodal

Project Paper

ISBI 2022

Medical Image Segmentation CNN + Transformer

SegTransVAE: Hybrid CNN - Transformer with Regularization for medical image segmentation

Hai Nguyen-Truong, Quan-Dung Pham, Nam Nguyen Phuong, Khoa NA Nguyen, Chanh DT Nguyen, Trung Bui, Steven QH Truong

SegTransVAE is the first work exploiting the hybrid architecture between CNN, Transformers with the Variational Autoencoder (VAE) branch to the network to reconstruct the input images jointly with segmentation.

Medical Imaging Segmentation Transformer VAE

Code Paper

No projects match the current search.