Computer Vision

Introduction

Vision is one of our most important senses: it lets people interpret their surroundings, and a commonly cited estimate is that about 80% of what we perceive comes through sight. Put simply, Computer Vision is the sub-discipline of Artificial Intelligence that teaches machines to “see like a human”. More precisely, it combines dedicated hardware and/or software algorithms that give computers the ability to capture, process and interpret images, videos or other signals acquired by cameras or other sensors.

Researchers began working on Computer Vision in the 1960s, and the field has progressed steadily since. In the 2010s, Deep Learning, a branch of Machine Learning, revolutionized Computer Vision. Among other breakthroughs, Deep Learning-based algorithms surpassed humans at recognizing faces in 2014. Since then, Computer Vision has been one of the hottest topics in the broad field of Artificial Intelligence. It is now applied in many aspects of daily life: medicine, manufacturing, biometrics, autonomous vehicles, digitization of paper documents and books for electronic access, military and law enforcement, household waste recycling, environmental applications using aerial or satellite images, and more.

Our research group focuses on the design and development of fast, lightweight and effective algorithms for analyzing and understanding different types of images and videos: natural images and videos (taken with regular cameras), medical images, remote sensing images and document images. See the slides here for more detail.

Contact: Dr. Nguyen Thi Oanh, Email: oanhnt@soict.hust.edu.vn

Research Directions

We are especially interested in the tasks of object detection, classification, semantic segmentation and tracking.

Keywords for our research directions include:

Multimodality
- Spatio-temporal information
- Raw data (or text) associated with the images
Domain adaptation
- Transferring the model learned from one set of images to a different set of images
Limited-resource constraints (linked to embedded systems)
- Definition of lightweight models
User interaction

The methods we use include both traditional Image Processing methods and Machine Learning methods, especially Deep Learning (often with Convolutional Neural Networks and Recurrent Neural Networks).

Research Problems

Our research problems include, but are not limited to:

Medical imaging:
- Segmenting colon polyps and identifying lesions at high risk of malignancy (cancer) during endoscopy
- Detecting brain degeneration for Alzheimer’s patients from 3D MRI images and clinical data
Traffic monitoring and autonomous vehicles:
- Vehicle and pedestrian tracking in videos, including embedding the proposed algorithms in edge devices
- Semantic segmentation for intelligent vehicles

Remote sensing – satellite image processing and analysis:
- Adjusting Geostationary (GEO) satellite images with Low-Earth-Orbit (LEO) images
- Study of Urban Heat Islands and their impact on the environment and humans

Gesture recognition from videos:
- Human Action Recognition
- Hand Gesture Recognition

Document analysis and understanding:
- Incremental multimodal classification from streams of documents
- Understanding ancient Vietnamese text (Han-Nom characters)

Biometric access control: face verification and anti-spoofing

Team Members

Assoc. Prof. Muriel VISANI
Team Leader

Dr. Dinh Viet Sang
Member

Dr. Nguyen Thi Oanh
Member

Dr. Tran Nguyen Ngoc
Member

Dr. Dang Tuan Linh
Member

Dr. Ngo Thanh Trung
Member

Projects and Solutions

[VINIF2020] Development of a Real-time AI-assisted System to Detect Colon Polyps and Identify Lesions at High Risk of Malignancy During Endoscopy

[NAVER2020] Hand gesture recognition

BKAI-IGH NeoPolyp-Small: A dataset for fine-grained polyp segmentation

Collaborations

National partners (in Vietnam)

USTH: ICTLab & Space departments
MICA (HUST)
HUS-VNU
VNU-UET (FIMO)
VNUA (FIT)
HCMUS
Can Tho University
IRD: Institut de Recherche pour le Développement (Vietnam branch)

International partners

Asia-Pacific:
- Australia: University of Technology Sydney, Bureau of Meteorology, CSIRO, Griffith University, The University of Queensland
- China: Lanzhou University
- Japan: University of Tsukuba, Kochi University of Technology
- South Korea: Chosun University
America:
- USA: University of Hawaii
- Brazil: University of Sao Paulo
Russia: Tula State University
Africa: Tunisia – Sfax University
Europe:
- France: La Rochelle University, Poitiers University, Bordeaux University, INSA Lyon, Nancy University
- Switzerland: Fribourg University
- Spain: Universitat Autonoma de Barcelona

Latest Publications

Publications in 2024

T. K. Lai, and I. L. Ngo. An investigation on the thermo-electrohydraulic performance of novel ECF micro-pump. International Journal of Heat and Mass Transfer. 29/09/2024
T. K. Lai, K. D. Tran, and I. L. Ngo. A numerical study on the thermo-electrohydrodynamic performance of ECF micro-pumps. Sustainability and Emerging Technologies for Smart Manufacturing. 29/04/2024
Tuan Linh Dang, Thuy Ha Hoang, Minh Hoang Cu, Duc Quang Nguyen, Huu Phuc Hoang. Semi-supervised Learning for Image Quality Assessment Problem. International Journal of Computer Applications. 9-13. 21/02/2024
JYE Tin, WW Tan, AA Bakar, MS Mahali, FF Lothai, NF Mohammad, SSA Hassan & KF Chin. A Conceptual Design of Sustainable Solar Photovoltaic (PV) Powered Corridor Lighting System with IoT Application. ICREEM 2022. 09/03/2024
Quang Minh Dang, Minh Tuyen Truong, Tuan Linh Dang. A lightweight approach for image quality assessment. Signal, Image and Video Processing. 1-8. 01/06/2024
T. K. Lai, and I. L. Ngo. A new design and optimization of VD-ECF micro-pump: Advancements in electrohydraulic performance. Physics of Fluids. 29/07/2024
Tuan Linh Dang, Trung Hieu Pham, Duc Loc Le, Xuan Tung Tran, Hoang Nam Le, Khanh Hung Nguyen, Tran Tuan Nghia Trinh. Person re-identification on lightweight devices: end-to-end approach. Multimedia Tools and Applications. 1-14. 27/03/2024
T. K. Lai, and I. L. Ngo. An investigation on the electrohydraulic performance of novel ECF micro-pump with NACA-shaped electrodes. Theoretical and Computational Fluid Dynamics. 29/02/2024
Yukinobu Hoshino, Masahiro Shimasaki, Namal Rathnayake, Tuan Linh Dang. Performance verification and latency time evaluation of hardware image processing module for appearance inspection systems using FPGA. Journal of Real-Time Image Processing. 1-16. 26/11/2023
Yukinobu Hoshino, Yuka Nishiyama, Toshimi Yamamoto, Yuki Shinomiya, Namal Rathnayake, Tuan Linh Dang. Human-inspired similarity control system: Enhancing line-following robot perception. Applied Soft Computing Journal. 1-15. 14/04/2024
Tuan Linh Dang, Trung Hieu Pham, Duc Manh Dao, Hoang Vu Nguyen, Quang Minh Dang, Ba Tuan Nguyen, Nicolas Monet. DATE: a video dataset and benchmark for dynamic hand gesture recognition. Neural Computing and Applications. 1-15. 09/05/2024

Publications in 2023

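Nguyễn Đức Ca, Phan Thị Thu, Hoàng Thị Minh Anh, Phạm Ngọc Dương, Nguyễn Hoàng Giang, Nguyễn Lệ Hằng. Nâng cao hiệu quả quản trị đại học trong bối cảnh đổi mới giáo dục tại Việt Nam [Improving the effectiveness of university governance in the context of educational reform in Vietnam]. Tạp chí khoa học giáo dục Việt Nam [Vietnam Journal of Educational Sciences]. 14/03/2023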
Nguyen Quang Duc, Tran Khanh Luong, Le Hong Duc, Nguyen Huy Hoan, Trinh Anh Phuc, Dinh Viet Sang. Improving Single Positive Multi-label Classification via Knowledge-based Label-weighted Large Loss Rejection. The 12th International Symposium on Information and Communication Technology. 429-434. 07/12/2023
Thuong Nguyen Canh; Trung Thanh Ngo; Hajime Nagahara. Human-Imperceptible Identification with Learnable Lensless Imaging. IEEE Access. 95724 - 95733. 18/08/2023
Chenhao Li, Trung Thanh Ngo, Hajime Nagahara. Inverse Rendering of Translucent Objects using Physical and Neural Renderers. The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2023. 12510-12520. Vancouver Convention Center, Canada. 18/06/2023
Huy-Hoang Nguyen, Thi-Oanh Nguyen. HRSeg: Leveraging High-Resolution Images to Enhance Polyp Segmentation Quality. 2023 15th International Conference on Knowledge and Systems Engineering (KSE). 1-4. 18/10/2023
Ren-Jun Soon, Dinh Viet Sang, Chin-Boon Chng, Chee-Kong Chui. Explainable AI for CPS-Based Manufacturing Workcell. 2023 International Conference on System Science and Engineering (ICSSE). 332-337. 27/07/2023
Ngo-Kien Duong, Viet-Sang Dinh, Thi-Oanh Nguyen. MCLDA: Multi-level Contrastive Learning for Domain Adaptive Semantic Segmentation. SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology. 343–350. Ho Chi Minh, Viet Nam. 07/12/2023
Tung Nguyen Quang, Thi-Oanh Nguyen. Language Knowledge-Assisted in Topology Construction for Skeleton-Based Action Recognition. SOICT 2023: The 12th International Symposium on Information and Communication Technology. 443-449. Ho Chi Minh, Vietnam. 07/12/2023
Tuan Linh Dang, Gia Tuyen Nguyen, Thang Cao. Real-Time Image Processing Using Edge AI Devices. International Journal of Computer Applications. 1-7. 09/11/2023
Nguyen Minh Chau, Nguyen Ngoc Toan, Le Dinh Tuyen, Dinh Viet Sang, Pooi-Mun Wong, Chin-Boon Chng and Chee-Kong Chui. Boosting Facial Landmark Detection via Self-supervised and Semi-supervised Learning. The 12th International Symposium on Information and Communication Technology. 485-492. 07/12/2023
Nguyen Van Giang, Nguyen Minh Son, Kieu Anh Van, Tran Cat Khanh, Pham Ngoc Minh, Dinh Viet Sang. One-stage Robotic Grasp Detection. International Conference on Knowledge and Systems Engineering (KSE). 18/10/2023
Namal Rathnayake, Tuan Linh Dang, Akira Miyazaki, and Yukinobu Hoshino. An Efficient Approach for Age-Wise Rice Seeds Classification using SURF-BOF with Modified Cascaded-ANFIS algorithm. Fifteenth International Conference on Machine Vision (ICMV 2022). 1-9. Rome, Italy. 18/11/2022
Namal Rathnayake, Akira Miyazaki, Tuan Linh Dang, Yukinobu Hoshino. Age Classification of Rice Seeds in Japan Using Gradient-Boosting and ANFIS Algorithms. Sensors. 1-18. 03/03/2023
Namal Rathnayake, Upaka Rathnayake, Tuan Linh Dang, Yukinobu Hoshino. Water level prediction using soft computing techniques: A case study in the Malwathu Oya, Sri Lanka. PLoS ONE. 1-21. 22/02/2023
Vu Quoc Hung, Tran Le Phuong Thao, Trinh Xuan Minh, Dinh Viet Sang. LSegDiff: A Latent Diffusion Model for Medical Image Segmentation. The 12th International Symposium on Information and Communication Technology. 456-462. 07/12/2023
Tuan Linh Dang, Duc Loc Le, Trung Hieu Pham, Xuan Tung Tran. Lightweight Models’ Performances on a Resource-Constrained Device for Traffic Application. The Fourth International Conference on Artificial Intelligence and Computational Intelligence (AICI 2023) (proceedings published in Deep Learning and Other Soft Computing Techniques, Studies in Computational Intelligence 1097). 1-14. Hanoi. 13/01/2023
Nguyen Viet Hoai, Pham Vu Hung, Dinh Viet Sang. Memory-Driven Region Contrast for Enhanced Polyp Semantic Segmentation. 2023 International Conference on Multimedia Analysis and Pattern Recognition, MAPR 2023 - Proceedings. 05/10/2023
Nguyen Hong Son, Nguyen Thanh Huyen, Dinh Viet Sang. Semi-Supervised Learning with Dense Target Producer for End-to-End Lightweight Polyp Detection. International Conference on Knowledge and Systems Engineering (KSE). 18/10/2023
Pham Van Toan, Dinh Viet Sang. M3C-Polyp: Mixed Momentum Model Committee for Improved Semi-Supervised Learning in Polyp Segmentation. World Symposium on Software Engineering. 274-279. 22/09/2023
Tuan Linh Dang, Trung Hieu Pham, Quang Minh Dang, Nicolas Monet. A lightweight architecture for hand gesture recognition. Multimedia Tools and Applications. 28569–28587. 31/01/2023
Pham Van Toan, Dinh Viet Sang. ESSL-Polyp: A Robust Framework of Ensemble Semi-supervised Learning in Polyp Segmentation. Lecture Notes in Networks and Systems. 39-52. London, UK. 22/06/2023
Toan Pham Van, Sang Dinh Viet, Linh Bao Doan, Thanh Tung Nguyen, Quang Hung Nguyen, Duc Trung Tran. Improve polyp semi-supervised segmentation with prioritizing the reliability of unlabeled images. ICSIE. 35-40. 21/10/2022
Namal Rathnayake, Upaka Rathnayake, Imiya Chathuranika, Tuan Linh Dang, Yukinobu Hoshino. Projected Water Levels and Identified Future Floods: A Comparative Analysis for Mahaweli River, Sri Lanka. IEEE Access. 8920-8937. 17/01/2023
Namal Rathnayake, Upaka Rathnayake, Imiya Chathuranika, Tuan Linh Dang, Yukinobu Hoshino. Cascaded-ANFIS to simulate nonlinear rainfall–runoff relationship. Applied Soft Computing. 1-14. 26/07/2023
Pham Van Toan, Dinh Viet Sang. ESSL-Polyp: A Robust Framework of Ensemble Semi-supervised Learning in Polyp Segmentation. Computing Conference 2023. 39-52. 20/08/2023
Yong Cheng, Wei Wang, Wenjie Zhang, Ling Yang, Jun Wang, Huan Ni, Tingzhao Guan, Jiaxin He, Yakang Gu and Ngoc Nguyen Tran. A Multi-Feature Fusion and Attention Network for Multi-Scale Object Detection in Remote Sensing Images. Remote Sensing. 1-19. 12/04/2023

Publications in 2022

Namal Rathnayake, Tuan Linh Dang, Yukinobu Hoshino. Designing and Implementation of Novel Ensemble model based on ANFIS and Gradient Boosting methods for Hand Gestures. SoICT 2022. 283-289. Hanoi-HaLong. 01/12/2022
Manh Nguyen Duy, Hang Dao Viet, Long Dao Van, Hung Le Quang, Khanh Pham Cong, Oanh Nguyen Thi, Thuy Nguyen Thi, Sang Dinh Viet. EndoUNet: A Unified Model for Anatomical Site Classification, Lesion Categorization and Segmentation for Upper Gastrointestinal Endoscopy. KSE. 19/10/2022
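Phan Thị Thu. Những vấn đề đặt ra đối với thiết chế Hội đồng trường đại học công lập ở Việt Nam hiện nay [Issues facing the university council as an institution in public universities in Vietnam today]. Proceedings of the scientific conference of the Academy of Journalism and Communication. 14/11/2022
Phan Thị Thu. Cuộc cạnh tranh của các công ty người Việt với nước ngoài trong lĩnh vực vận tải đường thuỷ ở bắc kỳ đầu thế kỉ XX và những kinh nghiệm đối với doanh nhân Việt Nam hiện nay [The competition between Vietnamese-owned and foreign companies in waterway transport in Tonkin in the early 20th century and lessons for Vietnamese entrepreneurs today]. Proceedings of the national conference held at Hanoi Pedagogical University 2. 14/12/2022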
Tuan Linh Dang, Thuy Hang Nguyen, Gia Tuyen Nguyen, Thang Cao. Traffic Collision Warning Using Deep Learning Models. ICIC Express Letters. 17-24. 01/08/2021
Namal Rathnayake, Upaka Rathnayake, Tuan Linh Dang, Yukinobu Hoshino. An Efficient Automatic Fruit-360 Image Identification and Recognition Using a Novel Modified Cascaded-ANFIS Algorithm. Sensors. 4401. 08/06/2022
K. D. Nam, T. M. Nguyen, T. V. Dieu, M. Visani, T. -O. Nguyen and D. V. Sang. A Novel Unsupervised Domain Adaption Method for Depth-Guided Semantic Segmentation Using Coarse-to-Fine Alignment. IEEE Access. 101248-101262. 21/08/2022
N. T. Duc, N. T. Oanh, N. T. Thuy, T. M. Triet and V. S. Dinh. ColonFormer: An Efficient Transformer Based Method for Colon Polyp Segmentation. IEEE Access. 80575-80586. 25/07/2022
Vien Truong Nguyen, Quang-Van Doan, Ngoc Nguyen Tran, Ly Thi Mai Luong, Pham Minh Chinh, Phong K Thai, Dung Phung, Hong H T C Le, Tran Ngoc Dang. The protective effect of green space on heat-related respiratory hospitalization among children under 5 years of age in Hanoi, Vietnam. Environmental Science and Pollution Research. 1-15. 20/05/2022
Keita Mitani, Namal Rathnayake, Upaka Rathnayake, Tuan Linh Dang, Yukinobu Hoshino. Brain Activity Associated with the Planning Process during the Long-Time Learning of the Tower of Hanoi (ToH) Task: A Pilot Study. Sensors. 1-14. 26/10/2022
Namal Rathnayake, Upaka Rathnayake, Tuan Linh Dang, Yukinobu Hoshino. A Cascaded Adaptive Network-Based Fuzzy Inference System for Hydropower Forecasting. Sensors. 2905. 08/04/2022
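Trần Hoàng Hải, Nguyễn Thanh Hùng, Nguyễn Nhất Hải, Đặng Tuấn Linh, Huỳnh Quyết Thắng. eHUST - Một mô hình mẫu cho hệ thống quản trị Nhà trường hỗ trợ Chuyển đổi số tại Việt Nam [eHUST: a reference model for a university management system supporting digital transformation in Vietnam]. Thúc đẩy Chuyển đổi số, Kinh tế tuần hoàn và kinh tế xanh - Hướng tới mục tiêu phát triển bền vững [Promoting digital transformation, the circular economy and the green economy: towards sustainable development goals]. 18-26. Phenikaa University. 12/11/2022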
Tuan Linh Dang, Sy Dat Tran, Thuy Hang Nguyen, Suntae Kim, Nicolas Monet. An improved hand gesture recognition system using keypoints and hand bounding boxes. Array. 1-10. 21/09/2022
C Palanichamy, WW Tan & P Naveen. A Microgrid for the Secluded Paana Theertham Kani Settlement in India. Clean Energy. 09/02/2022
Ling Yao, Jiaying Lu, Wenjie Zhang, Jun Qin, Chenghu Zhou, Ngoc Nguyen Tran, Ekena Rangel Pinagé. Spatiotemporal Analysis of Extreme Temperature Change on the Tibetan Plateau Based On Quantile Regression. Earth and Space Science. 1-15. 29/09/2022
Yuhe Zhao, Minyu Wang, Tianxiang Zhao, Yi Luo, Yuhan Li, Kai Yan, Lei Lu, Ngoc Nguyen Tran, Xiaodan Wu, Xuanlong Ma. Evaluating the potential of H8/AHI geostationary observations for monitoring vegetation phenology over different ecosystem types in northern China. International Journal of Applied Earth Observation and Geoinformation. 1-12. 21/07/2022
Nam Kieu Dang, Oanh Nguyen Thi, Thuy Nguyen Thi, Hang Dao Viet, Long Dao Van, Trung Tran Quang, Sang Dinh Viet. A Coarse-to-fine Unsupervised Domain Adaptation Method for Cross-Mode Polyp Segmentation. KSE. 19/10/2022
Tuan Linh Dang, Huu Thang Nguyen, Duc Manh Dao, Hoang Vu Nguyen, Duc Long Luong, Ba Tuan Nguyen, Suntae Kim, Nicolas Monet. SHAPE: a dataset for hand gesture recognition. Neural Computing and Applications. 21849–21862. 18/07/2022
Namal Rathnayake, Upaka Rathnayake, Tuan Linh Dang, and Yukinobu Hoshino. Streamflow prediction using Cascaded-ANFIS algorithm in Kelani River, Sri Lanka. The 10th International Symposium on Computational Intelligence and Industrial Applications (ISCIIA2022). 1-6. Beijing, China. 23/09/2022
Tuan Linh Dang, Tran Sy Dat, Thuy Ha Hoang, Trong Nghia Nguyen, Tuan Minh Vu. Prototype of a parking system with path recommendation. SoICT2022. 309-316. Hanoi-HaLong. 01/12/2022
Nguyen Minh Chau, Le Truong Giang, Dinh Viet Sang. PolypDEQ: Towards Effective Transformer-Based Deep Equilibrium Models for Colon Polyp Segmentation. ISVC (Lecture Notes in Computer Science). 456-467. USA. 03/10/2022
Nguyen Viet Manh, Kieu Dang Nam, Dinh Viet Sang, Thi-Oanh Nguyen. G2L: A Global to Local Alignment Method for Unsupervised Domain Adaptive Semantic Segmentation. Procedia Computer Science. 2698-2707. Verona, Italy. 06/09/2022
Toan Pham Van, Linh Doan Bao, Duc Tran Trung, Quan Nguyen Van, Sang Dinh Viet. Online pseudo labeling for polyp segmentation with momentum networks. KSE. 19/10/2022
Tuan Linh Dang, Nhat Minh Ngo. SDNs Delay Prediction Using Machine Learning Algorithms. The Third International Conference on Artificial Intelligence and Computational Intelligence (AICI 2022) (proceedings published in Biomedical and Other Applications of Soft Computing). 133-141. Online due to COVID-19. 14/01/2022
Tuan Linh Dang, Viet Tien Ha. Shop Product Tracking and Early Fire Detection Using Edge Devices. The Third International Conference on Artificial Intelligence and Computational Intelligence (AICI 2022) (proceedings published in Biomedical and Other Applications of Soft Computing). 121-131. Online due to COVID-19. 14/01/2022
Nguyen Tuan Hung, Phan Ngoc Lan, Nguyen Thi Oanh, Nguyen Thi Thuy, Dinh Viet Sang. GCEENet: A Global Context Enhancement and Exploitation for Medical Image Segmentation. ISVC (Lecture Notes in Computer Science). 141-152. USA. 03/10/2022
Dinh Viet Sang, Do Duy Quang. Incremental Boundary Refinement using Self Axial Reverse Attention and Uncertainty-Aware Gate for Colon Polyp Segmentation. SoICT. 322-328. 01/12/2022
