Human vision, by enabling people to interpret their surrounding environment, is one of our most important senses, as many experts consider that 80% of what we perceive comes through vision. Put simply, Computer Vision is the sub-discipline of Artificial Intelligence which teaches machines to “see like a human”. More precisely, it consists of specific hardware, and/or software algorithms providing computers with the ability to capture, process and interpret images, videos or signals taken from a camera or other sensors.

Researchers started to work on Computer Vision in the 1960’s, achieving constant progress in this field. In the 2010’s, Deep Learning, a branch of Machine Learning, revolutionized Computer Vision. Among other breakthroughs, Deep Learning-based algorithms surpassed human in their ability to recognize human faces in 2014. Since then, Computer Vision is one of the hottest topics in the broad field of Artificial Intelligence. Computer Vision is nowadays applied in most of the aspects of our daily life: medicine, manufacturing, biometry, autonomous vehicles, digitization of paper documents and books for electronic access, military and law enforcement, recycling household waste or other environmental applications using aerial/satellite images, etc.

Our research group focuses on the conception and development of high-speed, light-weight and effective algorithms for analysis and understanding different types of images/videos: natural images/videos (taken through regular cameras), medical images, remote sensing images, document images. See the slides here for more detail.

Contact: Dr. Nguyen Thi Oanh, Email:

Research Directions

We are especially interested in the tasks of object detection, classification, semantic segmentation and tracking.

Some keywords about our research directions include:

  • Multimodality
    • Spatio-temporal information
    • Raw data (or text) associated with the images
  • Domain adaptation
    • Transferring the model learned from one set of images to a different set of images
  • Limited resources constraints (linked to embedded systems)
    • Definition of light weight models
  • User interaction

Examples of methods we use include both traditional Image Processing methods and Machine Learning methods, especially Deep Learning (often with Convolutional Neural Networks and Recurrent Neural Networks).

Research Problems

Our research problems include, but are not limited to: 

  • Medical imaging:
    • Segmentation of colon polyps and identifying lesions at high-risk of malignancy (cancer) during endoscopy
    • Detecting brain degeneration for Alzheimer’s patients from 3D MRI images and clinical data
  • Traffic monitoring and autonomous vehicles
    • Vehicles and pedestrian tracking in videos, including embedding the proposed algorithms in edge devices
    • Semantic segmentation for intelligent vehicles 
  • Remote sensing – satellite image processing and analysis:
    • Adjusting Geostationary (GEO) satellite images with Low-Earth-Orbit (LEO) images
    • Study of Urban Heat Islands and their impact on the environment and humans 
  • Gesture recognition from videos:
    • Human Action Recognition
    • Hand Gesture Recognition 
  • Document analysis and understanding:
    • Incremental multimodal classification from streams of documents
    • Understanding ancient Vietnamese text (Han-Nom characters)
  • Biometry access control: face verification and anti-spoofing

Team Members

Assoc. Prof. Muriel VISANI
Team Leader

Dr. Dinh Viet Sang

Dr. Nguyen Thi Oanh

Dr. Tran Nguyen Ngoc

Dr. Dang Tuan Linh

Dr. Ngo Thanh Trung

Projects and Solutions


National partners (in Vietnam)

  • USTH: ICTLab & Space departments
  • VNUA (FIT)
  • Can Tho University
  • IRD: Institut de Recherche pour le Développement (Vietnam branch)

International partners

  • Asia-Pacific:
    • Australia: University of Technology Sydney, Bureau of meteorology, CSIRO, Griffith Uniersity, The University of Queensland
    • China: Lanzhou University
    • Japan: University of Tsukuba, Kochi University of Technology
    • South Korea: Chosun University
  • America:
    • USA: University of Hawaii
    • Brazil: University of Sao Paulo
  • Russia: Tula State University
  • Africa: Tunisia – Sfax University
  • Europe:
    • France: La Rochelle University, Poitiers University, Bordeaux University, INSA Lyon, Nancy University
    • Switzerland: Fribourg University
    • Spain: Universitat Autonoma de Barcelona

Latest Publications

Publications in 2023

  1. Nguyễn Đức Ca, Phan Thị Thu, Hoàng Thị Minh Anh, Phạm Ngọc Dương, Nguyễn Hoàng Giang, Nguyễn Lệ Hằng. Nâng cao hiệu quả quản trị đại học trong bối cảnh đổi mới giáo dục tại Việt Nam. Tạp chí khoa học giáo dục Việt Nam. 14/03/2023
  2. Thuong Nguyen Canh; Trung Thanh Ngo; Hajime Nagahara. Human-Imperceptible Identification with Learnable Lensless Imaging. IEEE Access. 95724 - 95733. 18/08/2023
  3. Chenhao Li, Trung Thanh Ngo, Hajime Nagahara. Inverse Rendering of Translucent Objects using Physical and Neural Renderers. The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2023. 12510-12520. Vancouver Convention Center, Canada. 18/06/2023
  4. Huy-Hoang Nguyen, Thi-Oanh Nguyen. HRSeg: Leveraging High-Resolution Images to Enhance Polyp Segmentation Quality. 2023 15th International Conference on Knowledge and Systems Engineering (KSE). 1-4. 18/10/2023
  5. Tuan Linh Dang, Duc Loc Le, Trung Hieu Pham, Xuan Tung Tran. Lightweight Models’ Performances on a Resource-Constrained Device for Traffic Application. The Fourth International Conference on Artificial Intelligence and Computational Intelligence. (AICI 2023) (Kỷ yếu được đăng trong Deep Learning and Other Soft Computing Techniques, Studies in Computational Intelligence 1097 ). 1-14. Hà Nội. 13/01/2023
  6. Ngo-Kien Duong, Viet-Sang Dinh, Thi-Oanh Nguyen. MCLDA: Multi-level Contrastive Learning for Domain Adaptive Semantic Segmentation. SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology. 343–350. Ho Chi Minh, Viet Nam. 07/12/2023
  7. Tung Nguyen Quang, Thi-Oanh Nguyen. Language Knowledge-Assisted in Topology Construction for Skeleton-Based Action Recognition. SOICT 2023: The 12th International Symposium on Information and Communication Technology. 443-449. Ho Chi Minh, Vietnam. 07/12/2023
  8. Namal Rathnayake, Tuan Linh Dang, Akira Miyazaki, and Yukinobu Hoshino. An Efficient Approach for Age-Wise Rice Seeds Classification using SURF-BOF with Modified Cascaded-ANFIS algorithm. Fifteenth International Conference on Machine Vision (ICMV 2022). 1-9. Rome, Ý. 18/11/2022
  9. Tuan Linh Dang, Trung Hieu Pham, Quang Minh Dang, Nicolas Monet. A lightweight architecture for hand gesture recognition. Multimedia Tools and Applications. 28569–28587. 31/01/2023
  10. ham Van Toan, Dinh Viet Sang. ESSL-Polyp - A Robust Framework of Ensemble Semi-Supervised Learning in Polyp Segmentation. Computing Conference. London, UK. 22/06/2023
  11. Toan Pham Van, Sang Dinh Viet, Linh Bao Doan, Thanh Tung Nguyen, Quang Hung Nguyen, Duc Trung Tran. Improve polyp semi-supervised segmentation with prioritizing the reliability of unlabeled images. ICSIE. 35-40. 21/10/2022
  12. Namal Rathnayake, Upaka Rathnayake, Imiya Chathuranika, Tuan Linh Dang, Yukinobu Hoshino. Projected Water Levels and Identified Future Floods: A Comparative Analysis for Mahaweli River, Sri Lanka. IEEE Access. 8920-8937. 17/01/2023
  13. Namal Rathnayake, Akira Miyazaki, Tuan Linh Dang, Yukinobu Hoshino. Age Classification of Rice Seeds in Japan Using Gradient-Boosting and ANFIS Algorithms. Sensors. 1-18. 03/03/2023
  14. Namal Rathnayake , Upaka Rathnayake, Tuan Linh Dang, Yukinobu Hoshino. Water level prediction using soft computing techniques: A case study in the Malwathu Oya, Sri Lanka. PLoS ONE. 1-21. 22/02/2023
  15. Namal Rathnayake, Upaka Rathnayake, Imiya Chathuranika, Tuan Linh Dang, Yukinobu Hoshino. Cascaded-ANFIS to simulate nonlinear rainfall–runoff relationship. Applied Soft Computing. 1-14. 26/07/2023
  16. Yong Cheng, Wei Wang, Wenjie Zhang, Ling Yang, Jun Wang, Huan Ni, Tingzhao Guan, Jiaxin He, Yakang Gu and Ngoc Nguyen Tran. A Multi-Feature Fusion and Attention Network for Multi-Scale Object Detection in Remote Sensing Images. Remote Sensing. 1-19. 12/04/2023

Publications in 2022

  1. Namal Rathnayake, Tuan Linh Dang , Yukinobu Hoshino. Designing and Implementation of Novel Ensemble model basedon ANFIS and Gradient Boosting methods for Hand Gestures. SoICT 2022. 283-289. Hanoi-HaLong. 01/12/2022
  2. Keita Mitani, Namal Rathnayake, Upaka Rathnayake, Tuan Linh Dang, Yukinobu Hoshino. Brain Activity Associated with the Planning Process during the Long-Time Learning of the Tower of Hanoi (ToH) Task: A Pilot Study. Sensors. 1-14. 26/10/2022
  3. Namal Rathnayake, Upaka Rathnayake, Tuan Linh Dang, Yukinobu Hoshino. A Cascaded Adaptive Network-Based Fuzzy Inference System for Hydropower Forecasting. Sensors. 2905. 08/04/2022
  4. Manh Nguyen Duy, Hang Dao Viet, Long Dao Van, Hung Le Quang, Khanh Pham Cong, Oanh Nguyen Thi, Thuy Nguyen Thi, Sang Dinh Viet. EndoUNet: A Unified Model for Anatomical Site Classification, Lesion Categorization and Segmentation for Upper Gastrointestinal Endoscopy. KSE. 19/10/2022
  5. Trần Hoàng Hải, Nguyễn Thanh Hùng, Nguyễn Nhất Hải, Đặng Tuấn Linh, Huỳnh Quyết Thắng. eHUST - Một mô hình mẫu cho hệ thống quản trị Nhà trường hỗ trợ Chuyển đổi số tại Việt Nam. Thúc đấy Chuyển đổi số, Kinh tế tuần hoàn và kinh tế xanh - Hướng tới mục tiêu phát triển bền vững. 18-26. Trường Đại học Phenikaa. 12/11/2022
  6. Phan Thị Thu. Những vấn đề đặt ra đối với thiết chế Hội đồng trường đại học công lập ở Việt Nam hiện nay. Kỷ yếu hội thảo khoa học trường Học viện báo chí và tuyên truyền. 14/11/2022
  7. Phan Thị Thu. Cuộc cạnh tranh của các công ty người Việt với nước ngoài trong lĩnh vực vận tải đường thuỷ ở bắc kỳ đầu thế kỉ XX và những kinh nghiệm đối với doanh nhân Việt Nam hiện nay. Kỷ yếu hội thảo quốc gia tổ chức tại trường ĐHSP Hà Nội 2. 14/12/2022
  8. Tuan Linh Dang, Thuy Hang Nguyen, Gia Tuyen Nguyen, Thang Cao. Traffic Collision Warning Using Deep Learning Models. ICIC Express Letters. 17-24. 01/08/2021
  9. Tuan Linh Dang, Sy Dat Tran, Thuy Hang Nguyen, Suntae Kim, Nicolas Monet. An improved hand gesture recognition system using keypoints and hand bounding boxes. Array. 1-10. 21/09/2022
  10. Namal Rathnayake, Upaka Rathnayake, Tuan Linh Dang, Yukinobu Hoshino. An Efficient Automatic Fruit-360 Image Identification and Recognition Using a Novel Modified Cascaded-ANFIS Algorithm. Sensors. 4401. 08/06/2022
  11. Ling Yao, Jiaying Lu, Wenjie Zhang, Jun Qin, Chenghu Zhou, Ngoc Nguyen Tran, Ekena Rangel Pinagé. Spatiotemporal Analysis of Extreme Temperature Change on the Tibetan Plateau Based On Quantile Regression. Earth and Space Science. 1-15. 29/09/2022
  12. Yuhe Zhao, Minyu Wang, Tianxiang Zhao , Yi Luo , Yuhan Li , Kai Yan , Lei Lu , Ngoc Nguyen Tran , Xiaodan Wu , Xuanlong Ma. Evaluating the potential of H8/AHI geostationary observations for monitoring vegetation phenology over different ecosystem types in northern China. International Journal of Applied Earth Observation and Geoinformation. 1-12. 21/07/2022
  13. Nam Kieu Dang, Oanh Nguyen Thi, Thuy Nguyen Thi, Hang Dao Viet, Long Dao Van, Trung Tran Quang, Sang Dinh Viet. A Coarse-to-fine Unsupervised Domain Adaptation Method for Cross-Mode Polyp Segmentation. KSE. 19/10/2022
  14. K. D. Nam, T. M. Nguyen, T. V. Dieu, M. Visani, T. -O. Nguyen and D. V. Sang. A Novel Unsupervised Domain Adaption Method for Depth-Guided Semantic Segmentation Using Coarse-to-Fine Alignment. IEEE Access. 101248-101262. 21/08/2022
  15. Tuan Linh Dang, Huu Thang Nguyen, Duc Manh Dao, Hoang Vu Nguyen, Duc Long Luong, Ba Tuan Nguyen, Suntae Kim, Nicolas Monet. SHAPE: a dataset for hand gesture recognition. Neural Computing and Applications. 21849–21862. 18/07/2022
  16. Namal Rathnayake, Upaka Rathnayake, Tuan Linh Dang, and Yukinobu Hoshino. Streamflow prediction using Cascaded-ANFIS algorithm in Kelani River, Sri Lanka. The 10th International Symposium on Computational Intelligence and Industrial Applications (ISCIIA2022). 1-6. Bắc Kinh, Trung Quốc. 23/09/2022
  17. Tuan Linh Dang, Tran Sy Dat, Thuy Ha Hoang, Trong Nghia Nguyen, Tuan Minh Vu. Prototype of a parking system with path recommendation. SoICT2022. 309-316. Hanoi-HaLong. 01/12/2022
  18. N. T. Duc, N. T. Oanh, N. T. Thuy, T. M. Triet and V. S. Dinh. ColonFormer: An Efficient Transformer Based Method for Colon Polyp Segmentation. IEEE Access. 80575-80586. 25/07/2022
  19. Vien Truong Nguyen, Quang-Van Doan, Ngoc Nguyen Tran, Ly Thi Mai Luong, Pham Minh Chinh, Phong K Thai, Dung Phung, Hong H T C Le, Tran Ngoc Dang. The protective effect of green space on heat-related respiratory hospitalization among children under 5 years of age in Hanoi, Vietnam. Environmental Science and Pollution Research. 1-15. 20/05/2022
  20. Nguyen Minh Chau, Le Truong Giang, Dinh Viet Sang. PolypDEQ: Towards Effective Transformer-Based Deep Equilibrium Models for Colon Polyp Segmentation. ISVC (Lecture Notes in Computer Science). 456-467. USA. 03/10/2022
  21. Nguyen Viet Manh, Kieu Dang Nam, Dinh Viet Sang, Thi-Oanh Nguyen. G2L: A Global to Local Alignment Method for Unsupervised Domain Adaptive Semantic Segmentation. Procedia Computer Science. 2698-2707. Verona, Italia. 06/09/2022
  22. Toan Pham Van, Linh Doan Bao, Duc Tran Trung, Quan Nguyen Van, Sang Dinh Viet. Online pseudo labeling for polyp segmentation with momentum networks. KSE. 19/10/2022
  23. Tuan Linh Dang, Nhat Minh Ngo. SDNs Delay Prediction Using Machine Learning Algorithms. The Third International Conference on Artificial Intelligence and Computational Intelligence (AICI 2022) (Kỷ yếu được đăng trong Biomedical and Other Applications of Soft Computing). 133-141. Online vì COVID19. 14/01/2022
  24. Tuan Linh Dang, Viet Tien Ha. Shop Product Tracking and Early Fire Detection Using Edge Devices. The Third International Conference on Artificial Intelligence and Computational Intelligence (AICI 2022) (Kỷ yếu được đăng trong Biomedical and Other Applications of Soft Computing). 121-131. Online vì COVID19. 14/01/2022
  25. Nguyen Tuan Hung, Phan Ngoc Lan, Nguyen Thi Oanh, Nguyen Thi Thuy, Dinh Viet Sang. GCEENet: A Global Context Enhancement and Exploitation for Medical Image Segmentation. ISVC (Lecture Notes in Computer Science). 141-152. USA. 03/10/2022
  26. Dinh Viet Sang, Do Duy Quang. Incremental Boundary Refinement using Self Axial Reverse Attention and Uncertainty-Aware Gate for Colon Polyp Segmentation. SoICT. 322-328. 01/12/2022