Human vision lets people interpret their surroundings and is one of our most important senses: many experts consider that 80% of what we perceive comes through vision. Put simply, Computer Vision is the sub-discipline of Artificial Intelligence that teaches machines to “see like a human”. More precisely, it consists of dedicated hardware and/or software algorithms that give computers the ability to capture, process and interpret images, videos or signals taken from a camera or other sensors.

Researchers started working on Computer Vision in the 1960s and have made steady progress ever since. In the 2010s, Deep Learning, a branch of Machine Learning, revolutionized Computer Vision. Among other breakthroughs, Deep Learning-based algorithms surpassed humans at recognizing human faces in 2014. Since then, Computer Vision has been one of the hottest topics in the broad field of Artificial Intelligence. It is now applied in most aspects of daily life: medicine, manufacturing, biometrics, autonomous vehicles, digitization of paper documents and books for electronic access, military and law enforcement, household waste recycling, environmental applications using aerial/satellite images, etc.

Our research group focuses on the design and development of high-speed, lightweight and effective algorithms for analyzing and understanding different types of images/videos: natural images/videos (taken with regular cameras), medical images, remote sensing images, and document images. See the slides here for more details.

Research Directions

We are especially interested in the tasks of object detection, classification, semantic segmentation and tracking.

Keywords for our research directions include:

  • Multimodality
    • Spatio-temporal information
    • Raw data (or text) associated with the images
  • Domain adaptation
    • Transferring the model learned from one set of images to a different set of images
  • Limited-resource constraints (linked to embedded systems)
    • Design of lightweight models
  • User interaction

The methods we use include both traditional Image Processing methods and Machine Learning methods, especially Deep Learning (often with Convolutional Neural Networks and Recurrent Neural Networks).

Research Problems

Our research problems include, but are not limited to: 

  • Medical imaging:
    • Segmentation of colon polyps and identification of lesions at high risk of malignancy (cancer) during endoscopy
    • Detecting brain degeneration in Alzheimer’s patients from 3D MRI images and clinical data
  • Traffic monitoring and autonomous vehicles:
    • Vehicle and pedestrian tracking in videos, including embedding the proposed algorithms in edge devices
    • Semantic segmentation for intelligent vehicles 
  • Remote sensing – satellite image processing and analysis:
    • Adjusting Geostationary (GEO) satellite images with Low-Earth-Orbit (LEO) images
    • Study of Urban Heat Islands and their impact on the environment and humans 
  • Gesture recognition from videos:
    • Human Action Recognition
    • Hand Gesture Recognition 
  • Document analysis and understanding:
    • Incremental multimodal classification from streams of documents
    • Understanding ancient Vietnamese text (Han-Nom characters)
  • Biometric access control: face verification and anti-spoofing

Team Members

Assoc. Prof. Muriel VISANI
Team Leader

Dr. Dinh Viet Sang

Dr. Nguyen Thi Oanh

Dr. Tran Nguyen Ngoc

Dr. Dang Tuan Linh

National partners (in Vietnam)

  • USTH: ICTLab & Space departments
  • VNUA (FIT)
  • Can Tho University
  • IRD: Institut de Recherche pour le Développement (Vietnam branch)

International partners

  • Asia-Pacific:
    • Australia: University of Technology Sydney, Bureau of Meteorology, CSIRO, Griffith University, The University of Queensland
    • China: Lanzhou University
    • Japan: University of Tsukuba, Kochi University of Technology
    • South Korea: Chosun University
  • America:
    • USA: University of Hawaii
    • Brazil: University of Sao Paulo
  • Russia: Tula State University
  • Africa: Tunisia – Sfax University
  • Europe:
    • France: La Rochelle University, Poitiers University, Bordeaux University, INSA Lyon, Nancy University
    • Switzerland: Fribourg University
    • Spain: Universitat Autonoma de Barcelona

Latest publications

Publications in 2021

  1. Namal Rathnayake, Tuan Linh Dang, Yukinobu Hoshino. A Novel Optimization Algorithm: Cascaded Adaptive Neuro-Fuzzy Inference System. International Journal of Fuzzy Systems. 1-17. 19/02/2021
  2. Namal Rathnayake, Tuan Linh Dang, Yukinobu Hoshino. Performance Comparison of the ANFIS based Quad-Copter Controller Algorithms. 2021 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE). 1-8. Luxembourg. 11/07/2021
  3. Tuan Linh Dang, Van Chuong Do. Fine-Grained Network Traffic Classification Using Machine Learning: Evaluation and Comparison. AICI 2021 (proceedings published in the Studies in Computational Intelligence book series, indexed by Scopus). 151-162. Institute of Information Technology, Vietnam Academy of Science and Technology, Hanoi, Vietnam. 15/01/2021
  4. Tuan Linh Dang, Thang Cao, Yukinobu Hoshino. Engraved digit detection using HOG-real AdaBoost and deep neural network. Turkish Journal of Electrical Engineering & Computer Sciences. 138-151. 11/09/2020
  5. Hoang-Thuyen Nguyen, Thi-Oanh Nguyen. Attention-based network for effective action recognition from multi-view video. Procedia Computer Science, Elsevier, 25th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems. 971-980. 18/05/2021
  6. Ekena Rangel Pinagé, David M. Bell, Matthew Gregory, Ngoc Nguyen Tran, Wenjie Zhang, Alfredo Huete. Effects of Tropical Forest Degradation on Amazon Forest Phenology. International Geoscience and Remote Sensing Symposium (IGARSS), 2020. 4516–4519. Waikoloa, HI, USA. 26/09/2020
  7. Dung Phung, Thong Nguyen-Huy, Ngoc Nguyen Tran, Dang Ngoc Tran, Van Quang Doan, Son Nghiem, Nga Huy Nguyen, Trung Hieu Nguyen, Trude Bennett. Hydropower dams, river drought and health effects: A detection and attribution study in the lower Mekong Delta Region. Climate Risk Management. 100280. 25/01/2021
  8. Tran Thi Thanh Hai, Nguyen Tien Hai, Dinh Viet Sang. Significant Trajectories and Locality Constrained Linear Coding for Hand Gesture Representation. ICCE. 359-364. Phu Quoc, Vietnam. 13/01/2021

Publications in 2020

  1. Tuan Linh Dang, Gia Tuyen Nguyen, Thang Cao. Object Tracking Using Improved Deep SORT YOLOv3 Architecture. ICIC Express Letters. 961-969. 01/06/2020
  2. Xuan Bui, Hieu Vu, Oanh Nguyen, Khoat Than. MAP Estimation With Bernoulli Randomness, and Its Application to Text Analysis and Recommender Systems. IEEE Access. 127818-127833. 22/06/2020
  3. Anh-Vu Bui, Thi-Oanh Nguyen. Multi-view Human Action Recognition Based on TSN Architecture Integrated with GRU. Procedia Computer Science, Elsevier, 24th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems (KES). 948-955. 16/09/2020
  4. Van-Sang Tran, Thi-Oanh Nguyen, Ha-Quang Thai. A camera-based solution for customer behavior identification. 2020 International Conference on Multimedia Analysis and Pattern Recognition (MAPR). Hanoi, Viet Nam. 08/10/2020
  5. Xuanlong Ma, Alfredo Huete, Ngoc Nguyen Tran, Jian Bi, Sicong Gao, Yelu Zeng. Sun-Angle Effects on Remote-Sensing Phenology Observed and Modelled Using Himawari-8. Remote Sensing. 1-23 (1339). 21/04/2020
  6. Ngoc Nguyen Tran, Alfredo Huete, Ha Nguyen, Ian Grant, Tomoaki Miura, Xuanlong Ma, Alexei Lyapustin, Yujie Wang, Elizabeth Ebert. Seasonal Comparisons of Himawari-8 AHI and MODIS Vegetation Indices over Latitudinal Australian Grassland Sites. Remote Sensing. 1-21 (2494). 01/08/2020
  7. Nguyen Thanh Dat, Nguyen Dang Tuan Anh, Dinh Viet Sang. PCA-based 3D Facial Reenactment From Single Image. MAPR 2020. 1-6. Hanoi. 08/10/2020
  8. Dinh Viet Sang, Tran Quang Chung, Nguyen Duc Dung, In Seop Na. Attention ResCUNet-GAN: A Novel Facial UV Map Completion for Pose-invariant Face Recognition. HCIS Workshop 2020. 24/02/2020
  9. Tran Quang Chung, Hoang Cao Huyen, Dinh Viet Sang. A Novel Generative Model to Synthesize Face Images for Pose-invariant Face Recognition. MAPR 2020. 1-6. 08/10/2020
  10. In Seop Na, Chung Tran, Dung Nguyen, Sang Dinh. Facial UV Map Completion for Pose-invariant Face Recognition: A Novel Adversarial Approach based on Coupled Attention Residual UNets. Human-centric Computing and Information Sciences 2020. 10/11/2020
  11. Nguyen Thanh Hau, Le Cong Hau, Dinh Viet Sang, Tingting Yao, Wei Li, Zhiyong Wang. Efficient Brain Tumor Segmentation with Dilated Multi-fiber Network and Weighted Bi-directional Feature Pyramid Network. DICTA 2020. 30/11/2020
  12. Z. Ming, M. Visani, M.M. Luqman, J.C. Burie. A Survey on Anti-Spoofing Methods for Facial Recognition with RGB Cameras of Generic Consumer Devices. Journal of Imaging. 6(12):139, 56 pages, 2020.
  13. C. Ostertag, M. Beurton-Aimar, M. Visani, T. Urruty, K. Bertet. Predicting Brain Degeneration with a Multimodal Siamese Neural Network. International Conference on Image Processing Theory, Tools and Applications (IPTA), IEEE, pages 1-6, November 2020.

Publications in 2019

  1. Tuan Linh Dang, Yukinobu Hoshino. Improved PSO Algorithm for Training of Neural Network in Co-design Architecture. International Journal of Computer Applications. 1-7. 18/02/2019
  2. Tuan Linh Dang, Thang Cao, Yukinobu Hoshino. Classification of Metal Objects Using Deep Neural Networks in Waste Processing Line. International Journal of Innovative Computing, Information and Control. 1901-1912. 01/07/2019
  3. Tuan Linh DANG, Yukinobu HOSHINO. Hardware-Based Principal Component Analysis for Hybrid Neural Network Trained by Particle Swarm Optimization on a Chip. IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences. 1374-1382. 14/08/2019
  4. Manh-Hung Lu, Thi-Oanh Nguyen. Spatio-temporal Multi-level Fusion for Human Action Recognition. The 10th International Symposium on Information and Communication Technology. 298-305. Ha Long, Vietnam. 04/12/2019
  5. Huong-Giang Doan, Thanh-Hai Tran, Hai Vu, Thi-Lan Le, Van-Toi Nguyen, Sang Viet Dinh, Thi-Oanh Nguyen, Thi-Thuy Nguyen, Duy-Cuong Nguyen. Multi-view discriminant analysis for dynamic hand gesture recognition. The 5th Asian Conference on Pattern Recognition (ACPR 2019). Auckland, New Zealand. 26/11/2019
  6. Luong Nguyen Van, Thi-Oanh Nguyen. Object counting based on density using perspective transformation. The IEEE-RIVF 2019 International Conference on Computing and Communication Technologies. 01/03/2019
  7. Xuanlong Ma, Alfredo Huete, Ngoc Nguyen Tran. Interaction of seasonal sun-angle and savanna phenology observed and modelled using MODIS. Remote Sensing. 1-19 (1398). 12/06/2019
  8. Jianxiu Shen, Alfredo Huete, Xuanlong Ma, Ngoc Nguyen Tran, Joanna Joiner, Jason Beringer, Derek Eamus, Qiang Yu. Spatial pattern and seasonal dynamics of the photosynthesis activity across Australian rainfed croplands. Ecological Indicators. 1-11 (105669). 31/08/2019
  9. Dinh Viet Sang, Jaesun Lee, OS Lee, Mingu Kang. An Approach to Detect Missing IRIS Plate in Smart Phone Lens Assembly. ICONI 2019. Hanoi. 16/12/2019
  10. Dinh Viet Sang, Duong Viet Hung. YOLOv3-VD: A sparse network for vehicle detection using variational dropout. SoICT 2019. 280-284. Hanoi - Ha Long Bay. 04/12/2019
  11. Dinh Viet Sang, Le Tran Bao Cuong. Improving CRNN with EfficientNet-like feature extractor and multi-head attention for text recognition. SoICT 2019. 285-290. Hanoi - Ha Long Bay. 04/12/2019
  12. Dinh Viet Sang, Phan Ngoc Lan. BK.Synapse: A scalable distributed training framework for deep learning. SoICT 2019. 43-48. Hanoi - Ha Long Bay. 04/12/2019
  13. Pham Cong Thang, Tran Thi Thu Thao, Phan Tran Dang Khoa, Dinh Viet Sang, Pham Minh Tuan, Nguyen Minh Hieu. An adaptive algorithm for restoring image corrupted by mixed noise. Cybernetics and Physics. 73–82. 31/10/2019