Abstract
Accurate blink detection is crucial for applications such as eye tracking, driver drowsiness detection, brain-computer interfaces, the diagnosis of neurological disorders, and the study of visual behavior. Despite extensive research in the area, some current mobile eye-tracking devices still struggle to detect blinks reliably, which degrades their tracking accuracy. This research develops a deep learning method to reliably detect and classify blinks for use in mobile eye-tracking devices. The approach comprises a variational autoencoder (VAE) stage followed by a feature classifier. The VAE learns a compressed representation of the input image: an encoder network transforms input images into a low-dimensional latent space, and a decoder network reconstructs them from that space. After training, the encoder is used to map images into the latent space, and these latent representations (i.e., features) are fed into a classifier for blink detection. To characterize the proposed method, four VAE models (A, B, C, and D) of different complexity, three latent dimensions (2, 4, and 6), and four classifiers were trained. The findings suggest that higher-dimensional latent representations allow the VAE models to capture more of the features essential for blink detection, improving the discriminative power of classifiers operating on the latent representation. Among the classifiers, the K-Nearest Neighbours (KNN) algorithm achieved the highest accuracy. The VAE with five convolutional layers in the encoder (model B) and a latent dimension of 6 performed best, reaching an accuracy of 99.40%. The proposed method outperformed other methods tested on the same dataset, indicating the effectiveness of the VAE-based approach to blink detection.
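The classification stage described above can be illustrated with a minimal sketch: a K-Nearest Neighbours vote over low-dimensional latent vectors, here 6-dimensional as in the best-performing configuration. The data, cluster locations, and function names below are synthetic placeholders for illustration only, not the paper's dataset or implementation.

```python
import numpy as np

def knn_predict(train_z, train_y, query_z, k=5):
    """Majority vote among the k nearest latent vectors (Euclidean distance)."""
    dists = np.linalg.norm(train_z - query_z, axis=1)  # distance to every training vector
    nearest = np.argsort(dists)[:k]                    # indices of the k closest
    return np.bincount(train_y[nearest]).argmax()      # most frequent label wins

rng = np.random.default_rng(0)
# Synthetic 6-D latent clusters standing in for VAE-encoded eye images:
open_eyes = rng.normal(loc=0.0, scale=0.5, size=(50, 6))  # label 0: eye open
blinks = rng.normal(loc=2.0, scale=0.5, size=(50, 6))     # label 1: blink
train_z = np.vstack([open_eyes, blinks])
train_y = np.array([0] * 50 + [1] * 50)

query = rng.normal(loc=2.0, scale=0.5, size=6)  # a new sample near the "blink" cluster
print(knn_predict(train_z, train_y, query, k=5))
```

In the paper's pipeline, the latent vectors would come from the trained VAE encoder rather than a random generator; KNN then needs no further training beyond storing those encoded features.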