Unsupervised data labeling and incremental cross-domain training for enhanced hybrid eye gaze estimation

Authors: Alejandro García de la Santa Javier Muguerza David López Unai Elordi Hidalgo Luis Unzueta Irurtia Arantxa Villanueva

Date: 04.06.2024


Abstract

This paper aims to advance the fields of unsupervised data labeling and incremental cross-domain training techniques. We apply these innovative methods to develop a model tailored for the Augmentative and Alternative Communication (AAC) application domain, introducing a new perspective in hybrid eye Gaze Estimation (GE). These hybrid eye GE models combine the generalization strengths of appearance-based models with the scene understanding capabilities inherent in geometrical reconstruction. The use of open eye tracking datasets for the AAC domain introduces domain shift, while accurately labeling gaze vectors is challenging without specialized hardware for proper 3D dimensional reconstruction. We propose an approach to solve this challenges by conducting standardized unsupervised gaze vector labeling across multiple open GE datasets and subsequently performing incremental training to adapt to the target domain. Using a proprietary dataset we were able to reduce the gaze error from 4.87 degrees to 3.95 degrees, compared to a traditional single-step training.

BIB_text

@Article {
title = {Unsupervised data labeling and incremental cross-domain training for enhanced hybrid eye gaze estimation},
keywds = {
HCI, Cross-domain training, gaze estimation, hybrid gaze estimation, unsupervised labeling, AAC
}
abstract = {

This paper aims to advance the fields of unsupervised data labeling and incremental cross-domain training techniques. We apply these innovative methods to develop a model tailored for the Augmentative and Alternative Communication (AAC) application domain, introducing a new perspective in hybrid eye Gaze Estimation (GE). These hybrid eye GE models combine the generalization strengths of appearance-based models with the scene understanding capabilities inherent in geometrical reconstruction. The use of open eye tracking datasets for the AAC domain introduces domain shift, while accurately labeling gaze vectors is challenging without specialized hardware for proper 3D dimensional reconstruction. We propose an approach to solve this challenges by conducting standardized unsupervised gaze vector labeling across multiple open GE datasets and subsequently performing incremental training to adapt to the target domain. Using a proprietary dataset we were able to reduce the gaze error from 4.87 degrees to 3.95 degrees, compared to a traditional single-step training.


}
isbn = {979-8-4007-0607-3},
date = {2024-06-04},
}
Vicomtech

Parque Científico y Tecnológico de Gipuzkoa,
Paseo Mikeletegi 57,
20009 Donostia / San Sebastián (Spain)

+(34) 943 309 230

Zorrotzaurreko Erribera 2, Deusto,
48014 Bilbao (Spain)

close overlay

Behavioral advertising cookies are necessary to load this content

Accept behavioral advertising cookies