Gaze attention estimation aims to determine where each person in a scene is looking. In this study, we introduce a new annotated dataset derived from medical simulation training videos, capturing diverse and authentic clinical scenarios in a practical medical environment. The annotations are grounded in eye-tracking data (iMotions [1]) recorded from devices worn by medical professionals during the procedures shown in the scenes. Most existing gaze prediction approaches rely on object detection to guide the model. This becomes problematic in medical environments, where specialised tools and equipment are typically absent from standard large-scale object detection and segmentation datasets. To address this problem, we propose a gaze prediction framework that integrates head pose information, consisting of pitch, yaw, and roll, enabling the model to rely on gaze direction even when objects are not detected. The framework builds on the self-attention mechanism of vision transformers, which we expect to strengthen the model's ability to relate gaze direction to the surrounding scene. We hope this offers a more reliable framework for real-world medical applications.
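
To illustrate the kind of fusion described above, the following is a minimal PyTorch sketch, not the paper's actual implementation: it assumes a hypothetical `HeadPoseGazeNet` module in which the head pose vector (pitch, yaw, roll) is projected into an extra token that attends to scene patch tokens through transformer self-attention, and a 2D gaze target is regressed from that token. All module names, dimensions, and the output parameterisation are illustrative assumptions.

```python
import torch
import torch.nn as nn


class HeadPoseGazeNet(nn.Module):
    """Hypothetical sketch: fuse head pose (pitch, yaw, roll) with scene
    patch tokens via transformer self-attention, then regress a 2D gaze point."""

    def __init__(self, img_size=224, patch_size=16, dim=256, depth=4, heads=8):
        super().__init__()
        num_patches = (img_size // patch_size) ** 2
        # Patch embedding: split the scene image into non-overlapping patches.
        self.patch_embed = nn.Conv2d(3, dim, kernel_size=patch_size, stride=patch_size)
        self.pos_embed = nn.Parameter(torch.zeros(1, num_patches + 1, dim))
        # Head pose (pitch, yaw, roll) projected into the same token space.
        self.pose_embed = nn.Sequential(nn.Linear(3, dim), nn.GELU(), nn.Linear(dim, dim))
        encoder_layer = nn.TransformerEncoderLayer(d_model=dim, nhead=heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(encoder_layer, num_layers=depth)
        # Regress a normalised (x, y) gaze target from the pose token.
        self.gaze_head = nn.Linear(dim, 2)

    def forward(self, scene, head_pose):
        # scene: (B, 3, H, W); head_pose: (B, 3) angles (pitch, yaw, roll)
        tokens = self.patch_embed(scene).flatten(2).transpose(1, 2)  # (B, N, dim)
        pose_token = self.pose_embed(head_pose).unsqueeze(1)         # (B, 1, dim)
        x = torch.cat([pose_token, tokens], dim=1) + self.pos_embed
        # Self-attention lets the pose token attend to scene patches,
        # so a gaze estimate remains available even when no object is detected.
        x = self.encoder(x)
        return torch.sigmoid(self.gaze_head(x[:, 0]))                # (B, 2) in [0, 1]


if __name__ == "__main__":
    model = HeadPoseGazeNet()
    scene = torch.randn(2, 3, 224, 224)   # dummy scene frames
    pose = torch.randn(2, 3)              # dummy head pose angles
    print(model(scene, pose).shape)       # torch.Size([2, 2])
```

Under these assumptions, the pose token acts as a query over the scene, so the prediction degrades gracefully when detectable objects are missing, which is the behaviour the proposed framework targets.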
