Skip to main content

Browser-Based Eye Tracking


State of the art

There are few known projects published for eye tracking, which we will not be looking further into. Instead, I would like to advice our readers to take a look on this articlewhere a few known gaze recognition projects are being compared. Two JavaScript libraries were used for the needs of this project. Face-api.js by Vincent Mühler was used to detect the face and extract some useful information on facial expression and furthermore recognise the position of our eyes, nose, jawline etc. OpenCV library, which is known for its strong aspects on python projects, luckily has been also reshaped as a JavaScript library (opencv.js), with a less friendlier and well-documented environment. Nevertheless, it provides adequate functions to process the frames of the eye. Last but not least, the project of Antoine Lamé GazeTracking help us dig in the world of iris recognition, so props to him.

Wink-scroll is based on the unpopular JavaScript implementation of OpenCV and face-api; a very cool JavaScript library for facial landmark recognition

How it works

An analysis of the library is presented here, in order to better understand how we are going to deduct the iris from our frame and realise how will the system help us recognise the winking motion. So, the wink-scroll library is implemented in the following order:
video.id = “NAME”;
video.width = WIDTH
video.height = HEIGHT
video.style.position = “absolute”;
video.style.top = DISTANCE_T + “px”;
video.style.left = DISTANCE_L + “px”;
  • Expressions
  • Landmarks
The 68 landmarks layout of human face is commonly used by face recognition apps
ctx.drawImage(
  video, start X, start Y // Frame and starting points
  disX, disY, // Area to crop
  0, 0, // Place the result at 0, 0 in the canvas
  canvas2.width, canvas2.height) // Canvas dimensions
let src = cv.imread(‘canvas2’); // read the cropped frame as an image
let dst = new cv.Mat(); // the destination is an empty Mat array
cv.cvtColor(src, src, cv.COLOR_RGBA2RGB, 0); // setting the colorspace to RGB
cv.bilateralFilter(src, dst, 10, 15, 15); // applying bilateral filter
cv.erode(dst, src, M, anchor, 3, cv.BORDER_CONSTANT, DEFAULT_BORDER_VALUE);
cv.threshold(src, dst, threshold, 255, cv.THRESH_BINARY);
The usage of the library in DEBUG mode where one can see how the image processing takes place

Final thoughts

Is this project something that has been created for the first time? No. However, we are using technologies that are commonly known in a very promising programming stack (HTML5, JavaScript), and we are willing to motivate and inspire other scientists to use it and implement it. Furthermore, our library provides the web browser with the right tools to enhance the experience of browsing for people with disabilities. Computer vision is a promising and evolving technological aspect, especially with the rise of Artificial Intelligence and Deep Neural Networks. Imagine all the endless possibilities in the fields of Security, Retail, Automotive, Healthcare, Agriculture, Banking, Industry, Education. Moreover, people with special needs will have a chance in the future to improve their quality of life. All that matters now is to keep introducing computer vision functionalities to our programming environments and let the scientists and programmers do their wonders, taking the computer logic and understanding of the surroundings to the next level.

Comments