DROZY: The ULg Multimodality Drowsiness Database

Description

The "ULg Multimodality Drowsiness Database", also called DROZY, is a database containing various types of drowsiness-related data (signal, images, etc.) and intended to help researchers to carry out experiments, and to develop and evaluate systems (i.e. algorithms), in the area of drowsiness monitoring.

The (multimodality) data were collected by the Laboratory for Signal and Image Exploitation (INTELSIG), which is part of the Department of Electrical Engineering and Computer Science of the University of Liège (ULg), Liège, Belgium.

! Update !

As of July 2018, the DROZY database is available for download on the Open Repository and Bibliography (ORBi) of the ULg, instead of on a dedicated server. For storage reasons, we excluded from DROZY the raw near-infrared intensity and range images. However, the encoded videos of the intensity images are still included. Furthermore, given that some frames are missing, we added the interpIndices folder (see below).

Missing tests

Because of backup incidents, some of the data are lost (tests 9-1, 10-2, 12-2, 12-3, 13-3). Test 7-1 did not happen.

Content overview

More formatting details follow below. Check our website, read our paper(s), or contact us to get other informations.


Intrinsics parameters

The kinect-intrinsics.yaml file contains the intrinsics matrix (intrinsics node) and the distorsion parameters (k and p nodes). This file can be parsed easily with the cv.FileStorage class of OpenCV.

KSS

The file KSS.txt has values ranging from 1 to 9 (i.e. KSS score range) inside 14 lines (1 per subject) and 3 columns (1 per test). Test 7-1 has been arbitrarily given a KSS score of 0, since there is no recorded value.

Facial shape annotations

Manual (subject-level)

The folder annotations-manual contains the folders SUBJECT. The folders SUBJECT contains the files annot-timestamps.txt, annot-s2.txt, and annot-s3.txtand the folders depth, and nir.

Automatic (frame-level)

The folder annotations-auto contains the files SUBJECT-TEST-s2.txt and SUBJECT-TEST-s3.txt, containing respectively the 2D and 3D automatic annotations of the 68 face landmarks for all frames.

Face images

The intensity and range images have both a resolution of 512x424 and 1 channel. Data are stored as 16-bit unsigned integers in .png image format. Intensity and range images are perfectly aligned since they are derived from the same sensor. The framerate is at 30 fps. Beware, some tests (tests 2 and 3 of subjects 1->8) are at 15 fps because of a recording bug occurring in darkness.

Near-infrared intensity images

All images of a same test are grouped in a tarball (named SUBJECT-TEST.tar) in the images_nir16 folder.

Range images

All images of a same test are grouped in a tarball (named SUBJECT-TEST.tar) in the images_depth16 folder. Each value represents the distance in millimeters.

Encoded videos

The folder videos_i8 contains the encoded (intensity) videos SUBJECT-TEST.mp4 (1 video per test). Data are stored as 8-bit unsigned integers.

Timestamps

The folder timestamps contains the files SUBJECT-TEST.txt (1 file per test). The file SUBJECT-TEST.txt contains, for each frame (1 line per frame), the parsed timestamps and the elapsed time since the beginning of the test (in milliseconds). Each line has the following format : [year, month, day, hours, minutes, seconds, milliseconds, elapsedtime].

PSG

The folder psg contains the files SUBJECT-TEST.edf (EDF = European Data Format). EDF parsers can be found online. Data are recorded at a framerate of 512 Hz. Here's a list of all channels :