Version: 3.27 (latest)

Face detection

Overview

The face detection process in Face SDK consists of face detection and determination of face landmarks.

Face detection

Face detection in Face SDK is performed using a set of detectors. Detector is an algorithm of the libfacerec library that uses neural networks to detect faces in images. The result of the detector is the coordinates of a bounding rectangle (bbox) around the detected face.

Detectors

The following modifications are currently available:

Modification	Version	Face SDK version	Default parameters	Detection time CPU (ms)*
Modification	Version	Face SDK version	Default parameters	640x480	1280x720	1920x1080
uld	1	3.19	precision_level=1, confidence_threshold=0.7, coarse_confidence_threshold=0.3	7	7	8
			precision_level=2, confidence_threshold=0.7, coarse_confidence_threshold=0.3	37	38	40
			precision_level=3, confidence_threshold=0.7, coarse_confidence_threshold=0.3	194	187	197
ssyv	1	3.19	confidence_threshold=0.5, iou_threshold=0.5	151	150	152
	2			46	46	47
	3			96	94	96
	4	3.20		1517	1506	1502
ssyv_light	1	3.24		11	11	12
blf_front	1	3.19	confidence_threshold=0.67, iou_threshold=0.5	3	5	9
blf_back	1	3.19	confidence_threshold=0.67, iou_threshold=0.5	11	13	18

* - CPU Intel Xeon E5-2683 v4 (single-core)

note

"ssyv" is modification by default.

Detector parameters

confidence_threshold is detection confidence threshold.
iou_threshold determines whether two bboxes refer to the same face. For example, with a threshold of 0.5, two bboxes with an IOU greater than 0.5 are considered to belong to the same face.
min_size filters out all faces that are smaller than the specified value. It can accept both relative values [0, 1] and absolute values.
max_size filters out all faces that are larger than the specified value. It can accept both relative values [0, 1] and absolute values.
coarse_confidence_threshold(uld only) is coarse detection confidence threshold. During the detection the detector creates a set of bboxes, for each of them the confidence value is specified (number from 0 to 1, shows the degree of confidence that there is a face in the bbox). The bboxes with confidence_thresholds are fed to the nms algorithm, which determines the intersections (matches) between the bboxes. The coarse_confidence_threshold parameter allows to cut off bboxes with low confidence, which reduces the number of calculations performed by the nms-algorithm.
precision_level(uld only) defines the level of precision. The value is from the range [1, 3], the higher the value the higher the accuracy and the lower the speed. The default value is 1.

Examples of detectors operation in different conditions are presented below. You can customize detection thresholds (confidence_threshold) and other parameters when creating a processing block.

Click here to expand the table

BLF (confidence_threshold=0.6)	ULD (precision_level=3, confidence_threshold=0.7)

Click here to expand the table

ULD (confidence_threshold=0.4)	ULD (confidence_threshold=0.7)

Face detector specification

Input
Output

The input Context must contain an image in binary format

{
    "image" : {
        "format": "NDARRAY",
        "blob": "data pointer",
        "dtype": "uint8_t",
        "shape": [height, width, channels]
    }
}

Once the processing block is running, an array of objects will be added, each containing the coordinates of the bounding rectangle, the detection confidence, the class and the identifier in that array

{
    "image" : {},
    "objects": [{
        "id": {"type": "long", "minimum": 0},
        "class": "face",
        "confidence": {"double",  "minimum": 0,  "maximum": 1},
        "bbox": [x1, y2, x2, y2]
    }]
}

Face landmarks

There are three sets of face landmarks: fda, mesh, tddfa.

The fda set contains 21 facial points.
The tddfa set contains 68 facial points.
The mesh set contains 470 3D facial points. We recommend to use it to get a 3D face mask.

The following modifications are currently available:

Modification	Version	Face SDK version	Detection time CPU (ms)*	Detection time GPU (ms)**
fda	1	3.23	3	3
tddfa_faster	1	3.19	2	1
tddfa	1	3.19	6	2
mesh	1	3.19	6	3

* - CPU Intel Xeon E5-2683 v4 (single-core)
** - GPU (NVIDIA GTX 10xx series)

note

The default modification is fda.

Fitter specification

Input
Output

The input Context must contain an image in binary format and objects array from Face Detector.

{
    "image" : {
    "format": "NDARRAY",
        "blob": "data pointer",
        "dtype": "uint8_t",
        "shape": [height, width, channels]
    },
    "objects": [{
        "id": {"type": "long", "minimum": 0},
        "class": "face",
        "confidence": {"double",  "minimum": 0,  "maximum": 1},
        "bbox": [x1, y2, x2, y2]
    }]
}

After running the processing block, each object will be added: 21 key face points, points from fda, tddfa or mesh set.

{
    "keypoints": {
        "left_eye_brow_left":   {"proj" : [x, y]},
        "left_eye_brow_up":     {"proj" : [x, y]},
        "left_eye_brow_right":  {"proj" : [x, y]},
        "right_eye_brow_left":  {"proj" : [x, y]},
        "right_eye_brow_up":    {"proj" : [x, y]},
        "right_eye_brow_right": {"proj" : [x, y]},
        "left_eye_left":        {"proj" : [x, y]},
        "left_eye":             {"proj" : [x, y]},
        "left_eye_right":       {"proj" : [x, y]},
        "right_eye_left":       {"proj" : [x, y]},
        "right_eye":            {"proj" : [x, y]},
        "right_eye_right":      {"proj" : [x, y]},
        "left_ear_bottom":      {"proj" : [x, y]},
        "nose_left":            {"proj" : [x, y]},
        "nose":                 {"proj" : [x, y]},
        "nose_right":           {"proj" : [x, y]},
        "right_ear_bottom":     {"proj" : [x, y]},
        "mouth_left":           {"proj" : [x, y]},
        "mouth":                {"proj" : [x, y]},
        "mouth_right":          {"proj" : [x, y]},
        "chin":                 {"proj" : [x, y]},
        "points": ["proj": [x, y]]
    },
    "pose": {
        "pitch": {"type": "double"},
        "roll": {"type": "double"},
        "yaw": {"type": "double"}
    }
}

Example of face detection and estimation of face landmarks

Create Processing Blocks

Create a detector and fitter processing block object using the FacerecService.createProcessingBlock method, passing a Context container with set parameters as an argument.

auto detectorConfigCtx = service->createContext();
detectorConfigCtx["unit_type"] = "FACE_DETECTOR";
detectorConfigCtx["modification"] = "ssyv";

auto fitterConfigCtx = service->createContext();
fitterConfigCtx["unit_type"] = "FACE_FITTER";

pbio::ProcessingBlock faceDetector = service->createProcessingBlock(detectorConfigCtx);
pbio::ProcessingBlock faceFitter = service->createProcessingBlock(fitterConfigCtx);

detectorConfigCtx = {
    "unit_type": "FACE_DETECTOR",
    "modification": "ssyv",
}

fitterConfigCtx = {
    "unit_type": "FACE_FITTER"
}

faceDetector = service.create_processing_block(detectorConfigCtx)
faceFitter = service.create_processing_block(fitterConfigCtx)

Map<String, dynamic> configCtx = {
    "unit_type": "FACE_DETECTOR",
    "modification": "ssyv",
};

ProcessingBlock faceDetector = service.createProcessingBlock(configCtx);

Dictionary<object, object> detectorConfigCtx = new();
detectorConfigCtx["unit_type"] = "FACE_DETECTOR";
detectorConfigCtx["modification"] = "ssyv";

Dictionary<object, object> fitterConfigCtx = new();
fitterConfigCtx["unit_type"] = "FACE_FITTER";

ProcessingBlock faceDetector = service.CreateProcessingBlock(detectorConfigCtx);
ProcessingBlock faceFitter = service.CreateProcessingBlock(fitterConfigCtx);

Context detectorConfigCtx = service.createContext();
detectorConfigCtx.get("unit_type").setString("FACE_DETECTOR");
detectorConfigCtx.get("modification").setString("ssyv");

Context fitterConfigCtx = service.createContext();
fitterConfigCtx.get("unit_type").setString("FACE_FITTER");

ProcessingBlock faceDetector = service.createProcessingBlock(detectorConfigCtx);
ProcessingBlock faceFitter = service.createProcessingBlock(fitterConfigCtx);

val detectorConfigCtx = service.createContext()
detectorConfigCtx["unit_type"].string = "FACE_DETECTOR"
detectorConfigCtx["modification"].string = "ssyv"

val fitterConfigCtx = service.createContext();
fitterConfigCtx["unit_type"].string = "FACE_FITTER"

val faceDetector = service.createProcessingBlock(detectorConfigCtx)
val faceFitter = service.createProcessingBlock(fitterConfigCtx)

const detectorConfigCtx = new facerec.Context();
detectorConfigCtx.get("unit_type").value = "FACE_DETECTOR";
detectorConfigCtx.get("modification").value = "ssyv";

const fitterConfigCtx = new facerec.Context();
fitterConfigCtx.get("unit_type").value = "FACE_FITTER";

const faceDetector = new facerec.ProcessingBlock(detectorConfigCtx);
const faceFitter = new facerec.ProcessingBlock(fitterConfigCtx);

detectorConfigContext, err := facesdk.CreateContext()
context, err := detectorConfigContext.GetOrInsertByKey("unit_type")
err = context.SetString("FACE_DETECTOR")
context, err = detectorConfigContext.GetOrInsertByKey("modification")
err = context.SetString("ssyv")
defer detectorConfigContext.Close()

fitterConfigContext, err := facesdk.CreateContext()
context, err = fitterConfigContext.GetOrInsertByKey("unit_type")
err = context.SetString("FACE_FITTER")
defer fitterConfigContext.Close()

faceDetector, err := service.CreateProcessingBlock(detectorConfigContext)
faceFitter, err := service.CreateProcessingBlock(fitterConfigContext)

defer faceDetector.Close()
defer faceFitter.Close()

Processing Block configurable parameters

Run face detection

Pass the Context with a binary image into the Detector Processing Block:

ioData["image"] = imgCtx;
faceDetector(ioData);

ioData["image"] = imageCtx
faceDetector(ioData)

ioData["image"].placeValues(imageContext);
faceDetector.process(ioData);

ioData["image"] = imgCtx;
faceDetector.Invoke(ioData);

ioData.get("image").setContext(imgCtx);
faceDetector.process(ioData);

ioData["image"].setContext(imgCtx)
faceDetector.process(ioData)

ioData.get("image").value = imgCtx;
faceDetector.process(ioData);

context, err := ioData.GetByKey("image")
err = context.Copy(imgCtx)

err = faceDetector.Process(ioData)

The result of face detection is stored by the passed Context container according to the specification of the Processing Block.

Run fitting of face landmarks

Pass the Context container received after the Face Detector:

faceFitter(ioData);

faceFitter(ioData)

faceFitter.process(ioData);

faceFitter.Invoke(ioData);

faceFitter.process(ioData);

faceFitter.process(ioData)

faceFitter.process(ioData);

err = faceFitter.Process(ioData)

The resulting Context can be passed to methods for estimating the age, gender, quality, and Liveness (Face Estimation) and to Recognizer.processing to create a template (See Face Recognition).

Detector cascade

In cases where it is necessary to detect faces from different domains in one location, such as selfies, photos from access control cameras, and identification documents, it can be challenging to ensure high quality.

To address this issue, you can use the cascade modification. This modification will sequentially run the detectors that were added during the initialization of the processing block.

Module setting

To add detectors to the cascade, use the face_detectors parameter. This parameter specifies the configurations for the detectors.

Cascade configuration example

{
    "unit_type": "FACE_DETECTOR",
    "modification": "cascade",
    "face_detectors": [
        {
            "unit_type": "FACE_DETECTOR",
            "modification": "ssyv",
            "version": 3
        },
        {
            "unit_type": "FACE_DETECTOR",
            "modification": "uld",
            "precision_level": 2
        },
    ],
}

auto detectorConfigCtx1 = service->createContext();
detectorConfigCtx1["unit_type"] = "FACE_DETECTOR";
detectorConfigCtx1["modification"] = "ssyv";
detectorConfigCtx1["version"] = 3;

auto detectorConfigCtx2 = service->createContext();
detectorConfigCtx2["unit_type"] = "FACE_DETECTOR";
detectorConfigCtx2["modification"] = "uld";
detectorConfigCtx2["precision_level"] = 2;

auto detectorConfigCtx = service->createContext();
detectorConfigCtx["unit_type"] = "FACE_DETECTOR";
detectorConfigCtx["modification"] = "cascade";
detectorConfigCtx["version"] = 3;

detectorConfigCtx["face_detectors"].push_back(detectorConfigCtx1);
detectorConfigCtx["face_detectors"].push_back(detectorConfigCtx2);

auto faceDetector = service.create_processing_block(detectorConfigCtx);

detectorConfigCtx = {
    "unit_type": "FACE_DETECTOR",
    "modification": "cascade",
    "face_detectors": [
        {
            "unit_type": "FACE_DETECTOR",
            "modification": "ssyv",
            "version": 3
        },
        {
            "unit_type": "FACE_DETECTOR",
            "modification": "uld",
            "precision_level": 2
        },
    ],
}

faceDetector = service.create_processing_block(detectorConfigCtx)

Map<String, dynamic> configCtx = {
    "unit_type": "FACE_DETECTOR",
    "modification": "cascade",
    "face_detectors": [
        {
            "unit_type": "FACE_DETECTOR",
            "modification": "ssyv",
            "version": 3
        },
        {
            "unit_type": "FACE_DETECTOR",
            "modification": "uld",
            "precision_level": 2
        },
    ],
};

ProcessingBlock faceDetector = service.createProcessingBlock(configCtx);

Dictionary<object, object> detectorConfigCtx = new()
{
    ["unit_type"] = "FACE_DETECTOR",
    ["modification"] = "cascade",
    ["face_detectors"] = new List<object>
    {
        new Dictionary<object, object>
        {
            ["unit_type"] = "FACE_DETECTOR",
            ["modification"] = "ssyv",
            ["version"] = 3
        },
        new Dictionary<object, object>
        {
            ["unit_type"] = "FACE_DETECTOR",
            ["modification"] = "uld",
            ["precision_level"] = 2
        }
    }
};

ProcessingBlock faceDetector = service.CreateProcessingBlock(detectorConfigCtx);

Context detectorConfigCtx1 = service.createContext();
detectorConfigCtx1.get("unit_type").setString("FACE_DETECTOR");
detectorConfigCtx1.get("modification").setString("ssyv");
detectorConfigCtx1.get("version").setLong(3);

Context detectorConfigCtx2 = service.createContext();
detectorConfigCtx2.get("unit_type").setString("FACE_DETECTOR");
detectorConfigCtx2.get("modification").setString("uld");
detectorConfigCtx2.get("precision_level").setLong(2);

Context detectorConfigCtx = service.createContext();
detectorConfigCtx.get("unit_type").setString("FACE_DETECTOR");
detectorConfigCtx.get("modification").setString("cascade");
detectorConfigCtx.get("face_detectors").pushBack(detectorConfigCtx1);
detectorConfigCtx.get("face_detectors").pushBack(detectorConfigCtx2);
ProcessingBlock faceDetector = service.createProcessingBlock(detectorConfigCtx);

val detectorConfigCtx1 = service.createContext()
detectorConfigCtx1["unit_type"].string = "FACE_DETECTOR"
detectorConfigCtx1["modification"].string = "ssyv"
detectorConfigCtx1["version"].long = 3

val detectorConfigCtx2 = service.createContext()
detectorConfigCtx2["unit_type"].string = "FACE_DETECTOR"
detectorConfigCtx2["modification"].string = "uld"
detectorConfigCtx2["precision_level"].long = 2

val detectorConfigCtx = service.createContext()
detectorConfigCtx["unit_type"].string = "FACE_DETECTOR"
detectorConfigCtx["modification"].string = "cascade"
detectorConfigCtx.pushBack(detectorConfigCtx1)
detectorConfigCtx.pushBack(detectorConfigCtx2)
val faceDetector = service.createProcessingBlock(detectorConfigCtx)

let detectorConfigCtx1 = new facerec.Context();
detectorConfigCtx1.get("unit_type").value = "FACE_DETECTOR";
detectorConfigCtx1.get("modification").value = "ssyv";
detectorConfigCtx1.get("version").value = 3;

let detectorConfigCtx2 = new facerec.Context();
detectorConfigCtx2.get("unit_type").value = "FACE_DETECTOR";
detectorConfigCtx2.get("modification").value = "uld";
detectorConfigCtx2.get("precision_level").value = 2;

let detectorConfigCtx = new facerec.Context();
detectorConfigCtx.get("unit_type").value = "FACE_DETECTOR";
detectorConfigCtx.get("modification").value = "cascade";
detectorConfigCtx.get("face_detectors").push(detectorConfigCtx1);
detectorConfigCtx.get("face_detectors").push(detectorConfigCtx2);
const faceDetector = new facerec.ProcessingBlock(detectorConfigCtx);

detectorConfig1 := facesdk.CreateContext()
context, err := detectorConfig1.GetOrInsertByKey("unit_type")
err = context.SetString("FACE_DETECTOR")
context, err = detectorConfig1.GetOrInsertByKey("modification")
err = context.SetString("ssyv")
context, err = detectorConfig1.GetOrInsertByKey("version")
err = context.SetInt(3)
defer detectorConfig1.Close()

detectorConfig2 := facesdk.CreateContext()
context, err = detectorConfig2.GetOrInsertByKey("unit_type")
err = context.SetString("FACE_DETECTOR")
context, err = detectorConfig2.GetOrInsertByKey("modification")
err = context.SetString("uld")
context, err = detectorConfig2.GetOrInsertByKey("precision_level")
err = context.SetInt(2)
defer detectorConfig2.Close()

detectorConfig := facesdk.CreateContext()
context, err = detectorConfig.GetOrInsertByKey("unit_type")
err = context.SetString("FACE_DETECTOR")
context, err = detectorConfig.GetOrInsertByKey("modification")
err = context.SetString("cascade")
context, err = detectorConfig.GetOrInsertByKey("face_detectors")
err = context.PushBack(detectorConfig1)
err = context.PushBack(detectorConfig2)
defer detectorConfig.Close()

faceDetector, err := service.CreateProcessingBlock(detectorConfig)
defer faceDetector.Close()

Overview​

Face detection​

Detectors​

Detector parameters​

Face detector specification​

Face landmarks​

Fitter specification​

Example of face detection and estimation of face landmarks​

Create Processing Blocks​

Run face detection​

Run fitting of face landmarks​

Detector cascade​

Module setting​

Cascade configuration example​

Overview

Face detection

Detectors

Detector parameters

Face detector specification

Face landmarks

Fitter specification

Example of face detection and estimation of face landmarks

Create Processing Blocks

Run face detection

Run fitting of face landmarks

Detector cascade

Module setting

Cascade configuration example