Mathematical Model

The FaceLog recognition pipeline relies on mathematical transformations that convert images into numerical representations.

These representations allow the system to compare faces using distance metrics.

Image Representation

An image is represented as a three-dimensional matrix:

X \in \mathbb{R}^{m \times n \times 3}

Where:

Face detection relies on convolution operations applied across the image.

S(i, j) = \sum_{m=0}^{k} \sum_{n=0}^{k} X(i + m, j + n) \cdot W(m, n)

Where:

This operation highlights visual patterns that indicate the presence of a face.

After detecting a face, the system generates an embedding vector using a neural network.

e = f_\theta(X)

Where:

Embeddings encode facial characteristics into a numerical representation.

To determine whether two embeddings represent the same person, the system computes the Euclidean distance:

d(x,y) = \sqrt{\sum_{i=1}^{n}(x_i - y_i)^2}

If the distance is sufficiently small, the embeddings are considered to represent the same individual.

note

Image Processing and Facial Recognition: - DeepFace v0.0.85: Library for facial analysis and recognition.