Databases of Facial Age Estimation Benchmark

arXiv: A Call to Reflect on Evaluation Practices for Age Estimation: Comparative Analysis of the State-of-the-Art and a Unified Benchmark

This document explains how to replicate the data splits used in the paper: A Call to Reflect on Evaluation Practices for Age Estimation: Comparative Analysis of the State-of-the-Art and a Unified Benchmark.

This repository is a submodule of "Facial-Age-Estimation-Benchmark".

Important: Due to limitations of git LFS, we recommend downloading the files from https://paplham.cloud/index.php/s/KSkaSPF7Btay2cb

Prerequisites

In order to replicate our data splits, we need to understand the structure of two file types, databases and benchmarks.

A database file define a unified representation of a dataset. A benchmark file defines how to use the samples from (possibly multiple) databases. I.e., benchmark defines which data are used for training, which are used for validation and for testing.

The database file is a list of entries, where each entry corresponds to one data sample. For example, take a look at an entry in a database of the AFAD dataset:

{
        "img_path": "AFAD/AFAD-Full/15/111/638660-1.jpg",
        "id_num": 638660,
        "age": 15,
        "gender": "M",
        "database": "AFAD-Full",
        "folder": 0,
        "aligned_bbox": [
            -57,
            -95,
            372,
            -83,
            360,
            346,
            -69,
            334
        ],
        "alignment_source": "in the wild"
    }

We can see that the entry defines multiple attributes. For us, the important attributes are:

img_path: Path to the original, raw, image from the dataset.
age: The ground-truth age of the sample, i.e., the label y.
folder: Assignment of the sample to an imaginary folder. We use these folders to define training, validation and test parts. E.g., folders 0, 1 and 2 might correspond to the training part, which folder 3 corresponds to the validation part. These assignments are given to the sample during the creation of the database file. For each unique person, we always assign all of their samples to a single folder (subject-exclusive splitting) to avoid information leakage between training, validation and testing parts.
aligned_bbox: For tasks such as age estimation, we do not want to use the entirety of the image as input to our model. Instead, we want to extract a part of the image corresponding to the face of the person. To this end, we locate the face and normalize its position by an aligning bounding box. This bounding box (or possibly, its extension) should be cropped out to be used as the data sample x.

Data Splits

To separate data into training, validation and testing parts, we use the folder attribute of the entry in a database file.

For datasets that define a single training, validation and test split (CLAP2016 and CACD2000), we use the following data splits by folder.

  - trn: [0]
    val: [1]
    tst: [2]

For the remaining datasets, we define a total of 10 folders (0-9). See scripts in facebase/to_json/ for more information.

We then define a total of 5 subject exlusive splits as follows:

Split 0

  - trn: [0,1,2,3,4,5]
    val: [6,7]
    tst: [8,9]

Split 1

  - trn: [2,3,4,5,6,7]
    val: [8,9]
    tst: [0,1]

Split 2

  - trn: [4,5,6,7,8,9]
    val: [0,1]
    tst: [2,3]

Split 3

  - trn: [5,6,7,8,9,0]
    val: [1,2]
    tst: [3,4]

Split 4

  - trn: [6,7,8,9,0,1]
    val: [2,3]
    tst: [4,5]

JSON origin

file	origin
AFAD-Full.json	created by `facebase/to_json/afad_to_json.py`
AgeDB.json	created by `facebase/to_json/agedb_to_json`
CACD2000.json	created by `facebase/to_json/cacd_to_json.py`
CLAP2016.json	created by `facebase/to_json/clap2016_to_json.py`
FG-Net.json	created by `facebase/to_json/fgnet_to_json.py`
IMDB-CLEAN-FP-AGE.json	created by `facebase/to_json/imdb_fp_age_to_json.py`
IMDB-EM-CNN.json	created by `facebase/to_json/imdb_to_json.py`
MORPH.json	created by `facebase/to_json/morph_to_json.py`
UTKFace.json	created by `facebase/to_json/utkface_to_json.py`
AFAD-Full_aligned.json	created by `prepare_alignment alignment_config.yaml facebase/benchmarks/databases/AFAD-Full.json`
AgeDB_aligned.json	created by `prepare_alignment alignment_config.yaml facebase/benchmarks/databases/AgeDB.json`
CACD2000_aligned.json	created by `prepare_alignment alignment_config.yaml facebase/benchmarks/databases/CACD2000.json`
CLAP2016_aligned.json	created by `prepare_alignment alignment_config.yaml facebase/benchmarks/databases/CLAP2016.json`
FG-Net_aligned.json	created by `prepare_alignment alignment_config.yaml facebase/benchmarks/databases/FG-Net.json`
IMDB-CLEAN-FP-AGE_aligned.json	created by `prepare_alignment alignment_config.yaml facebase/benchmarks/databases/IMDB-CLEAN-FP-AGE.json`
IMDB-EM-CNN_aligned.json	created by `prepare_alignment alignment_config.yaml facebase/benchmarks/databases/IMDB-EM-CNN.json`
MORPH_aligned.json	created by `prepare_alignment alignment_config.yaml facebase/benchmarks/databases/MORPH.json`
UTKFace_aligned.json	created by `prepare_alignment alignment_config.yaml facebase/benchmarks/databases/UTKFace.json`

Conventions used in the database JSON files

Age

The attribute age denotes the number of completed years (revolutions of the subject around the sun). I.e. the values are 0, 1, 2, ..., max_age. A special value -1 means unknown age.

Gender

The attribute gender denotes the biological gender as defined by the source datasets: 'M' stand for male; 'F' stands for female; 'U' means unknown gender.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Databases of Facial Age Estimation Benchmark

Prerequisites

Data Splits

Split 0

Split 1

Split 2

Split 3

Split 4

JSON origin

Conventions used in the database JSON files

Age

Gender

About

Releases

Packages

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.gitattributes		.gitattributes
AFAD-Full.json		AFAD-Full.json
AFAD-Full_aligned.json		AFAD-Full_aligned.json
AgeDB.json		AgeDB.json
AgeDB_aligned.json		AgeDB_aligned.json
CACD2000.json		CACD2000.json
CACD2000_aligned.json		CACD2000_aligned.json
CLAP2016.json		CLAP2016.json
CLAP2016_aligned.json		CLAP2016_aligned.json
FG-Net.json		FG-Net.json
FG-Net_aligned.json		FG-Net_aligned.json
IMDB-CLEAN-FP-AGE.json		IMDB-CLEAN-FP-AGE.json
IMDB-CLEAN-FP-AGE_aligned.json		IMDB-CLEAN-FP-AGE_aligned.json
IMDB-EM-CNN.json		IMDB-EM-CNN.json
IMDB-EM-CNN_aligned.json		IMDB-EM-CNN_aligned.json
MORPH.json		MORPH.json
MORPH_aligned.json		MORPH_aligned.json
README.md		README.md
UTKFace.json		UTKFace.json
UTKFace_aligned.json		UTKFace_aligned.json

paplhjak/Facial-Age-Estimation-Benchmark-Databases

Folders and files

Latest commit

History

Repository files navigation

Databases of Facial Age Estimation Benchmark

Prerequisites

Data Splits

Split 0

Split 1

Split 2

Split 3

Split 4

JSON origin

Conventions used in the database JSON files

Age

Gender

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages