FROG: A new people detection dataset for knee-high 2D range finders

Fernando Amodeo, Noé Perez-Higueras, Luis Merino, Fernando Caballero
Service Robotics Lab, Universidad Pablo de Olavide, Seville, Spain

Resources

You can read the preprint on arXiv.

FROG Benchmark on GitHub

Cite us as:

@misc{frog2023,
	author = {Fernando Amodeo and Noé Pérez-Higueras and Luis Merino and Fernando Caballero},
	title = {FROG: A new people detection dataset for knee-high 2D range finders},
	year = {2023},
	eprint = {arXiv:2306.08531},
}

Abstract

Mobile robots require knowledge of the environment, especially of humans located in its vicinity. While the most common approaches for detecting humans involve computer vision, an often overlooked hardware feature of robots for people detection are their 2D range finders. These were originally intended for obstacle avoidance and mapping/SLAM tasks. In most robots, they are conveniently located at a height approximately between the ankle and the knee, so they can be used for detecting people too, and with a larger field of view and depth resolution compared to cameras.

In this paper, we present a new dataset for people detection using knee-high 2D range finders called FROG. This dataset has greater laser resolution, scanning frequency, and more complete annotation data compared to existing datasets such as DROW. Particularly, the FROG dataset contains annotations for 100% of its laser scans (unlike DROW which only annotates 5%), 17x more annotated scans, 100x more people annotations, and over twice the distance traveled by the robot. We propose a benchmark based on the FROG dataset, and analyze a collection of state-of-the-art people detectors based on 2D range finder data.

We also propose and evaluate a new end-to-end deep learning approach for people detection. Our solution works with the raw sensor data directly (not needing hand-crafted input data features), thus avoiding CPU preprocessing and releasing the developer of understanding specific domain heuristics. Experimental results show how the proposed people detector attains results comparable to the state of the art, while an optimized implementation for ROS can operate at more than 500 Hz.

Downloads

The data is provided in HDF5 format, containing several arrays:

scans: Laser scan data (N, 720).
timestamps: Timestamps associated with each laser scan (N).
circles: Person annotations (M, 6): x, y, radius, distance, angle, angular radius.
circle_idx and circle_num: Mapping between each scan and its corresponding slice of the circles array (N).
split: (Only present in training/validation set) Split associated to each scan (M; 0=training, 1=testing).

A full detailed description of the dataset format can be found in our paper. We also make available raw unaligned odometry data from the robot for each recorded sequence. Finally, the original raw ROS bag files from which this dataset was created are available. They contain data for other sensors present in the robot platform and which are not considered in this work, such as the cameras and back/tilted lasers.

Training/validation set (11:36 and 12:43) HDF5 Odom [1/2] Odom [2/2] Testing set (16:41) HDF5 Odom 10:31 HDF5 Odom 14:57 HDF5 Odom 15:53 HDF5 Odom

Leaderboards

The current best model for FROG is: DR-SPAAM (T = 5)

All model weights are available for download. If you have a new people detector model, please send us an email with a link to your paper and a GitHub repository so that we can include it here!

	ad = 0.5m			ad = 0.3m
	AP	Peak F₁	EER	AP	Peak F₁	EER
PeTra	47.9	66.6	66.4	47.5	66.1	65.9
PeTra*	59.1	67.9	67.3	58.7	67.4	66.8
LFE-Peaks (ours)	62.4	70.7	70.7	61.4	69.0	69.0
LFE-PPN (ours)	65.6	68.4	68.2	59.2	65.8	65.4
DROW3 (T = 1)	72.3	72.0	71.8	71.8	71.6	71.4
DR-SPAAM (T = 1)	72.2	72.1	71.8	71.6	71.6	71.3
DR-SPAAM (T = 5)	74.2	73.6	73.4	73.7	73.2	73.0

Acknowledgements

This work is partially funded by the Programa Operativo FEDER Andalucía 2014-2020, Consejería de Economía, Conocimiento y Universidades (DeepBot, PY20_00817) and by the projects NHoA PLEC2021-007868 and NORDIC TED2021-132476B-I00, funded by MCIN/AEI/10.13039/501100011033 and the European Union “NextGenerationEU”/“PRTR”.