BirdNet+: two-stage 3D object detection in LiDAR through a sparsity-invariant bird's eye view

authors

BARRERA DEL POZO, ALEJANDRO
BELTRAN DE LA CITA, JORGE
GUINDEL GOMEZ, CARLOS
IGLESIAS MARTINEZ, JOSE ANTONIO
GARCIA FERNANDEZ, FERNANDO

published in

IEEE Access Journal

publication date

November 2021

start page

160299

end page

160316

volume

9

Digital Object Identifier (DOI)

https://doi.org/10.1109/access.2021.3131389

full text

http://hdl.handle.net/10016/36461

Electronic International Standard Serial Number (EISSN)

2169-3536

abstract

Autonomous navigation relies upon an accurate understanding of the elements in the surroundings. Among the different on-board perception tasks, 3D object detection allows the identification of dynamic objects that cannot be registered by maps, being key for safe navigation. Thus, it often requires the use of LiDAR data, which is able to faithfully represent the scene geometry. However, although raw laser point clouds contain rich features to perform object detection, more compact representations such as the bird's eye view (BEV) projection are usually preferred in order to meet the time requirements of the control loop. This paper presents an end-to-end object detection network based on the well-known Faster R-CNN architecture that uses BEV images as input to produce the final 3D boxes. Our regression branches can infer not only the axis-aligned bounding boxes but also the rotation angle, height, and elevation of the objects in the scene. The proposed network provides state-of-the-art results for car, pedestrian, and cyclist detection with a single forward pass when evaluated on the KITTI 3D Object Detection Benchmark, with an accuracy that exceeds 64% mAP 3D for the Moderate difficulty. Further experiments on the challenging nuScenes dataset show the generalizability of both the method and the proposed BEV representation against different LiDAR devices and across a wider set of object categories by being able to reach more than 30% mAP with a single LiDAR sweep and almost 40% mAP with the usual 10-sweep accumulation.

BirdNet+: two-stage 3D object detection in LiDAR through a sparsity-invariant bird's eye view Articles

Overview

authors

published in

publication date

start page

end page

volume

Digital Object Identifier (DOI)

full text

Electronic International Standard Serial Number (EISSN)

abstract

Classification

subjects

keywords