Welcome to OpenSonarDatasets, a repository dedicated to consolidate open-source sonar datasets for underwater research and development. We encourage researchers in the field to expand the current collection, aiming to increase the visibility of open-source sonar datasets and provide an easier way to find and compare datasets.
The lack of consolidated and accessible sonar datasets makes research in underwater robotics challenging. This repository aims to bridge that gap by offering an organized collection of open-source sonar datasets with their link repository to help researchers interested in open-source sonar datasets start their projects efficiently.
This dataset comparison originates from the journal paper:
M. Aubard, A. Madureira, L. Teixeira and J. Pinto,
Sonar-Based Deep Learning in Underwater Robotics: Overview, Robustness, and Challenges,
IEEE Journal of Oceanic Engineering, 2025.
https://doi.org/10.1109/JOE.2025.3531933
If this repository or the associated paper helps your research or project, please consider citing our paper.
While this repository will continue to evolve with new datasets contributed by the community, these additions are not part of the original journal comparison, as the paper content is fixed upon publication.
The following table compares the state-of-the-art sonar underwater datasets by analyzing the type of sonar (Sonar), type of data (Data), number of data samples (No Data), objects labeled in the data (Object Labels), if the data is annotated, DL tasks (Annotation), if the data collection set up such as sonar frequency, altitude, etc. are described in the dataset or not (Set-up), and finally the year of the dataset publication (Year). The * symbol indicates that the dataset is not only limited to sonar but extended to other sensors such as optical cameras.
In addition, for readability, the sonar acronyms are SSS (Side-Scan Sonar), FLS (Forward-Looking Sonar), MSIS (Mechanical Scanning Imaging Sonar), and MBES (Multi-Beam Echo Sounder) represent the different types of sonar used in these datasets.
Dataset | Sonar | Data | No Data | Object labels | Annotation | Set-up | Year | Paper |
---|---|---|---|---|---|---|---|---|
Northern Adriatic Reefs | SSS | GeoTIFF | 7 | Reefs | β | β | 2010 | β |
Lago Grey | SSS | Raw | β | Glacier, Walls | β | β | 2019 | Paper |
UCI ML | β | Raw | 211 | Mines, Rocks | Classification | β | β | β |
SeabedObjects-KLSG | SSS | Images | 1190 | Wrecks, Humans, Mines | Classification | β | 2020 | Paper |
Marine_PULSE | SSS | Images | 627 | Pipes, Mounds, Platforms | Classification | β | 2023 | Paper |
NKSID | FLS | Images | 2617 | Infrastructures, Propellers, Tires | Classification | β | 2024 | Paper |
UATD | FLS | Images | 9200 | Tires, Mannequins, Boxes | Object Detection | β | 2022 | Paper |
SSS for Mine Detection | SSS | Images | 1170 | Mines | Object Detection | β | 2024 | Paper |
SWDD | SSS | Images | 7904 | Walls | Object Detection | β | 2024 | Paper |
SubPipe | SSS * | Images | 10030 | Pipelines | Object Detection | β | 2024 | Paper |
UXO | FLS | Images/Raw | 74437 | Unexploded Ordnances | Object Detection | β | 2024 | Paper |
MDT | FLS | Images | 2471 | Infrastructures, Debris | Segmentation | β | 2021 | Paper |
NNSSS | SSS | Images | 10 | Sea Grass, Rocks, Sands | Segmentation | β | 2020 | Paper |
SASSED | SAS | Images | 129 | Muds, Sea Grass, Rocks, Sands | Segmentation | β | 2023 | β |
Seafloor Sediments | SSS | Images | 434164 | Rocks, Marine life | Segmentation | β | 2023 | Paper |
DIDSON | FLS | Images | 1000 | Fishes Species | Segmentation | β | 2022 | Paper |
AI4Shipwreck | SSS | Images | 286 | Shipwrecks | Segmentation | β | 2024 | Paper |
Cave Sonar | MSIS * | Rosbag | 500 meters | Cave Seabed | SLAM | β | 2017 | Paper |
Aurora | MBES, SSS * | Raw | MBES: 81km, SSS: 15h | Seabed, Marine habitats | SLAM | β | 2020 | Paper |
MBES-Slam | MBES | Rosbag | 4 missions | Seabed | SLAM | β | 2022 | Paper |
We welcome contributions from the community! If you have an open-source sonar dataset that you would like to add to this repository, please create a pull request with the line description for the dataset table and the dataset link (and, if any, the published paper). This repository does not store the datasets but is a central directory to find links to most available datasets. By contributing, you help create a central location for researchers to easily access and compare sonar datasets, ultimately benefiting the field of underwater robotics.
This project has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie SkΕodowska-Curie grant agreement No. 956200.
This work is part of the Reliable AI for Marine Robotics (REMARO) Project. For more info, please visit: https://remaro.eu/