Yamaha-CMU Off-Road Dataset

Yamaha-CMU Off-Road Dataset

Published: by

Yamaha-CMU Off-Road Dataset

We have collected Yamaha-CMU-Off-Road, or YCOR, which consists of 1076 images collected in four different locations in Western Pennsylvania and Ohio (as shown in the figure), spanning three different seasons.

The dataset was labelled using a polygon-based interface with eight classes: sky, rough trail, smooth trail, traversable grass, high vegetation, non-traversable low vegetation, obstacle. The polygon labels were post-processed using a Dense CRF to densify the labels; the output of the CRF was manually inspected, and in some cases corrected, to ensure no wrong labels were created.

We believe our dataset is more diverse and challenging than DeepScene. In the following figure, we show the mean RGB image and pixelwise labelmode of each dataset. The DeepScene dataset shows a left-right bias and more predictable structure than ours; if we used the pixelwise mode as a baseline classifier, we would obtain 0.30 pixelwise error-rate in DeepScene, but 0.51 in ours. However, we acknowledge that compared to recent efforts, both datasets are relatively small.

Data and segmentation labels
First two columns: A comparison of dataset statistics. We show the mean RGB frame and the pixelwise mode for the labelled frames in the training sets of each dataset used. Last column: a map with locations where YCOR was collected.

Our current split has 931 training images, and 145 validation images. This split was generated randomly, ensuring there was no overlap in data collection session between images in the training and validation split. However, there is overlap in locations used.

Data and segmentation labels
A glance of the dataset.

Citation

Please read our paper for details.

@inproceedings{maturana2018real,
  title={Real-time semantic mapping for autonomous off-road navigation},
  author={Maturana, Daniel and Chou, Po-Wei and Uenoyama, Masashi and Scherer, Sebastian},
  booktitle={Field and Service Robotics},
  pages={335--350},
  year={2018},
  organization={Springer}
}

Download

The dataset can be downloaded here.

Contact

Sebastian Scherer - (basti [at] cmu [dot] edu)

Wenshan Wang - (wenshanw [at] andrew [dot] cmu [dot] edu)

Term of use

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.

Latest Research

SubT-MRS: Pushing SLAM Towards All-weather Environments
SubT-MRS: Pushing SLAM Towards All-weather Environments

Simultaneous localization and mapping (SLAM) is a fundamental task for numerous applications such...

TartanDrive 2.0: More Modalities and Better Infrastructure to Further Self-Supervised Learning Research in Off-Road Driving Tasks
TartanDrive 2.0: More Modalities and Better Infrastructure to Further Self-Supervised Learning Research in Off-Road Driving Tasks

TartanAviation: Image, Speech, and Trajectory Datasets for Terminal Airspace Operations
TartanAviation: Image, Speech, and Trajectory Datasets for Terminal Airspace Operations

We introduce TartanAviation, an open-source multi-modal dataset focused on terminal-area airspace...