Yamaha-CMU Off-Road Dataset
We have collected Yamaha-CMU-Off-Road, or YCOR, which consists of 1076 images collected in four different locations in Western Pennsylvania and Ohio (as shown in the figure), spanning three different seasons.
The dataset was labelled using a polygon-based interface with eight classes: sky, rough trail, smooth trail, traversable grass, high vegetation, non-traversable low vegetation, obstacle. The polygon labels were post-processed using a Dense CRF to densify the labels; the output of the CRF was manually inspected, and in some cases corrected, to ensure no wrong labels were created.
We believe our dataset is more diverse and challenging than DeepScene. In the following figure, we show the mean RGB image and pixelwise labelmode of each dataset. The DeepScene dataset shows a left-right bias and more predictable structure than ours; if we used the pixelwise mode as a baseline classifier, we would obtain 0.30 pixelwise error-rate in DeepScene, but 0.51 in ours. However, we acknowledge that compared to recent efforts, both datasets are relatively small.

Our current split has 931 training images, and 145 validation images. This split was generated randomly, ensuring there was no overlap in data collection session between images in the training and validation split. However, there is overlap in locations used.

Citation
Please read our paper for details.
@inproceedings{maturana2018real,
title={Real-time semantic mapping for autonomous off-road navigation},
author={Maturana, Daniel and Chou, Po-Wei and Uenoyama, Masashi and Scherer, Sebastian},
booktitle={Field and Service Robotics},
pages={335--350},
year={2018},
organization={Springer}
}
Download
The dataset can be downloaded here.
Contact
Sebastian Scherer - (basti [at] cmu [dot] edu)
Wenshan Wang - (wenshanw [at] andrew [dot] cmu [dot] edu)
Term of use
This work is licensed under a Creative Commons Attribution 4.0 International License.