DIML / CVLAB RGB+D Dataset

large-scale RGB+D Database

through comprehensive performance analysis.

Overview

We introduce an RGB-D scene dataset consisting of more than 200 indoor / outdoor scenes.

This dataset contains synchronized RGB-D frames from both Kinect v2 and Zed stereo camera. For the outdoor scene, we first generate disparity maps using an accurate stereo matching method and convert them using calibration parameters. A per-pixel confidence map of disparity is also provided. Our scenes are captured at various places, e.g., offices, rooms, dormitory, exhibition center, street, road etc., from Yonsei University and Ewha University

This dataset has been used to train convolutional neural networks in our project [1] and for our papers[2], [3], [4], [5]
"High quality 2D-to-multiview contents generation from large-scale RGB-D database", under Grant by the Institute for Information and Communications Technology Promotion(IITP) through the Korean Government(MSIP)(R0115-16-1007)

Citations

If you use this dataset in your research, please cite these publications :

DIML/CVL RGB-D technical_report : This material is presented to provide a detailed description about the DIML/CVl RGB-D dataset.
Y. Kim, B. Ham, C. Oh, and K. Sohn, "Structure selective depth super-resolution for RGB-D cameras," IEEE Trnas. on Image Processing, vol.25, no. 11, pp. 5527-38, Nov. 2016. [Paper]
S. Kim, D. Min, B. Ham, S. Kim, and K. Sohn, "Deep Stereo Confidence Prediction for Depth Estimation," IEEE International Conference on Image Processing, Sept. 2017. [Paper]
Y. Kim, H. Jung, D. Min, and K. Sohn, "Deep Monocular Depth Estimation via Integration of Global and Local Predictions," IEEE Trnas. on Image Processing, vol.27, no. 8, pp. 4131-43, Aug. 2018. [Paper]
J. Cho, D. Min, Y. Kim, and K. Sohn, "Deep Monocular Depth Estimation Leveraging a Large-scale Outdoor Stereo Dataset," Expert Systems with Applications, Mar. 2021. [Paper]

Sample Data

Do you want to preview our dataset?

Indoor Data

Kinect

depth_field : filled depth images using colorization (mm, 16bit png)
depth_raw : Raw depth images (mm, 16bit png)
depth_v : Normalized depth images for visualization purpose (8bit png)
color : Color images obtained from Kinect v2 (8bit png)

Training 1500

HR - 2.8 GB, LR - 706 MB

Testing 500

HR - 847 MB, LR - 208 MB

Outdoor Data

ZED

Left Left images obtained from ZED camera (8bit png)
Right : Right images obtained from ZED camera (8bit png)
Disparity : Disparity maps obtained by stereo matching (8bit png)
Depth : Disparity -> Depth (mm, 16bit png)
Confidence : Confidence maps for stereo matching (8bit png)

Training 1500

HR - 9.9 GB, LR - 1.8 GB

Testing 500

HR - 3.5 GB, LR - 610 MB

Contact Us

DIML

RGB+D DATASET

large-scale RGB+D Database

Overview

We introduce an RGB-D scene dataset consisting of more than 200 indoor / outdoor scenes.

Citations

If you use this dataset in your research, please cite these publications :

Sample Data

Indoor Data

Outdoor Data

Contact Us

Total Visitors

RGB+D DATASET

large-scale RGB+D Database

Overview

We introduce an RGB-D scene dataset consisting of more than 200 indoor / outdoor scenes.

Citations

If you use this dataset in your research, please cite these publications :

Sample Data

Indoor Data

Outdoor Data

Download our full Raw Dataset

Contact Us

Total Visitors