Point-Voxel CNN for efficient 3D deep learning: Depth + IR

TODO