Conference paperLook at What I’m Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos
Conference paperRT3D: Achieving Real-Time Execution of 3D Convolutional Neural Networks on Mobile Devices