Measuring Generalization with Optimal Transport

Ching-Yao Chuang; Youssef Mroueh; Kristjan Greenewald; Antonio Torralba; Stefanie Jegelka

Publication

NeurIPS 2021

Conference paper

Measuring Generalization with Optimal Transport

NeurIPS 2021

Download paper

Abstract

Understanding the generalization of deep neural networks is one of the most important tasks in deep learning. Although much progress has been made, theoretical error bounds still often behave disparately from empirical observations. In this work, we develop margin-based generalization bounds, where the margins are normalized with optimal transport costs between independent random subsets sampled from the training distribution. In particular, the optimal transport cost can be interpreted as a generalization of variance which captures the structural properties of the learned feature space. Our bounds robustly predict the generalization error, given training data and network parameters, on large scale datasets. Theoretically, we demonstrate that the concentration and separation of features play crucial roles in generalization, supporting empirical results in the literature.

Date

06 Dec 2021

Publication

NeurIPS 2021

Authors

IBM-affiliated at time of publication

Topics

Machine Learning

Resources

Publication

Abstract

Date

Publication

Authors

Topics

Resources

Share