Research Collection School Of Computing and Information Systems

How important is the train-validation split in meta-learning?

Yu BAI
Minshuo CHEN
Pan ZHOU, Singapore Management UniversityFollow
Tuo ZHAO
D. Jason LEE
Sham KAKADE
Huan WANG
Caiming XIONG

Publication Type

Conference Proceeding Article

Version

acceptedVersion

Publication Date

7-2021

Abstract

Meta-learning aims to perform fast adaptation on a new task through learning a “prior” from multiple existing tasks. A common practice in meta-learning is to perform a train-validation split (train-val method) where the prior adapts to the task on one split of the data, and the resulting predictor is evaluated on another split. Despite its prevalence, the importance of the train-validation split is not well understood either in theory or in practice, particularly in comparison to the more direct train-train method, which uses all the pertask data for both training and evaluation. We provide a detailed theoretical study on whether and when the train-validation split is helpful in the linear centroid meta-learning problem. In the agnostic case, we show that the expected loss of the train-val method is minimized at the optimal prior for meta testing, and this is not the case for the train-train method in general without structural assumptions on the data. In contrast, in the realizable case where the data are generated from linear models, we show that both the train-val and train-train losses are minimized at the optimal prior in expectation. Further, perhaps surprisingly, our main result shows that the train-train method achieves a strictly better excess loss in this realizable case, even when the regularization parameter and split ratio are optimally tuned for both methods. Our results highlight that sample splitting may not always be preferable, especially when the data is realizable by the model. We validate our theories by experimentally showing that the train-train method can indeed outperform the train-val method, on both simulations and real meta-learning tasks.

Discipline

Artificial Intelligence and Robotics | Graphics and Human Computer Interfaces

Research Areas

Intelligent Systems and Optimization

Areas of Excellence

Digital transformation

Publication

Proceedings of the 38th International Conference on Machine Learning, Virtual Conference, 2021 July 18-24

First Page

Last Page

Publisher

https://proceedings.mlr.press/v139/bai21a.html

City or Country

Virtual Conference

Citation

BAI, Yu; CHEN, Minshuo; ZHOU, Pan; ZHAO, Tuo; LEE, D. Jason; KAKADE, Sham; WANG, Huan; and XIONG, Caiming. How important is the train-validation split in meta-learning?. (2021). Proceedings of the 38th International Conference on Machine Learning, Virtual Conference, 2021 July 18-24. 1-11.
Available at: https://ink.library.smu.edu.sg/sis_research/8991

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Download

Included in

Artificial Intelligence and Robotics Commons, Graphics and Human Computer Interfaces Commons

COinS

Research Collection School Of Computing and Information Systems

How important is the train-validation split in meta-learning?

Publication Type

Version

Publication Date

Abstract

Discipline

Research Areas

Areas of Excellence

Publication

First Page

Last Page

Publisher

City or Country

Citation

Creative Commons License

Included in

Search

Links

Browse

Links

Research Collection School Of Computing and Information Systems

How important is the train-validation split in meta-learning?

Author

Publication Type

Version

Publication Date

Abstract

Discipline

Research Areas

Areas of Excellence

Publication

First Page

Last Page

Publisher

City or Country

Citation

Creative Commons License

Included in

Share

Search

Links

Browse

Links