Publication Type

Conference Proceeding Article

Version

acceptedVersion

Publication Date

4-2022

Abstract

Federated learning is a new learning paradigm that jointly trains a model from multiple data sources without sharing raw data. For the practical deployment of federated learning, data source selection is compulsory due to the limited communication cost and budget in real-world applications. The necessity of data source selection is further amplified in presence of data heterogeneity among clients. Prior solutions are either low in efficiency with exponential time cost or lack theoretical guarantees. Inspired by the diminishing marginal accuracy phenomenon in federated learning, we study the problem from the perspective of submodular optimization. In this paper, we aim at efficient data source selection with theoretical guarantees. We prove that data source selection in federated learning is a monotone submodular maximization problem and propose FDSS, an efficient algorithm with a constant approximate ratio. Furthermore, we extend FDSS to FDSS-d for dynamic data source selection. Extensive experiments on CIFAR10 and CIFAR100 validate the efficiency and effectiveness of our algorithms.

Keywords

Federated learning, Data source selection, Submodularity

Discipline

Databases and Information Systems | Software Engineering

Research Areas

Software and Cyber-Physical Systems

Publication

Database Systems for Advanced Applications: 27th International Conference, DASFAA 2022, Virtual Conference, April 11-14: Proceedings

Volume

13246

First Page

606

Last Page

614

ISBN

9783031001253

Identifier

10.1007/978-3-031-00126-0_43

Publisher

Springer

City or Country

Cham

Citation

ZHANG, Ruisheng; WANG, Yansheng; ZHOU, Zimu; REN, Ziyao; TONG, Yongxin; and XU, Ke. Data source selection in federated learning: A submodular optimization approach. (2022). Database Systems for Advanced Applications: 27th International Conference, DASFAA 2022, Virtual Conference, April 11-14: Proceedings. 13246, 606-614.
Available at: https://ink.library.smu.edu.sg/sis_research/7219

Copyright Owner and License

Authors

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Additional URL

https://doi.org/10.1007/978-3-031-00126-0_43

Download

Find it in your library

Included in

Databases and Information Systems Commons, Software Engineering Commons

COinS

Research Collection School Of Computing and Information Systems

Data source selection in federated learning: A submodular optimization approach

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

Volume

First Page

Last Page

ISBN

Identifier

Publisher

City or Country

Citation

Copyright Owner and License

Creative Commons License

Additional URL

Included in

Search

Links

Browse

Links

Research Collection School Of Computing and Information Systems

Data source selection in federated learning: A submodular optimization approach

Author

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

Volume

First Page

Last Page

ISBN

Identifier

Publisher

City or Country

Citation

Copyright Owner and License

Creative Commons License

Additional URL

Included in

Share

Search

Links

Browse

Links