A Survey on Federated Learning: Fundamentals, Challenges and Client Selection Methods
Abstract
Federated learning (FL) represents a transformative shift in machine learning, moving from conventional centralized approaches to a distributed framework that emphasizes data privacy and security. The FL server transmits an initial model to clients, which they train locally on their private data. After training, the server aggregates model updates from each client to update the global model. Selecting the best clients in FL is critical to improving the convergence speed and accuracy of the final model, which requires careful client selection approaches. The client selection phase of FL faces numerous challenges that impact overall training performance, including statistical heterogeneity and system heterogeneity resulting from the diversity of client data and resources. Communication costs present another challenge, especially in networks with limited client communication resources. Additionally, selecting trustworthy clients represents another challenge, as selecting malicious clients creates a significant risk within the FL training process. Moreover, the fairness challenge entails providing fair opportunities for all clients to participate in training. To address these challenges, we offer solutions that utilize effective techniques, approaches, and client selection methods. This survey presents a taxonomy of modern client selection methods in FL, highlighting the improvements in FL performance and effectiveness achieved through these methods, including greedy selection, reinforcement learning-based selection, multi-armed bandit-based selection, clustering-based selection, and reputation & security-based selection. Subsequently, it offers a general comparison between these methods in terms of their core ideas, advantages, limitations, and use cases. Finally, future and potential trends in client selection and improving performance in FL are identified.
Identifiers
Download this PDF file
##submission.supplementaryFiles##
Statistics
How to Cite
Copyright and Licensing

This work is licensed under a Creative Commons Attribution 4.0 International License.





