federated learning split learning split federated learning artificial intelligent internet of things edge computing
In the context of smart cities, the integration of artificial intelligence (AI) and the Internet of Things (IoT) has led to the proliferation of AIoT systems, which handle vast amounts of data to enhance urban infrastructure and services. However, the collaborative training of deep learning models within these systems encounters significant challenges, chiefly due to data privacy concerns and dealing with communication latency from large-scale IoT devices. To address these issues, multi-level split federated learning (multi-level SFL) has been proposed, merging the benefits of split learning (SL) and federated learning (FL). This framework introduces a novel multi-level aggregation architecture that reduces communication delays, enhances scalability, and addresses system and statistical heterogeneity inherent in large AIoT systems with non-IID data distributions. The architecture leverages the Message Queuing Telemetry Transport (MQTT) protocol to cluster IoT devices geographically and employs edge and fog computing layers for initial model parameter aggregation. Simulation experiments validate that the multi-level SFL outperforms traditional SFL by improving model accuracy and convergence speed in large-scale, non-IID environments. This paper delineates the proposed architecture, its workflow, and its advantages in enhancing the robustness and scalability of AIoT systems in smart cities while preserving data privacy.
Details
Title
Multi-Level Split Federated Learning for Large-Scale AIoT System Based on Smart Cities
Authors
Hanyue Xu - Xi’an Jiaotong-Liverpool University
Kah Phooi Seng (Corresponding Author) - Xi’an Jiaotong-Liverpool University
Jeremy Smith - University of Liverpool
Li Minn Ang - University of the Sunshine Coast, Queensland, School of Science, Technology and Engineering
The Fashion MNIST dataset can be found in Kaggle: https://www. kaggle.com/datasets/zalando-research/fashionmnist (accessed on 15 January 2024). The HAM10000 dataset comes from the paper: “The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions”, DOI: https://doi.org/10.1038/sdata.2018.161.