A journal of IEEE and CAA , publishes high-quality papers in English on original theoretical/experimental research and development in all areas of automation
Volume 13 Issue 6
Jun.  2026

IEEE/CAA Journal of Automatica Sinica

  • JCR Impact Factor: 18.3, Top 1 (SCI Q1)
    CiteScore: 28.2, Top 1% (Q1)
    Google Scholar h5-index: 95, TOP 5
Turn off MathJax
Article Contents
Z. Yang, X. Dai, J. Cheng, Y. Huang, and P. Shi, “Quality or quantity? Error-informed selective online learning with gaussian processes in multi-agent systems,” IEEE/CAA J. Autom. Sinica, vol. 13, no. 6, pp. 1325–1338, Jun. 2026. doi: 10.1109/JAS.2025.125993
Citation: Z. Yang, X. Dai, J. Cheng, Y. Huang, and P. Shi, “Quality or quantity? Error-informed selective online learning with gaussian processes in multi-agent systems,” IEEE/CAA J. Autom. Sinica, vol. 13, no. 6, pp. 1325–1338, Jun. 2026. doi: 10.1109/JAS.2025.125993

Quality or Quantity? Error-Informed Selective Online Learning With Gaussian Processes in Multi-Agent Systems

doi: 10.1109/JAS.2025.125993
Funds:  This work was supported by the Federal Ministry of Research, Technology, and Space of Germany in the Programme of “Souverän Digital Vernetzt” Under Joint Project 6G-life With Project (16KIS2414) and the National Natural Science Foundation of China (U24B20184, 62373118)
More Information
  • Effective cooperation is pivotal in distributed learning for multi-agent systems, where the interplay between the quantity and quality of the machine learning models is crucial. This paper reveals the irrationality of indiscriminate inclusion of all models on agents for joint prediction, highlighting the imperative to prioritize quality over quantity in cooperative learning. Specifically, we present the first selective online learning framework for distributed Gaussian process (GP) regression, namely distributed error-informed GP (EIGP), that enables each agent to assess its neighboring collaborators, using the proposed selection function to choose the higher quality GP models with less prediction errors. Moreover, algorithmic enhancements are embedded within the EIGP, including a greedy algorithm (gEIGP) for accelerating prediction and an adaptive algorithm (aEIGP) for improving prediction accuracy. In addition, approaches for fast prediction and model update are introduced in conjunction with the error-informed quantification term iteration and a data deletion strategy to achieve real-time learning operations. Numerical simulations are performed to demonstrate the effectiveness of the developed methodology, showcasing its superiority over the state-of-the-art distributed GP methods with different benchmarks.

     

  • loading
  • 1 The generalization of aEIGP and its variations are presented in Appendix A.

    2 Codes and datasets are all available at https://github.com/Zewen-Yang/EIGP.

    3 For additional results, refer to the extended version [36].

  • [1]
    Y. Yang, H. Modares, K. G. Vamvoudakis, and F. L. Lewis, “Cooperative finitely excited learning for dynamical games,” IEEE Trans. Cybern., vol. 54, no. 2, pp. 797–810, Feb. 2024. doi: 10.1109/TCYB.2023.3274908
    [2]
    Q. Wang, S. Jin, and Z. Hou, “Event-triggered cooperative model-free adaptive iterative learning control for multiple subway trains with actuator faults,” IEEE Trans. Cybern., vol. 53, no. 9, pp. 6041–6052, Sep. 2023. doi: 10.1109/TCYB.2023.3246096
    [3]
    O. Friha, M. A. Ferrag, L. Shu, L. Maglaras, and X. Wang, “Internet of things for the future of smart agriculture: A comprehensive survey of emerging technologies,” IEEE/CAA J. Autom. Sinica, vol. 8, no. 4, pp. 718–752, Apr. 2021. doi: 10.1109/JAS.2021.1003925
    [4]
    Z. Yan, Z. Yang, X. Pan, J. Zhou, and D. Wu, “Virtual leader based path tracking control for multi-UUV considering sampled-data delays and packet losses,” Ocean Eng., vol. 216, Art. no. 108065, Nov. 2020. doi: 10.1016/j.oceaneng.2020.108065
    [5]
    M. Li, Y. Long, T. Li, H. Liang, and C. L. P. Chen, “Dynamic event-triggered consensus control for input constrained multi-agent systems with a designable minimum inter-event time,” IEEE/CAA J. Autom. Sinica, vol. 11, no. 3, pp. 649–660, Mar. 2024. doi: 10.1109/JAS.2023.123582
    [6]
    C. Ma and D. Dong, “Finite-time prescribed performance time-varying formation control for second-order multi-agent systems with non-strict feedback based on a neural network observer,” IEEE/CAA J. Autom. Sinica, vol. 11, no. 4, pp. 1039–1050, Apr. 2024. doi: 10.1109/JAS.2023.123615
    [7]
    F. Zhang, Q. Yang, and D. An, “Privacy preserving demand side management method via multi-agent reinforcement learning,” IEEE/CAA J. Autom. Sinica, vol. 10, no. 10, pp. 1984–1999, Oct. 2023. doi: 10.1109/JAS.2023.123321
    [8]
    M. Wang, H. Shi, and C. Wang, “Distributed cooperative learning for discrete-time strict-feedback multi agent systems over directed graphs,” IEEE/CAA J. Autom. Sinica, vol. 9, no. 10, pp. 1831–1844, Oct. 2022. doi: 10.1109/JAS.2022.105542
    [9]
    G. Wang, J. Wu, R. He, and B. Tian, “Speed and accuracy tradeoff for LiDAR data based road boundary detection,” IEEE/CAA J. Autom. Sinica, vol. 8, no. 6, pp. 1210–1220, Jun. 2021. doi: 10.1109/JAS.2020.1003414
    [10]
    T. Liu, J. Lu, Z. Yan, and G. Zhang, “Robust Gaussian process regression with input uncertainty: A PAC-Bayes perspective,” IEEE Trans. Cybern., vol. 54, no. 2, pp. 962–973, Feb. 2024. doi: 10.1109/TCYB.2022.3191022
    [11]
    C. Zhu, H. Yang, X. Jin, K. Xu, and H. Li, “Multilayer online sequential reduced kernel extreme learning machine-based modeling for time-varying distributed parameter systems,” IEEE Trans. Cybern., vol. 54, no. 1, pp. 624–634, Jan. 2024. doi: 10.1109/TCYB.2023.3293196
    [12]
    C. E. Rasmussen and C. K. I. Williams, Gaussian Processes for Machine Learning. Cambridge, USA: MIT Press, 2006.
    [13]
    H. Liu, Y.-S. Ong, X. Shen, and J. Cai, “When Gaussian process meets big data: A review of scalable GPs,” IEEE Trans. Neural Netw. Learn. Syst., vol. 31, no. 11, pp. 4405–4423, Nov. 2020. doi: 10.1109/TNNLS.2019.2957109
    [14]
    V. Tresp, “Mixtures of Gaussian processes,” in Proc. 14th Int. Conf. Neural Information Processing Systems, Denver, USA, 2000, pp. 633−639.
    [15]
    G. E. Hinton, “Training products of experts by minimizing contrastive divergence,” Neural Comput., vol. 14, no. 8, pp. 1771–1800, Aug. 2002. doi: 10.1162/089976602760128018
    [16]
    Y. Cao and D. J. Fleet, “Generalized product of experts for automatic and principled fusion of Gaussian process predictions,” arXiv preprint arXiv: 1410.7827, 2015.
    [17]
    V. Tresp, “A Bayesian committee machine,” Neural Comput., vol. 12, no. 11, pp. 2719–2741, Nov. 2000. doi: 10.1162/089976600300014908
    [18]
    M. P. Deisenroth and J. W. Ng, “Distributed Gaussian processes,” in Proc. 32nd Int. Conf. Machine Learning, Lille, France, 2015, pp. 1481−1490.
    [19]
    H. Liu, J. Cai, Y. Wang, and Y. S. Ong, “Generalized robust Bayesian committee machine for large-scale Gaussian process regression,” in Proc. 35th Int. Conf. Machine Learning, Stockholm, Sweden, 2018, pp. 3131−3140.
    [20]
    Z. Yang, S. Sosnowski, Q. Liu, J. Jiao, A. Lederer, and S. Hirche, “Distributed learning consensus control for unknown nonlinear multi-agent systems based on Gaussian processes,” in Proc. 60th IEEE Conf. Decision and Control, Austin, USA, 2021, pp. 4406−4411.
    [21]
    Z. Yang, S. Dong, A. Lederer, X. Dai, S. Chen, S. Sosnowski, G. Hattab, and S. Hirche, “Cooperative learning with Gaussian processes for Euler-Lagrange systems tracking control under switching topologies,” in Proc. American Control Conf., Toronto, Canada, 2024, pp. 560−567.
    [22]
    A. Lederer, Z. Yang, J. Jiao, and S. Hirche, “Cooperative control of uncertain multiagent systems via distributed Gaussian processes,” IEEE Trans. Autom. Control, vol. 68, no. 5, pp. 3091–3098, May 2023. doi: 10.1109/TAC.2022.3205424
    [23]
    Z. Yuan and M. Zhu, “Lightweight distributed Gaussian process regression for online machine learning,” IEEE Trans. Autom. Control, vol. 69, no. 6, pp. 3928–3943, Jun. 2024. doi: 10.1109/TAC.2024.3351555
    [24]
    T. N. Hoang, Q. M. Hoang, K. H. Low, and J. How, “Collective online learning of Gaussian processes in massive multi-agent systems,” in Proc. 33rd AAAI Conf. Artificial Intelligence, Honolulu, USA, 2019, pp. 7850−7857.
    [25]
    S. He, M. Tang, J. Fu, and J. Liang, “Distributed online sparse Gaussian process regression for multi-agent coverage control,” in Proc. 42nd Chinese Control Conf., Tianjin, China, 2023, pp. 5464−5469.
    [26]
    X. Dai, A. Lederer, Z. Yang, and S. Hirche, “Can learning deteriorate control? Analyzing computational delays in Gaussian process-based event-triggered online learning,” in Proc. 5th Annu. Learning for Dynamics and Control Conf., Philadelphia, USA, 2023, pp. 445−457.
    [27]
    X. Dai, Z. Yang, M. Xu, S. Zhang, F. Liu, G. Hattab, and S. Hirche, “Decentralized event-triggered online learning for safe consensus control of multi-agent systems with Gaussian process regression,” Eur. J. Control, vol. 80, Art. no. 101058, Nov. 2024. doi: 10.1016/j.ejcon.2024.101058
    [28]
    X. Dai, Z. Yang, S. Zhang, D.-H. Zhai, Y. Xia, and S. Hirche, “Cooperative online learning for multiagent system control via Gaussian processes with event-triggered mechanism,” IEEE Trans. Neural Netw. Learn. Syst., vol. 36, no. 7, pp. 13304–13318, Jul. 2025. doi: 10.1109/TNNLS.2024.3446732
    [29]
    T. Ding, R. Zheng, S. Zhang, and M. Liu, “Resource-efficient cooperative online scalar field mapping via distributed sparse Gaussian process regression,” IEEE Robot. Autom. Lett., vol. 9, no. 3, pp. 2295–2302, Mar. 2024. doi: 10.1109/LRA.2024.3355782
    [30]
    D. Jang, J. Yoo, C. Y. Son, D. Kim, and H. J. Kim, “Multi-robot active sensing and environmental model learning with distributed Gaussian process,” IEEE Robot. Autom. Lett., vol. 5, no. 4, pp. 5905–5912, Oct. 2020. doi: 10.1109/LRA.2020.3010456
    [31]
    Z. Yang, X. Dai, A. Dubey, S. Hirche, and G. Hattab, “Whom to trust? Elective learning for distributed Gaussian process regression,” in Proc. 23rd Int. Conf. Autonomous Agents and Multiagent Systems, Auckland, New Zealand, 2024, pp. 2020−2028.
    [32]
    B. Laurent and P. Massart, “Adaptive estimation of a quadratic functional by model selection,” Ann. Stat., vol. 58, no. 5, pp. 1302–1338, Oct. 2000. doi: 10.1214/aos/1015957395
    [33]
    N. Srinivas, A. Krause, S. M. Kakade, and M. W. Seeger, “Information-theoretic regret bounds for Gaussian process optimization in the bandit setting,” IEEE Trans. Inf. Theory, vol. 58, no. 5, pp. 3250–3265, May 2012. doi: 10.1109/TIT.2011.2182033
    [34]
    A. Lederer, J. Umlauft, and S. Hirche, “Uniform error bounds for Gaussian process regression with application to safe control,” in Proc. 33rd Int. Conf. Neural Information Processing Systems, Vancouver, Canada, 2019, Art. no. 60.
    [35]
    H. Wendland, Scattered Data Approximation. Cambridge, UK: Cambridge University Press, 2004.
    [36]
    Z. Yang, X. Dai, J. Cheng, Y. Huang, and P. Shi, “Quality or quantity? Error-informed selective online learning with Gaussian processes in multi-agent systems: Extended version,” arXiv preprint arXiv: 2601.14275, 2026.
    [37]
    Z. Yang, X. Dai, and S. Hirche, “Asynchronous distributed Gaussian process regression,” in Proc. 39th AAAI Conf. Artificial Intelligence, Philadelphia, Pennsylvania, 2025, pp. 22065−22073.
    [38]
    J. W. Ng and M. P. Deisenroth, “Hierarchical mixture-of-experts model for large-scale Gaussian process regression,” arXiv preprint arXiv: 1412.3078, 2014.

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Figures(7)  / Tables(1)

    Article Metrics

    Article views (49) PDF downloads(4) Cited by()

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return