Journal of the Operations Research Society of China ›› 2025, Vol. 13 ›› Issue (3): 750-774.doi: 10.1007/s40305-025-00585-0

Previous Articles     Next Articles

Black-Box Rare-Event Simulation for Safety Testing of AI Agents: An Overview

Yuan-Lu Bai1, Zhi-Yuan Huang2, Henry Lam1, Ding Zhao3   

  1. 1 Department of Industrial Engineering and Operations Research, Columbia University, New York 10027, USA;
    2 School of Economics and Management, Tongji University, Shanghai 200092, China;
    3 Department of Mechanical Engineering, Carnegie Mellon University, Pittsburgh 15213, USA
  • Received:2024-03-05 Revised:2024-12-15 Online:2025-09-30 Published:2025-09-16
  • Contact: Zhi-Yuan Huang E-mail:huangzy@tongji.edu.cn
  • Supported by:
    Zhi-Yuan Huang’s research is supported by the National Natural Science Foundation of China (No. 72301195), the Shanghai Rising-Star Program (No. 22YF1451100), and the Fundamental Research Funds for the Central Universities.

Abstract: This paper provides an overview of black-box rare-event simulation methods applicable to the safety testing of artificial intelligence agents. We explore the challenges and efficiency criteria in black-box simulation, especially emphasizing the subtle occurrence and control of underestimation errors. The paper reviews various adaptive methods, such as the cross-entropy method and adaptive multilevel splitting, highlighting both their empirical effectiveness and theoretical limitations. Additionally, it offers a comparative analysis of different confidence interval constructions for crude Monte Carlo methods, aiming to mitigate underestimation errors through effective uncertainty quantification. The paper concludes with a certifiable deep importance sampling approach, using deep neural networks to develop conservative estimators that address underestimation issues.

Key words: Rare-event simulation, Black-box systems, AI system safety, Underestimation

CLC Number: