1 School of Engineering and Physical Sciences, Heriot-Watt University, Edinburgh, U.K.
2 State Key Laboratory of Autonomous Intelligent Unmanned Systems, Tongji University, Shanghai, China
3 College of Computer Science and Technology, Jilin University, Changchun, China
4 University of Shanghai for Science and Technology, Shanghai, China
5 National Key Laboratory of Automotive Chassis Integration and Bionics, Jilin University, Changchun, China
6 Department of Civil Engineering, The University of Hongkong, Hongkong, China
* Corresponding author: cheng.wang@hw.ac.uk
CARS responsibility-attributed adversarial scenario generation framework.
(a) Context-aware adversary selection. At each simulation step, CARS scores surrounding agents by their kinematic relationship to the ADS and stabilizes the selected agent over time, allowing the threat assignment to track the evolving conflict.
(b) Adversarial trajectory generation. The selected agent is initialized from a Gaussian-mixture diffusion prior. Closed-loop reinforcement learning fine-tunes the agent to approach the ADS and create safety-critical interactions.
(c) Attribution-aware objective. CARS targets ADS-attributable collisions under a CCD reference. A collision is attributable when it occurs for the ADS under test but would have been avoided by the CCD in the same encounter.
CARS is evaluated across three datasets spanning three continents, three road topologies, and contrasting driving cultures.
Three evaluation datasets spanning three continents and three road topologies.
(a) nuScenes front-facing camera frame at a Boston (US) intersection (1,000 urban scenes across Boston and Singapore).
(b) RounD overhead drone frame at the Neuweiler (Germany) roundabout (22 recordings of a four-arm yield-priority roundabout).
(c) AD4CHE overhead drone frame on a multi-lane highway (68 recordings across four Chinese cities).
(d) Geographic locations of the four data-collection countries; pins are coloured by dataset.
The same frozen CARS policy, trained only on nuScenes urban traffic, is applied to two naturalistic drone datasets with contrasting scene geometries. The diffusion generator and target-model checkpoints are held fixed; only a lightweight agent-selection classifier is refitted on each dataset's labelled target–adversary pairs. Both videos are rendered on the original drone photo backgrounds provided with each dataset.
A histogram gradient-boosting classifier re-evaluates all surrounding vehicles at every simulation step; a temporal confirmation gate (Kconf=5) suppresses transient ranking errors so the adversary role tracks the evolving scene. The same selection mechanism transfers to unseen scene geometries when retrained on each dataset's labelled target–adversary pairs.
Context-aware adversary re-selection across datasets.
(a) nuScenes.
(b) AD4CHE.
(c) RounD.
Each row shows three bird's-eye-view snapshots from one rollout scenario at t=0, the adversary-switch frame, and the collision frame, with a heatmap strip beneath giving the per-step adversarial probability Padv(t) for candidates A1 (upper) and A2 (lower); darker red indicates higher probability.
The target is shown in blue, the currently active adversary in red, and other agents in grey.
Each generated collision is replayed with a reference driver model controlling the target while the adversary trajectory is held fixed; the scenario is retained only if the reference driver still fails to avoid impact. Attribution is cross-validated under three reference models: FSM (UN R157 primary CCDM), CC-JP (Japanese careful-driver reference), and RSS (formal safety envelope).
Responsibility attribution and collision kinematics on CARS-generated nuScenes scenarios.
(a–c) Representative attribution disagreements in which FSM fails (a), CC-JP fails (b), and RSS fails (c). Adversary trajectory in red; target trajectory in blue; the target's stop position under each reference model is shown as a front-edge line (purple, FSM; orange, CC-JP; green, RSS).
(d) Max braking deficit BD, box-and-whisker per FSM criticality tier (Easy / Medium / Hard); box spans the interquartile range, whiskers extend to the 5th–95th percentiles, points are outliers, and median is annotated. BD > 0 indicates that required stopping distance exceeded the available gap.
(e) Adversary speed at collision, raincloud plot per FSM criticality tier (median annotated).
(f) Longitudinal closing speed at collision, kernel-density estimate per criticality tier (median annotated).
Benchmark comparison against existing adversarial generators and ADS-planner robustness tests on the nuScenes validation split.
| Method | Responsibility validity (%) ↑ | Diversity ↑ | Kinematic risk ↓ | Feasibility ↓ | ||
|---|---|---|---|---|---|---|
| FSM | CC-JP | RSS | Hcrit | BD+% | IP% | |
| Adversarial methods on nuScenes | ||||||
| STRIVE | 7.3 | 5.8 | 6.1 | 0.528 | 53.8 | 36.39 |
| SafeSim | 44.8 | 44.8 | 44.8 | 0.628 | 48.3 | 12.94 |
| Bezier-CAT | 21.1 | 36.4 | 15.0 | 0.260 | 84.1 | 73.08 |
| CARS (K=1 adv) | 45.2 | 35.5 | 53.2 | 0.797 | 66.1 | 27.40 |
| CARS (Ours) | 88.7 | 79.7 | 97.1 | 0.798 | 22.5 | 0.04 |
| ADS-planner robustness (fixed CARS adv) | ||||||
| One-component diffusion planner | 87.8 | 80.0 | 96.7 | 0.834 | 21.7 | 0.04 |
| CTG planner | 86.0 | 80.0 | 92.4 | 0.745 | 29.2 | 0.02 |
Validity columns report the percentage of each method's collision scenarios classified as attributable to the target under the three reference models. Hcrit is the normalised Shannon entropy of the FSM Hard/Medium/Easy distribution (higher is more balanced). BD+% is the fraction of scenarios with a positive braking deficit during the encounter. IP% is the scenario-averaged fraction of time steps exceeding any UN R157 feasibility bound (|a|>7 m/s², |jerk|>12.65 m/s³, |alat|>3.0 m/s²). CARS simultaneously achieves the highest attribution (88.7% FSM), balanced severity coverage (Hcrit=0.798), and near-zero kinematic infeasibility (IP=0.04%). Each baseline fails on a different axis: Bezier-CAT and STRIVE drive the adversary beyond physical limits; SafeSim concentrates severity into a narrower tier band; STRIVE also loses most of its collisions under FSM as unattributable. The ADS-planner rows fix the CARS adversary and only replace the target planner, showing that CARS does not depend on the planner architecture used during training.
The same frozen CARS policy, evaluated under three CCDMs (FSM, CC-JP, RSS), on three datasets with contrasting scene geometries.
| Dataset | Scene geometry | Episodes | FSM valid% ↑ | CC-JP valid% ↑ | RSS valid% ↑ |
|---|---|---|---|---|---|
| nuScenes Training | urban intersections | 408 | 88.7 | 79.7 | 97.1 |
| AD4CHE | multi-lane highway | 470 | 76.4 | 63.8 | 80.9 |
| RounD | four-arm roundabout | 927 | 57.5 | 52.0 | 70.7 |
Valid% = fraction of collisions classified as attributable to the target by the reference driver model (higher is better). AD4CHE and RounD numbers are from the same frozen CARS generator, with only the lightweight agent-selection classifier refitted to each dataset's labelled target–adversary pairs.