This is an old revision of the document!
Planning and control must be validated as a system function, not only as isolated algorithms. A planner may produce a technically correct trajectory, and a controller may track a reference path accurately, but the combined behavior can still be unsafe if perception is delayed, localization drifts, prediction is wrong, or the actuation path introduces latency. The validation view therefore focuses on the complete decision–execution loop and asks whether the autonomous vehicle behaves safely, predictably, and consistently inside its intended operating conditions.
The purpose of this subsection is to define how planning and control should be evaluated across different validation levels. The key point is that component tests are useful, but they are only one part of the evidence chain. The reader should be able to trace a planning or control function from its local behavior to its interaction with the full vehicle system, and then to the operating domain where the vehicle is actually expected to function.
<!– Figure comment: Validation view from component verification to system and operational validation –>
The validation process can be understood as a progression from local correctness to system-level safety. This progression is especially important in the planning and control layer, because the behavior of the system depends on tightly coupled interactions between the decision layer, the planner, the controller, the vehicle model, and the environment.
| Validation level | What is checked | Example for planning and control | Typical evidence |
|---|---|---|---|
| Unit level | The component in isolation | Planner logic, controller law, fallback trigger, trajectory tracking rule | Component test results, assertion checks, interface tests |
| Integration level | Interaction between modules | Planner with localization, prediction, and control interface | Closed-loop integration logs, timing traces, message traces |
| System level | Full vehicle behavior in a closed loop | Lane change, overtaking, stopping, obstacle avoidance | Scenario results, system KPIs, safety metrics |
| Operational level | Behavior in the intended use context | Validated operation inside the ODD, track, or field pilot | Validation report, field data, pass/fail evidence |
This distinction matters because a good unit test does not guarantee safe system behavior. A motion planner can be correct as a software module and still create unsafe motion if it receives stale state information or if the controller cannot follow the generated path within the vehicle’s physical limits. Likewise, a controller can be stable in isolation but still produce an unsafe outcome if the trajectory itself is too aggressive or if the vehicle state changes faster than expected.
In the V-model perspective used throughout the handbook, planning and control occupy the portion where implementation evidence must be mapped back to the system requirements. The chapter does not need to repeat the general V-model explanation here. It is enough to show that planning and control are evaluated at several points along that validation chain: first as modules, then as integrated functions, and finally as part of the complete autonomous vehicle.
The validation question is not simply “does the algorithm work?” It is “does the vehicle behave safely and correctly when the algorithm is embedded in the full autonomy stack?” That means the evaluation must cover the following aspects together:
| Validation aspect | What it means in practice |
|---|---|
| Functional correctness | The maneuver or trajectory matches the intended behavior |
| Physical feasibility | The motion can be executed by the vehicle without violating dynamics |
| Safety | The vehicle avoids collisions and unsafe close approaches |
| Rule compliance | The motion respects traffic rules, road geometry, and operational constraints |
| Robustness | The behavior remains acceptable under uncertainty, delays, and disturbances |
| Comfort | The motion does not introduce excessive jerk, sharp braking, or unstable steering |
| Timeliness | The planner and controller act within the response time allowed by the scenario |
Planning and control are sensitive to the assumptions behind the system. A small change in localization accuracy, actuation delay, road friction, or prediction uncertainty may produce a very different trajectory outcome. For this reason, validation should not be framed as a single pass/fail test on one nominal case. It should be framed as a collection of evidence showing that the system remains acceptable across the planned range of operating conditions.
The planning and control layer is best validated through a chain of evidence. First, the team defines a maneuver or mission objective. Then the system assumptions and operating constraints are specified. Next, scenarios are generated to exercise the maneuver under controlled variation. After that, simulation, closed-loop execution, and physical confirmation are used to check the system response. Finally, the results are expressed in measurable metrics and tied back to the safety argument.
This logic is already visible in the current material, which treats digital twins as the basis for meaningful simulation, uses design-of-experiments to stress the decision and control logic, and combines local properties such as trajectory tracking with system-level effects such as minimum distance to collision. It also emphasizes that the simulator must remain predictive as the product evolves, so that post-deployment logs, updated vehicle parameters, and map changes can be folded back into continuous validation.
The important design principle is that planning and control validation should support both:
1. **local evidence**, where the behavior of a single planner or controller can be checked; 2. **system evidence**, where the combined behavior of the autonomy stack is evaluated in closed loop.
This is why scenario execution, digital twin fidelity, and timing realism matter. If the virtual environment is too abstract, the test may not reveal the same failure modes that appear in the real vehicle. If the virtual environment is too expensive or too detailed, the test program may not scale to a useful number of scenarios. Validation therefore needs a balance between breadth and realism.
For this chapter, the most useful evidence types are trajectory evidence, timing evidence, and safety evidence.
| Evidence type | Example output | Why it matters |
|---|---|---|
| Trajectory evidence | Path error, tracking error, lane deviation, path smoothness | Shows whether the plan can be executed as intended |
| Timing evidence | Planner latency, controller latency, response delay | Shows whether the system reacts quickly enough |
| Safety evidence | Collision result, TTC, DTC, minimum distance | Shows whether the behavior remains safe |
| Robustness evidence | Performance under sensor delay, localization drift, or actuation variation | Shows whether the result survives uncertainty |
| Operational evidence | Performance inside the intended ODD | Shows whether the system is ready for realistic use |
These evidence types should be collected together, not separately. A vehicle that tracks a path accurately but violates safety margins is not acceptable. A vehicle that avoids collision but behaves erratically or unpredictably is also not acceptable. The validation view therefore requires a combined reading of the metrics rather than a single score.
The planning and control layer is the point where autonomous behavior becomes visible in the physical world. A mistake here is not only a software error; it is an action error. That is why this subsection must be treated as a bridge between the planning algorithms described earlier and the scenario-based test methodology that follows. It prepares the reader to ask the right questions: what should be tested, at what level, under which conditions, and with what evidence.
The next subsection should therefore move from this validation view to the concrete generation of test scenarios, logical ranges, and executable cases.