Loads the full bundle and runs hundreds of probing tests across the 22
modules — focus on API surfaces, edge cases, and error paths that the
per-module smoke tests don't exercise. Each check(...) is
isolated with try/except so a single ✗ doesn't halt the run. The section
footers and final summary show pass/fail counts — start digging where
the failure ratio jumps.