Add robustness badges mined from each parser's source and behavior by LucaCappelletti94 · Pull Request #19 · LucaCappelletti94/sql_ast_benchmark

LucaCappelletti94 · 2026-06-09T07:10:28Z

Adds per-parser robustness badges to each parser page (static panic discipline, empirical panic rate on the real corpus, unsafe surface, recursion-depth resilience, dependency count, serde-on-AST), produced by a new offline featurescan crate that scans each parser's source with syn and probes recursion depth in a child process, plus a ParseOutcome enum that lets grading tell a caught panic apart from an honest error. qusql-parse is the only parser that panics on real input, and only sqlparser-rs and sqlite3-parser are depth-guarded among the pure-Rust parsers (polyglot-sql overflows at depth 232). Also strips prose semicolons from rustdoc across the repo.

New featurescan crate parses each parser's library src with syn, counting panic-inducing constructs (panic, unreachable, unimplemented, todo, unwrap, expect, indexing) and unsafe usage, reading the crate's own lint policy, and probing recursion depth in a child process. Counts exclude tests, benches, examples, cfg(test) items, and test-helper files, and are a code-smell proxy rather than a crash proof. Grading now tells a caught panic apart from an honest error via a ParseOutcome enum, so each parser page reports its empirical panic rate on the real corpus. qusql-parse is the only non-zero offender (84 of 101085, 0.083 percent), all Option::unwrap on None. Each parser page gains six badges in the existing metadata grid (panic discipline, empirical panic rate, unsafe, recursion depth, deps, serde AST). The redundant hand-recorded unsafe pill is removed in favor of the scanned count. The feature scan and depth probe run as part of cargo regen, the shared schema lives in viz, and both committed snapshots are baked into the wasm at build time.

Replaces prose semicolons with periods in doc and code comments across the repo, per the no-semicolons-in-prose convention. Literal semicolon characters inside backticks (discussing SQL or code) are left intact. Comments only, no code or behavior change.

LucaCappelletti94 added 2 commits June 9, 2026 08:28

LucaCappelletti94 merged commit 5e5ac90 into main Jun 9, 2026
6 of 7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add robustness badges mined from each parser's source and behavior#19

Add robustness badges mined from each parser's source and behavior#19
LucaCappelletti94 merged 2 commits into
mainfrom
robustness-badges

LucaCappelletti94 commented Jun 9, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

LucaCappelletti94 commented Jun 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

LucaCappelletti94 commented Jun 9, 2026 •

edited

Loading