V2 quantizer: fix IO-boundary shared clusters left in float by rascani · Pull Request #20291 · pytorch/executorch

rascani · 2026-06-15T22:37:41Z

Summary:
Shared-op clusters (e.g. cat, view, reshape) on the quantized IO boundary were silently left in float by the composable TOSA quantizer (_TOSAQuantizerV2), causing them to fall off the Ethos-U integer delegate onto CPU.

SharedQspecQuantizer propagates a qspec only from already-quantized neighbors. A cluster whose only quantized neighbors are a uint8 model input (intentionally skipped by _skip_shared_qspec_from_io to confine uint8 to the IO boundary) and/or an input-state placeholder with no output_qspec had no qspec to propagate, so it was rejected and remained in float.

The fix adds _is_quantized_io_boundary, which detects annotated placeholder/output nodes that signal the cluster is on the quantized data path even when their qspec is filtered. _get_shared_clique now returns a touches_quantized_io flag alongside the usual results. When _annotate_shared_cluster finds an empty adjacent_qspecs but a boundary-touching cluster, it initiates quantization from the global config input-activation qspec instead of rejecting. _TOSAQuantizerV2.set_global now also propagates to shared_qspec_quantizer.global_config so the fallback is wired automatically.

This restores the correctness fix from D107320847, which was abandoned because its other fix (parameter-operand weight misclassification) had already been resolved via the is_weight PARAMETER_TARGETS refactor.

This change was developed with assistance from Claude.

Differential Revision: D108662081

cc @digantdesai @freddan80 @per @zingo @oscarandersson8218 @mansnils @Sebastian-Larsson @robell

Summary: Shared-op clusters (e.g. `cat`, `view`, `reshape`) on the quantized IO boundary were silently left in float by the composable TOSA quantizer (`_TOSAQuantizerV2`), causing them to fall off the Ethos-U integer delegate onto CPU. `SharedQspecQuantizer` propagates a qspec only from already-quantized neighbors. A cluster whose only quantized neighbors are a uint8 model input (intentionally skipped by `_skip_shared_qspec_from_io` to confine uint8 to the IO boundary) and/or an input-state placeholder with no `output_qspec` had no qspec to propagate, so it was rejected and remained in float. The fix adds `_is_quantized_io_boundary`, which detects annotated `placeholder`/`output` nodes that signal the cluster is on the quantized data path even when their qspec is filtered. `_get_shared_clique` now returns a `touches_quantized_io` flag alongside the usual results. When `_annotate_shared_cluster` finds an empty `adjacent_qspecs` but a boundary-touching cluster, it initiates quantization from the global config input-activation qspec instead of rejecting. `_TOSAQuantizerV2.set_global` now also propagates to `shared_qspec_quantizer.global_config` so the fallback is wired automatically. This restores the correctness fix from D107320847, which was abandoned because its other fix (parameter-operand weight misclassification) had already been resolved via the `is_weight` `PARAMETER_TARGETS` refactor. This change was developed with assistance from Claude. Differential Revision: D108662081

pytorch-bot · 2026-06-15T22:37:45Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/20291

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures, 1 Cancelled Job

As of commit c0ac9b6 with merge base e257a71 ():

NEW FAILURES - The following jobs have failed:

Cadence Build & Test / hifi-build / hifi4 (gh)
Input required and not supplied: aws-region
Cadence Build & Test / vision-build / vision (gh)
Input required and not supplied: aws-region

CANCELLED JOB - The following job was cancelled. Please retry:

trunk / test-models-macos-coreml (vit) / macos-job (gh)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

meta-codesync · 2026-06-15T22:37:51Z

@rascani has exported this pull request. If you are a Meta employee, you can view the originating Diff in D108662081.

github-actions · 2026-06-15T22:38:41Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

AdrianLundell

Thanks for the fix!

For some context, ideally we would just leave all nodes now handled by the SharedQspecQuantizer un-annotated and just let them be handled by dtype propagation, the reason it is done this way is mainly to ensure we know what nodes are quantized and not at partition-time. If we could do that in a more clever way maybe we could avoid the SharedQspecQuantizer altogether.

digantdesai

Thanks

Runs lintrunner -a on the two files flagged by the Lint check on pytorch#20291 (UFMT import ordering and signature wrapping, DOCFORMATTER docstrings). Formatting only; no logic changes. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

meta-codesync · 2026-06-16T21:53:28Z

@rascani has imported this pull request. If you are a Meta employee, you can view this in D108662081.

rascani requested a review from digantdesai as a code owner June 15, 2026 22:37

rascani had a problem deploying to cadence June 15, 2026 22:37 — with GitHub Actions Failure

meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 15, 2026

meta-codesync Bot added the meta-exported label Jun 15, 2026

github-actions Bot added ciflow/trunk module: arm Issues related to arm backend labels Jun 15, 2026

rascani requested a review from AdrianLundell June 15, 2026 22:38

rascani mentioned this pull request Jun 16, 2026

Arm backend: Make composable_quantizer default #19758

Open

AdrianLundell approved these changes Jun 16, 2026

View reviewed changes

digantdesai approved these changes Jun 16, 2026

View reviewed changes

rascani had a problem deploying to cadence June 16, 2026 21:46 — with GitHub Actions Failure

rascani merged commit 218cc45 into pytorch:main Jun 17, 2026
488 of 493 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

V2 quantizer: fix IO-boundary shared clusters left in float#20291

V2 quantizer: fix IO-boundary shared clusters left in float#20291
rascani merged 2 commits into
pytorch:mainfrom
rascani:export-D108662081

rascani commented Jun 15, 2026 •

edited by pytorch-bot Bot

Loading

Uh oh!

pytorch-bot Bot commented Jun 15, 2026 •

edited

Loading

Uh oh!

meta-codesync Bot commented Jun 15, 2026

Uh oh!

github-actions Bot commented Jun 15, 2026

Uh oh!

AdrianLundell left a comment

Uh oh!

digantdesai left a comment

Uh oh!

meta-codesync Bot commented Jun 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

rascani commented Jun 15, 2026 • edited by pytorch-bot Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot Bot commented Jun 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/20291

❌ 2 New Failures, 1 Cancelled Job

Uh oh!

meta-codesync Bot commented Jun 15, 2026

Uh oh!

github-actions Bot commented Jun 15, 2026

This PR needs a release notes: label

Uh oh!

AdrianLundell left a comment

Choose a reason for hiding this comment

Uh oh!

digantdesai left a comment

Choose a reason for hiding this comment

Uh oh!

meta-codesync Bot commented Jun 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

rascani commented Jun 15, 2026 •

edited by pytorch-bot Bot

Loading

pytorch-bot Bot commented Jun 15, 2026 •

edited

Loading

This PR needs a `release notes:` label