Fix reasoning parsing when streaming from AWS SageMaker AI by alvarobartt · Pull Request #2265 · strands-agents/harness-sdk

alvarobartt · 2026-05-08T09:35:57Z

Description

As of vllm-project/vllm#33402, released in vLLM v0.16.0 (see https://github.com/vllm-project/vllm/releases/tag/v0.16.0), the reasoning_content field within the streaming delta has been deprecated in favour of reasoning.

The latest AWS SageMaker AI DLC for vLLM runs vLLM v0.20.1 (see https://aws.github.io/deep-learning-containers/vllm/ and https://github.com/aws/deep-learning-containers/blob/9d519f6bca375b87422e5429803e7f2c3ca390df/docker/vllm/Dockerfile#L3), so without this change the reasoning content is ignored when streaming, whereas with this PR it is parsed correctly.
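To illustrate the failure mode, here is a minimal sketch. It assumes the streaming deltas are plain dicts in the OpenAI-compatible shape. The two payloads and the `legacy_reasoning` function are hypothetical and only mimic a parser that looks up the deprecated key alone; they are not the code in sagemaker.py.

```python
from typing import Optional

# Hypothetical streaming deltas; real payloads carry more fields.
old_delta = {"reasoning_content": "Let me think..."}  # deprecated field name
new_delta = {"reasoning": "Let me think..."}          # replacement field name


def legacy_reasoning(delta: dict) -> Optional[str]:
    """Mimic a parser that only knows the deprecated key (assumption)."""
    return delta.get("reasoning_content")


print(legacy_reasoning(old_delta))  # Let me think...
print(legacy_reasoning(new_delta))  # None
```

A parser written this way silently drops the reasoning text once the server only sends the new key, which matches the behaviour described above.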

Note

By submitting this PR, I disclose that all the code in this PR was written entirely by me, @alvarobartt, without the use of any coding assistants or third-party agentic tools.

Related Issues

#2182 and #2191, though both wrongly attribute the issue to vLLM v0.19.1, whereas the release notes at https://github.com/vllm-project/vllm/releases/tag/v0.16.0 show that the change applies from v0.16.0 onwards.

Type of Change

Bug fix

Testing

How have you tested the change? Verify that the changes do not break functionality or introduce warnings in consuming repositories: agents-docs, agents-tools, agents-cli

I ran hatch run prepare

Checklist

I have read the CONTRIBUTING document
I have added any necessary tests that prove my fix is effective or my feature works
I have updated the documentation accordingly
I have added an appropriate example to the documentation to outline the feature, or no new docs are needed
My changes generate no new warnings
Any dependent changes have been merged and published

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

codecov · 2026-05-08T16:09:26Z

Codecov Report

❌ Patch coverage is 0% with 1 line in your changes missing coverage. Please review.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| src/strands/models/sagemaker.py | 0.00% | 0 Missing and 1 partial ⚠️ |


github-actions · 2026-05-08T16:22:50Z

Issue: No unit tests cover the reasoning content streaming path. Codecov confirms 0% patch coverage. There are no existing tests for reasoning content in test_sagemaker.py at all.

Suggestion: Please add at minimum:

A test for streaming with "reasoning_content" key (backwards compatibility)
A test for streaming with "reasoning" key (new vLLM v0.16.0+ behavior)
A test for the non-streaming path with the same keys

These should be straightforward to add following the existing test_stream_with_streaming_enabled pattern.

github-actions · 2026-05-08T16:22:51Z

Assessment: Request Changes

The fix addresses a real issue (vLLM v0.16.0+ deprecating reasoning_content in favor of reasoning), but the current implementation has a subtle correctness issue: it checks for key existence rather than value truthiness, which differs from how text content is handled on the line above and from other model providers. This could cause empty/None reasoning deltas to be emitted.

Review Categories

Correctness: The any(k in dict) pattern + dict.get(key, default) approach has edge cases where None values slip through. Using or-based fallback is both simpler and more robust.
Completeness: The non-streaming path (line 461) has the same bug but wasn't fixed. Both paths should be consistent.
Testing: No unit tests exist for reasoning content parsing. The PR checklist also indicates tests were not added.
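The correctness point can be sketched as follows. This is an illustration under assumptions, not the PR's code: the delta payload is hypothetical, and the two functions only approximate the key-existence pattern and the or-based fallback that the review describes.

```python
from typing import Optional

# Hypothetical delta where the deprecated key is present but null.
delta = {"reasoning_content": None, "reasoning": "step 1"}


def by_key_existence(d: dict) -> Optional[str]:
    # The check passes because the keys exist, but dict.get only falls back
    # to its default when the key is missing, so the None value is returned.
    if any(k in d for k in ("reasoning_content", "reasoning")):
        return d.get("reasoning_content", d.get("reasoning"))
    return None


def by_truthiness(d: dict) -> Optional[str]:
    # `or` skips None and empty strings, so the first usable value wins.
    return d.get("reasoning_content") or d.get("reasoning") or None


print(by_key_existence(delta))  # None
print(by_truthiness(delta))     # step 1
```

The same truthiness-based expression also returns None for a delta such as `{"reasoning": ""}`, so an empty reasoning delta would not be emitted.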

Thanks for tracking down the root cause to the specific vLLM version change — the PR description is very well-researched.

awsarron

Thanks @alvarobartt for your contribution!

I added a couple of minor comments. Could we also add a unit test to ensure that this behaves as expected and doesn't regress in the future?

awsarron · 2026-06-16T15:29:03Z

+                        # NOTE: Both `reasoning` and `reasoning_content` need to be handled as vLLM v0.16.0 deprecated
+                        # the `reasoning_content` in favour of `reasoning`
+                        # See https://github.com/vllm-project/vllm/pull/33402
+                        if any(


this is a bit hard to parse at a glance, could we simplify it, perhaps place in a private function and call that to make things more readable?

awsarron · 2026-06-16T15:29:34Z

-                                    "data": choice["delta"]["reasoning_content"],
+                                    # SAFETY: Here we guarantee that at least one of `reasoning` or `reasoning_content`
+                                    # is not None and a  non-empty string
+                                    "data": choice["delta"].get("reasoning_content")


nit: could we write this on one line?

awsarron · 2026-06-16T15:31:08Z

We'll need to rebase src/strands/models/sagemaker.py too

Fix reasoning parsing when streaming from AWS SageMaker AI

db4dd60

alvarobartt temporarily deployed to manual-approval May 8, 2026 09:36 — with GitHub Actions Inactive

github-actions Bot added the size/xs label May 8, 2026

alvarobartt had a problem deploying to manual-approval May 8, 2026 09:36 — with GitHub Actions Failure

github-actions Bot added the strands-running label May 8, 2026

github-actions Bot reviewed May 8, 2026


github-actions Bot reviewed May 8, 2026


github-actions Bot removed the strands-running label May 8, 2026

Fix {reasoning,reasoning_content} check

2f3d143

github-actions Bot added size/xs and removed size/xs labels May 12, 2026

alvarobartt had a problem deploying to manual-approval May 12, 2026 09:16 — with GitHub Actions Failure

Fix data assignment and add safety note

87bb5eb

github-actions Bot added size/xs and removed size/xs labels May 12, 2026

alvarobartt had a problem deploying to manual-approval May 12, 2026 09:18 — with GitHub Actions Failure

yonib05 added the area-model, python, and bug labels May 27, 2026

yonib05 removed the python label Jun 9, 2026

awsarron requested changes Jun 16, 2026

alvarobartt wants to merge 3 commits into strands-agents:main from alvarobartt:main

3 participants
