fix(mcp): split comma-separated tags in search_notes tags param by phernandez · Pull Request #932 · basicmachines-co/basic-memory

phernandez · 2026-06-09T23:12:15Z

Summary

search_notes(tags="alpha,beta") returned zero results because BeforeValidator(coerce_list) wrapped the bare string as the single literal tag ["alpha,beta"]. This was inconsistent with the tag: query shorthand in the same tool (which splits commas) and with write_note's documented tags convention.

The fix swaps the validator to parse_tags, which handles real lists, JSON-array strings, comma-separated strings, and strips leading # — keeping all tag entry points on one source of truth. coerce_list is left unchanged because canvas.py uses it for nodes/edges (lists of dicts) that must never comma-split.

Note: comma-split behavior for note_types / entity_types / categories is deliberately out of scope here and tracked in a follow-up issue.

Test plan

uv run pytest tests/mcp/test_tool_search.py -x -q — 55 passed
- New annotation wiring test validates the tags parameter metadata directly via pydantic.TypeAdapter (comma string, JSON-array string, real list, single string)
- New end-to-end test exercises the full MCP validation path via fastmcp Client, asserting comma-string and real-list inputs behave identically, plus a negative control proving the filter still applies

Closes #910

🤖 Generated with Claude Code

Reviewed SHA: 2c15c55f10e103231a979355300a859e3ca47ade
Verdict: approve
Status: success - BM Bossbot approved this head SHA

Summary:
No blocking issues found. The change correctly routes search_notes tag parameter validation through parse_tags, preserves coerce_list for other list-shaped parameters, and adds focused regression coverage for both annotation validation and the FastMCP tool path.

Blocking findings:

None

Non-blocking findings:

None

BM Bossbot image choices

Pull request: #932
Generated asset: /home/runner/work/basic-memory/basic-memory/docs/assets/infographics/pr-932.webp
Image model: gpt-image-2
Size: 1536x1024
Quality: high
Image mode: editorial-image
Theme source: auto

Theme / visual direction:

classic literature: sea voyages, gothic manors, Dickensian streets, library marginalia, and travel-journal artifacts

search_notes(tags="alpha,beta") returned zero results because BeforeValidator(coerce_list) wrapped the bare string as the single literal tag ["alpha,beta"]. This was inconsistent with the tag: query shorthand in the same tool (which splits commas) and with write_note's documented tags convention. Use parse_tags as the validator instead: it handles real lists, JSON-array strings, comma-separated strings, and strips leading '#', keeping all tag entry points on one source of truth. coerce_list is left unchanged because canvas.py uses it for nodes/edges (lists of dicts) that must never comma-split. Regression tests cover the annotation wiring directly and the full MCP validation path via fastmcp Client, asserting comma-string and real-list inputs behave identically, plus a negative control proving the filter still applies. Fixes #910 Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> Signed-off-by: phernandez <paul@basicmachines.co>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: bee3ca41da

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-06-09T23:14:42Z

    tags: Annotated[
        List[str] | None,
-        BeforeValidator(coerce_list),
+        BeforeValidator(parse_tags),


Reject malformed tag values before parsing

When an MCP client sends a malformed tags value such as a JSON number or object, this validator now accepts it because parse_tags stringifies unsupported types (for example 42 becomes ["42"]) before Pydantic validates List[str]. With the previous coerce_list path those inputs were left as non-lists and rejected, so this weakens the tool boundary and turns caller mistakes into confusing no-result searches with bogus tag filters. Consider wrapping parse_tags here so only str, list, or None are accepted, or tightening parse_tags without breaking its frontmatter callers.

Useful? React with 👍 / 👎.

Addressed in follow-up PR #941: the tags BeforeValidator now uses a new strict_search_tags wrapper (utils.py) that only normalizes str/list/None via parse_tags and passes any other type through unchanged, so Pydantic rejects tags=42 / tags={"a": 1} with a clear validation error instead of stringifying them into silent no-result searches. parse_tags itself is unchanged for its frontmatter/write_note callers.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 2c15c55f10

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-06-10T02:57:25Z

    tags: Annotated[
        List[str] | None,
-        BeforeValidator(coerce_list),
+        BeforeValidator(parse_tags),


Normalize tags for direct tool callers too

Because the comma splitting lives only in the BeforeValidator, it is skipped by direct callers of search_notes; I checked src/basic_memory/cli/commands/tool.py, and the basic-memory tool search-notes command imports this function as mcp_search and calls it directly with Typer's tags list. In that path, --tag alpha,beta is still forwarded as ["alpha,beta"], so the search looks for a literal comma tag and returns no matches even though the newly documented MCP behavior says comma-separated tags should match tag: shorthand/write-note conventions. Consider normalizing tags inside search_notes before it is merged into SearchQuery, not only at the MCP validation boundary.

Useful? React with 👍 / 👎.

Addressed in follow-up PR #941: search_notes now also runs parse_tags(tags) in the function body (alongside the other filter-param normalization, before the tag: shorthand merge and SearchQuery construction), so direct callers like bm tool search-notes --tag alpha,beta (cli/commands/tool.py) get the same comma-split semantics as the MCP path. The BeforeValidator is retained for boundary strictness, and parse_tags is idempotent so MCP-validated input passes through unchanged. Covered by a direct-call regression test.

Integration regression test for #910: the tags= parameter and the tag:alpha,beta query shorthand must return the same results for the same comma string. Cherry-picked from #918. The validator swap itself (coerce_list -> parse_tags) landed separately via #932; this carries the remaining test-only piece with original authorship. Signed-off-by: K Jagadeeswara Reddy <social@jagadeeswar.com>

…llers (#941) Refs #910. Follow-up to #932. Signed-off-by: phernandez <paul@basicmemory.com>

chatgpt-codex-connector Bot reviewed Jun 9, 2026

View reviewed changes

phernandez added 2 commits June 9, 2026 19:38

Merge branch 'main' into fix/910-search-tags-comma-split

0a33507

Merge branch 'main' into fix/910-search-tags-comma-split

2c15c55

chatgpt-codex-connector Bot reviewed Jun 10, 2026

View reviewed changes

phernandez merged commit ca9a4d9 into main Jun 10, 2026
27 checks passed

phernandez deleted the fix/910-search-tags-comma-split branch June 10, 2026 03:23

phernandez added a commit that referenced this pull request Jun 10, 2026

fix(mcp): tighten search_notes tags input and normalize for direct ca…

df485aa

…llers (#941) Refs #910. Follow-up to #932. Signed-off-by: phernandez <paul@basicmemory.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(mcp): split comma-separated tags in search_notes tags param#932

fix(mcp): split comma-separated tags in search_notes tags param#932
phernandez merged 3 commits into
mainfrom
fix/910-search-tags-comma-split

phernandez commented Jun 9, 2026 •

edited by github-actions Bot

Loading

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot Jun 9, 2026

Uh oh!

phernandez Jun 10, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot Jun 10, 2026

Uh oh!

phernandez Jun 10, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

phernandez commented Jun 9, 2026 • edited by github-actions Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Jun 9, 2026

Choose a reason for hiding this comment

Uh oh!

phernandez Jun 10, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Jun 10, 2026

Choose a reason for hiding this comment

Uh oh!

phernandez Jun 10, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

phernandez commented Jun 9, 2026 •

edited by github-actions Bot

Loading