fix(core): split comma-separated tags in parse_tags by phernandez · Pull Request #911 · basicmachines-co/basic-memory

phernandez · 2026-06-08T02:17:54Z

Summary

bm tool write-note --tags "a,b" stored a single mangled tag a,b instead of splitting into ["a", "b"], diverging from the MCP write_note(tags="a,b") convention. Typer collects a single --tags "a,b" value into the one-element list ["a,b"], and parse_tags() only split a bare string on commas — list elements were returned verbatim, so the comma survived as part of a single literal tag.

parse_tags() (src/basic_memory/utils.py) now flattens by splitting each list element on commas before stripping #/whitespace, so "a,b", ["a,b"], and ["a", "b"] all converge to ["a", "b"]. This is the single source of truth, so CLI --tags "a,b", CLI --tags a --tags b, and the MCP comma/list/JSON forms now agree.

The search_notes tags coercer is intentionally left unchanged — its comma behavior is tracked separately as #910.

Bugs fixed

Bug feat: build installer via github action #7: CLI write-note --tags "a,b" does not split comma-separated tags (diverges from MCP write_note).

Testing

uv run pytest test-int/bughunt_fixes/test_parse_tags_comma_split_integration.py -q --no-cov — 2 passed (CLI/MCP convergence + MCP baseline; the bug-hunt integration test that failed on main now passes).
uv run pytest tests/utils/test_parse_tags.py -q --no-cov — 23 passed (added list-comma parametrize cases).
uv run pytest tests/mcp/test_tool_write_note.py test-int/cli/ tests/mcp/test_tool_contracts.py tests/mcp/test_tool_telemetry.py -q --no-cov — 116 passed (no regressions; tool signatures unchanged).
uv run ruff check . --fix, uv run ruff format ., uv run ty check src tests test-int — all clean for scoped files.

Risk

Low. Behavior change is confined to the parse_tags() list branch and only affects list elements that themselves contain commas — previously these produced an invalid single-literal tag. No existing test relied on commas being preserved inside a list element. The search-notes coercer is untouched.

🤖 Generated with Claude Code

Bug #7 (integration bug hunt): `bm tool write-note --tags "a,b"` stored one mangled tag "a,b" instead of splitting into ["a","b"], diverging from the MCP write_note convention. Typer collects a single `--tags "a,b"` into the one-element list ['a,b'], and parse_tags() only split a bare string on commas, leaving list elements verbatim. parse_tags() now flattens by splitting each list element on commas, so "a,b", ["a,b"], and ["a","b"] all converge to ["a","b"]. The search_notes 'tags' coercer is intentionally left unchanged (tracked separately as #910). Tests come from the integration bug hunt: a CLI/MCP convergence test under test-int/bughunt_fixes/ plus unit parametrize cases in tests/utils/test_parse_tags.py. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> Signed-off-by: phernandez <paul@basicmachines.co>

github-actions · 2026-06-08T02:19:05Z

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: dfef6f4464

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-06-08T02:20:52Z

+        return [
+            tag.strip().lstrip("#")
+            for raw in tags
+            for tag in str(raw).split(",")


Skip null list entries before stringifying

When frontmatter or a JSON tag array contains a null entry, e.g. tags: [alpha, null], normalize_frontmatter_metadata() preserves that None and passes the list into parse_tags(). The old list branch skipped falsy entries, but this change stringifies each raw item before filtering, so None becomes the literal tag "None" and can be indexed or written back as a real tag. Filter raw/None before splitting so comma-separated strings are still flattened without reviving previously ignored null tag values.

Useful? React with 👍 / 👎.

Codex review: parse_tags must not revive a null list entry (YAML 'tags: [alpha, null]') as the literal tag 'None' — skip None before stringifying. Also fix the comma-split integration test's frontmatter assertion to use splitlines() so it passes on Windows CRLF. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> Signed-off-by: phernandez <paul@basicmachines.co>

github-actions · 2026-06-08T02:50:18Z

Claude finished @phernandez's task in 1m 35s —— View job

PR Review: fix(core): split comma-separated tags in parse_tags

Tasks:

Gather context and understand the request
Read changed files (current state after 2nd commit)
Review code against team checklist
Post review findings

Summary

This is a well-scoped, clean bug fix. The root cause (Typer collecting --tags "a,b" into ["a,b"] which bypassed the comma-split path) is correctly diagnosed and fixed at the single source of truth. The second commit (2be0731) proactively addressed the Codex finding about None list entries being stringified to the literal tag "None" — that's a good sign of attentive iteration.

Team Checklist

Code Quality & Standards

Follows Basic Memory coding conventions — Trigger/Why/Outcome comment pattern used correctly (utils.py:529–535)
Python 3.12+ type annotations maintained — no regressions
100-character line limit respected
No commented-out code blocks
SQLAlchemy / FastAPI / Typer conventions not touched (out of scope)

Testing & Documentation

Unit tests added — 3 new parametrize cases in tests/utils/test_parse_tags.py (list-comma split, mixed list+comma, None entry)
Integration tests added — test-int/bughunt_fixes/test_parse_tags_comma_split_integration.py with both CLI and MCP convergence assertions
Edge cases covered — double-comma ("a,,b"), trailing/leading whitespace, None entries, mixed lists
100% coverage maintained — no net-new uncovered lines introduced

Basic Memory Architecture

Atomic, composable change — only parse_tags() in utils.py modified
No database schema changes (no migrations needed)
Local-first architecture unaffected
PR correctly defers search_notes tag coercion divergence to issue [BUG] search_notes(tags="a,b") ignores comma-separated tags (inconsistent with the tag: query shorthand) #910

Security & Performance

No hardcoded secrets or credentials
No sensitive data in logs
Input validation maintained (falsy-filter if tag and tag.strip() handles double-commas and whitespace-only splits)
No Windows/unicode compatibility regressions

Compatibility

No emoji/unicode in console output
Integration test uses content.splitlines() for line-ending agnosticism (CRLF-tolerant — addressed in 2nd commit)

Findings

No blocking issues. Three minor observations for awareness:

1. Type annotation doesn't reflect None list elements (utils.py:511)

def parse_tags(tags: Union[List[str], str, None]) -> List[str]:

The function now explicitly handles None inside the list (which the old code silently ignored via the falsy filter). The accurate annotation in Python 3.12 style would be list[str | None] | str | None. This is low-priority since static checkers won't flag callers passing None elements anyway (YAML frontmatter parsing produces these at runtime), but it's a documentation gap. Fix this →

2. Pre-existing inline import json inside parse_tags() (utils.py:547)

json is already imported at module top (line 3). The inline import is pre-existing and not introduced by this PR — noting only for completeness, not a blocker.

3. New test-int/bughunt_fixes/ directory convention

This is a new directory pattern without precedent in test-int/. It's properly structured (with __init__.py and a module docstring). If more regression guards accumulate, the team may want to settle on a canonical name (e.g., test-int/regression/ or test-int/bughunts/). Not a blocker for this PR.

Conclusion

The fix is correct and the None-entry guard (added in the 2nd commit in response to the Codex review) closes the only realistic edge case. The integration test verifies CLI ↔ MCP convergence end-to-end, which is the key behavioral guarantee. Ready to merge.

chatgpt-codex-connector Bot reviewed Jun 8, 2026

View reviewed changes

phernandez merged commit 85a8e59 into main Jun 8, 2026
25 checks passed

phernandez deleted the fix/parse-tags-comma branch June 8, 2026 03:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(core): split comma-separated tags in parse_tags#911

fix(core): split comma-separated tags in parse_tags#911
phernandez merged 2 commits into
mainfrom
fix/parse-tags-comma

phernandez commented Jun 8, 2026

Uh oh!

github-actions Bot commented Jun 8, 2026 •

edited

Loading

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot Jun 8, 2026

Uh oh!

github-actions Bot commented Jun 8, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

phernandez commented Jun 8, 2026

Summary

Bugs fixed

Testing

Risk

Uh oh!

github-actions Bot commented Jun 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Review: fix(core): split comma-separated tags in parse_tags

Summary

Team Checklist

Code Quality & Standards

Testing & Documentation

Basic Memory Architecture

Security & Performance

Findings

Conclusion

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented Jun 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Review: fix(core): split comma-separated tags in parse_tags

Summary

Team Checklist

Code Quality & Standards

Testing & Documentation

Basic Memory Architecture

Security & Performance

Compatibility

Findings

Conclusion

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

github-actions Bot commented Jun 8, 2026 •

edited

Loading

github-actions Bot commented Jun 8, 2026 •

edited

Loading