Skip to content

Add status change topic#155

Open
kevinwallimann wants to merge 6 commits into
masterfrom
feature/151-add-status-change-topic
Open

Add status change topic#155
kevinwallimann wants to merge 6 commits into
masterfrom
feature/151-add-status-change-topic

Conversation

@kevinwallimann
Copy link
Copy Markdown

@kevinwallimann kevinwallimann commented May 20, 2026

Overview

See

Release Notes

  • Add status change topic

Related

Closes #151

Summary by CodeRabbit

  • New Features

    • Added a status-change event topic with a comprehensive validation schema for job lifecycle events.
  • Chores

    • Updated access configuration to include the new topic.
    • Updated configuration handling and unit tests to discover and list the new topic.

Review Change Stack

@coderabbitai
Copy link
Copy Markdown

coderabbitai Bot commented May 20, 2026

Walkthrough

This PR adds a new status_change topic to EventGate by introducing a topic constant, defining a JSON Schema with typed fields and conditional validation rules for status-change events, configuring access control, and integrating topic discovery and loading into the existing handler and config loader systems. Test coverage is updated to verify the integration.

Changes

Status Change Topic Support

Layer / File(s) Summary
Topic constant and schema definition
src/utils/constants.py, conf/topic_schemas/status_change.json, conf/access.json
Topic constant TOPIC_STATUS_CHANGE is defined with value public.cps.za.status-change. The JSON Schema defines typed properties (UUID/epoch-millisecond formats), constrains event_type to lifecycle values, and adds conditional validation via allOf blocks: created events require source/platform/input fields, and finished events require and restrict status_type to SUCCEEDED | FAILED | KILLED. Access control maps the topic to TestUser.
Handler and config loader integration
src/handlers/handler_topic.py, src/utils/config_loader.py
HandlerTopic imports the new constant and loads status_change.json during schema initialization. config_loader.py imports the constant and adds status_change.json to the topic filename-to-topic mapping in load_topic_names(), enabling automatic discovery.
Test coverage
tests/unit/handlers/test_handler_topic.py, tests/unit/utils/test_config_loader.py
Handler tests add status_change.json to mocked schemas and verify the parsed topic and endpoint response include public.cps.za.status-change. Config loader tests update fixtures and expectations to verify discovery and exclusion behavior with the additional schema.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~12 minutes

Poem

🐰 A small schema hops into place,
Fields aligned with tidy grace,
A constant, access, tests in flight,
Topics now wake to status light,
EventGate hums through day and night.

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name Status Explanation Resolution
Docstring Coverage ⚠️ Warning Docstring coverage is 66.67% which is insufficient. The required threshold is 80.00%. Write docstrings for the functions missing them to satisfy the coverage threshold.
✅ Passed checks (4 passed)
Check name Status Explanation
Title check ✅ Passed The title 'Add status change topic' is concise and directly reflects the main change—adding a new status_change topic to EventGate.
Description check ✅ Passed The PR description includes an Overview section with documentation references and a Release Notes section, matching the repository template structure.
Linked Issues check ✅ Passed The PR successfully implements all acceptance criteria from issue #151: defines the status_change topic with JSON schema, makes it available in EventGate, and supports producer/consumer operations via existing API mechanisms.
Out of Scope Changes check ✅ Passed All changes are narrowly scoped to adding the status_change topic infrastructure; no extraneous modifications to unrelated features or systems are present.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.

✨ Finishing Touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Commit unit tests in branch feature/151-add-status-change-topic

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

Copy link
Copy Markdown

@coderabbitai coderabbitai Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 2

🧹 Nitpick comments (1)
tests/unit/handlers/test_handler_topic.py (1)

90-90: ⚡ Quick win

Add one POST-path test for public.cps.za.status-change schema validation.

Current additions validate discovery/listing only. A valid + invalid POST case for the new topic would protect the end-to-end contract (load + validate + reject bad payloads) from regressions.

Also applies to: 103-107, 117-117

🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/unit/handlers/test_handler_topic.py` at line 90, Add a new test
function (e.g., test_post_status_change_schema_validation) in the existing
test_handler_topic suite that exercises the POST path for the schema named
"public.cps.za.status-change": send one valid POST payload (matching the schema,
e.g., including execution_id as string) and assert the handler accepts it
(status accepted/success), then send an obviously invalid payload (e.g.,
execution_id missing or wrong type) and assert the handler rejects it with a
validation error (400 or rejection response and an error message referencing the
schema/field); place both assertions in the same test to ensure load+validate
behavior is covered end-to-end.
🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@conf/topic_schemas/status_change.json`:
- Around line 20-33: The schema currently treats tenant_id as optional and lacks
explicit previous_state/new_state fields; update status_change.json to make the
application identifier required (mark "tenant_id" as required in the root
"required" array or otherwise add a required "application_id" equivalent), add
"previous_state" and "new_state" properties (type "string", non-nullable) to the
schema, and include those two fields in the schema's "required" array so every
status_change event must carry tenant_id plus previous_state and new_state;
apply the same changes to the other schema blocks noted (lines referenced in the
review).
- Around line 52-55: The schema defines "timestamp_event" as epoch milliseconds
but uses "type": "number" which allows fractions; change the "timestamp_event"
field in the JSON schema to use "type": "integer" (and add "minimum": 0 if you
want to enforce non-negative timestamps) so the schema enforces
whole-millisecond epoch values.

---

Nitpick comments:
In `@tests/unit/handlers/test_handler_topic.py`:
- Line 90: Add a new test function (e.g.,
test_post_status_change_schema_validation) in the existing test_handler_topic
suite that exercises the POST path for the schema named
"public.cps.za.status-change": send one valid POST payload (matching the schema,
e.g., including execution_id as string) and assert the handler accepts it
(status accepted/success), then send an obviously invalid payload (e.g.,
execution_id missing or wrong type) and assert the handler rejects it with a
validation error (400 or rejection response and an error message referencing the
schema/field); place both assertions in the same test to ensure load+validate
behavior is covered end-to-end.
🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

  • Push a commit to this branch (recommended)
  • Create a new PR with the fixes

ℹ️ Review info
⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 469ed8af-1aad-4a69-aa28-c2f10a0eb96a

📥 Commits

Reviewing files that changed from the base of the PR and between a0b6f6f and a6dc8b4.

📒 Files selected for processing (7)
  • conf/access.json
  • conf/topic_schemas/status_change.json
  • src/handlers/handler_topic.py
  • src/utils/config_loader.py
  • src/utils/constants.py
  • tests/unit/handlers/test_handler_topic.py
  • tests/unit/utils/test_config_loader.py

Comment thread conf/topic_schemas/status_change.json
Comment thread conf/topic_schemas/status_change.json
kevinwallimann and others added 2 commits May 20, 2026 16:49
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
Copy link
Copy Markdown

@coderabbitai coderabbitai Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🧹 Nitpick comments (1)
conf/topic_schemas/status_change.json (1)

34-41: 💤 Low value

Minor: stray leading space in source_app description.

Line 36's description starts with a leading space (" Standardized source application name..."). Trivial cleanup while you're touching the field.

✏️ Proposed tweak
-            "description": " Standardized source application name (aqueduct, unify, lum, etc)"
+            "description": "Standardized source application name (aqueduct, unify, lum, etc)"
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@conf/topic_schemas/status_change.json` around lines 34 - 41, The description
value for the JSON schema field "source_app" contains a stray leading space;
update the "source_app" property's "description" (in status_change.json) to
remove the leading whitespace so it reads "Standardized source application name
(aqueduct, unify, lum, etc)" exactly, ensuring the JSON string has no extra
leading character.
🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Nitpick comments:
In `@conf/topic_schemas/status_change.json`:
- Around line 34-41: The description value for the JSON schema field
"source_app" contains a stray leading space; update the "source_app" property's
"description" (in status_change.json) to remove the leading whitespace so it
reads "Standardized source application name (aqueduct, unify, lum, etc)"
exactly, ensuring the JSON string has no extra leading character.

ℹ️ Review info
⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 61200d85-a19c-420a-9e0c-38cd27ac8d69

📥 Commits

Reviewing files that changed from the base of the PR and between a6dc8b4 and c6e6750.

📒 Files selected for processing (1)
  • conf/topic_schemas/status_change.json

Copy link
Copy Markdown
Collaborator

@oto-macenauer-absa oto-macenauer-absa left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I see couple issue that should be fixed:
postgres writer - this one is not implemented - it requires specific sql defined, could be opted out if sql peristence is not desired but with the current state it will spam "unknow topic"errors
kafka writer - at the moment the key for the message is not used, but with this topic, if we need the ordering, it should use some key

is the ADR going to be part of the commit? There are some inconsistencies, e.g. country_code vs country, incomplete events, also isn't the status_type duplicate with the event_type?

Also I'm not sure EventGate works with the "allOf" conditional validation, this should be tested.

Comment thread src/utils/constants.py
TOPIC_RUNS = "public.cps.za.runs"
TOPIC_DLCHANGE = "public.cps.za.dlchange"
TOPIC_TEST = "public.cps.za.test"
TOPIC_STATUS_CHANGE = "public.cps.za.status-change"
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
TOPIC_STATUS_CHANGE = "public.cps.za.status-change"
TOPIC_STATUS_CHANGE = "public.cps.za.status_change"

I'd rather use snake case for the topic names (we use the same for the filenames)

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm in favour of snake_case too

Comment thread src/utils/constants.py
TOPIC_TEST = "public.cps.za.test"
TOPIC_STATUS_CHANGE = "public.cps.za.status-change"

SUPPORTED_WRITE_TOPICS: frozenset[str] = frozenset({TOPIC_RUNS, TOPIC_DLCHANGE, TOPIC_TEST})
Copy link
Copy Markdown
Collaborator

@oto-macenauer-absa oto-macenauer-absa May 25, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

missing the status_change topic in SUPPORTED_WRITE_TOPICS

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hmm also there is no addition of this new queue in src/writers/sql/inserts.sql - do we want to push it into our Postgres for analytical purposes or no? It might become quite massive over time though

"description": "Environment (dev, uat, pre-prod, prod, test or others)"
},
"timestamp_event": {
"type": "integer",
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The runs schema uses "type": "number" for timestamps. Herei s "integer". While epoch milliseconds are integers, the inconsistency may confuse producers.

"properties": {
"event_type": {
"enum": [
"JobCreatedEvent",
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ok, I like that you thought about it, but aren't half of these already mandatory? These: source_app and source_app_version and environment so to me only the last 2 make sense here unless I misunderstood how it works.


Another thing - maybe some other event types can also require some mandatory attributes?

Like, JobUpdatedEvent might require status_type or status_detail - this is the primary change that the event would emit, right? Otherwise, what's there to update?


Thinking about this a bit more - mandatory parameters:

  • Consider adding job_name and perhaps even definition_id as required on creation events (identity should be established at creation)
  • Consider making status_type required for "JobCreatedAndStartedEvent", "JobStartedEvent", "JobUpdatedEvent"

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Add status_change topic support to EventGate

3 participants