feat: [DNM] Async instant generation fixes over 0.14.1 by danny0405 · Pull Request #18767 · apache/hudi

danny0405 · 2026-05-18T06:42:17Z

Describe the issue this Pull Request addresses

Summary and Changelog

Impact

Risk Level

Documentation Update

Contributor's checklist

Read through contributor's guide
Enough context is provided in the sections above
Adequate tests were added if applicable

The new instant time generation uses RPC requests to coordinate the creation of new instants. Each write task sends an RPC request to the coordinator for the instant time, and the coordinator uses a global lock to guard access from multiple tasks. One checkpoint id now corresponds to one instant.

Basic workflow:

* write task sends request: current ckp id
* write task expects response: the instant time
* coordinator mappings of checkpoint and instant: ckp-id → {instant → {write-task-id → write meta event}}

Note that the ckp id used in the request is the last known id rather than the current checkpoint: if a task is restored from a state of the current job, it is the state checkpoint id; otherwise it is -1 for a fresh new job. (cherry picked from commit e34a7ab)
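As a rough illustration of the request/response flow described in this commit message, the following minimal sketch models a coordinator that hands out one instant per checkpoint id under a single lock. Every name here (`InstantCoordinatorSketch`, `requestInstant`, `recordEvent`, the `instant-NNN` format) is hypothetical and not taken from the Hudi code base; write meta events are plain strings, and keying the instant by the id carried in the request is an assumption made for the sketch.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical sketch of the coordinator side; not the actual Hudi implementation. */
class InstantCoordinatorSketch {
  // ckp-id -> instant time: one checkpoint id corresponds to one instant.
  private final Map<Long, String> instantByCheckpoint = new HashMap<>();
  // instant -> (write-task-id -> write meta event); events are plain strings here.
  private final Map<String, Map<Integer, String>> eventsByInstant = new HashMap<>();
  private long counter = 0;

  /**
   * Handles a request from a write task. The task sends its last known
   * checkpoint id (-1 for a fresh job). `synchronized` stands in for the
   * global lock that guards access from multiple tasks.
   */
  synchronized String requestInstant(long lastKnownCkpId) {
    String instant = instantByCheckpoint.get(lastKnownCkpId);
    if (instant == null) {
      // First request for this checkpoint id: create the instant once.
      counter++;
      instant = String.format("instant-%03d", counter);
      instantByCheckpoint.put(lastKnownCkpId, instant);
      eventsByInstant.put(instant, new HashMap<>());
    }
    return instant;
  }

  /** Records the write meta event reported by a task for an instant. */
  synchronized void recordEvent(String instant, int writeTaskId, String event) {
    eventsByInstant.computeIfAbsent(instant, k -> new HashMap<>()).put(writeTaskId, event);
  }

  /** Number of tasks that have reported an event for the instant. */
  synchronized int eventCount(String instant) {
    Map<Integer, String> events = eventsByInstant.get(instant);
    return events == null ? 0 : events.size();
  }
}
```

In this sketch, two tasks that send the same checkpoint id receive the same instant, while a later checkpoint id yields a new one.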

…ite path (apache#18264) (cherry picked from commit 69e24ea)

hudi-bot · 2026-05-18T07:00:24Z

CI report:

8392973 Azure: FAILURE

Bot commands

@hudi-bot supports the following commands:

@hudi-bot run azure: re-run the last Azure build

hudi-agent

🤖 This review was generated by an AI agent and may contain mistakes. Please verify any suggestions before applying.

Thanks for the contribution! This PR (chunk 2 of 2) is part of the async instant generation overhaul. It replaces the bootstrap-event-driven instant initialization with an explicit request/response pattern (Correspondent) and reworks the related test harness around per-checkpoint instant requests. The production code surface is small here (thread-factory plumbing, a forced LAZY cleaning policy under non-blocking instant generation, and removing the fixed parallelism on the dummy sink); most of the diff is test-harness restructuring around the new flow.

One thing worth a look is in the inline comments: the silent policy override in FlinkWriteClients. Please take a look; after that, this should be ready for a Hudi committer or PMC member to take it from here.

There are a couple of small readability issues below (a dead local variable and a copy-paste doc error), plus a few small naming and type-consistency nits. Overall the refactoring is clean and the new abstractions (Correspondent, EventBuffers, CommitGuard) are well-structured.

hudi-agent · 2026-05-18T16:30:10Z

      // open the function and ingest data
      this.pipeline.openFunction();
-      this.ckpMetadata = CkpMetadata.getInstance(conf);
+      HoodieWriteConfig writeConfig = this.pipeline.getCoordinator().getWriteClient().getConfig();


🤖 nit: writeConfig is declared but never used — it looks like dead code left over from replacing the ckpMetadata assignment. Could you either wire it into the rest of the method or remove it?

AI-generated; verify before applying. React 👍/👎 to flag quality.

hudi-agent · 2026-05-18T16:30:10Z

+    /**
+     * Asserts the data files are empty.
+     */
+    public TestHarness assertEmptyBaseFiles() {


🤖 nit: the Javadoc (line 503) is a copy-paste from assertEmptyDataFiles — it says "data files" instead of "base files". The assertion message on the next line has the same copy-paste issue ("No data files should have been created" → "No base files should have been created").


hudi-agent · 2026-05-18T16:30:10Z

    this.taskID = taskID;
  }

+  public Long getCheckpointId() {


🤖 nit: could you change the return type to primitive long here? The underlying field is long and setCheckpointId also takes long, so returning boxed Long implies the value could be null when it never can be. Keeping it primitive stays consistent with getTaskID() and avoids any misleading nullable signal to callers.
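A minimal sketch of the suggested signature, assuming a plain holder class. `WriteEventSketch` is a hypothetical name; only `getCheckpointId`, `setCheckpointId`, and `getTaskID` come from the comment above, and the field types are assumed from it.

```java
/** Hypothetical holder class illustrating the primitive getter; not the actual Hudi class. */
class WriteEventSketch {
  private int taskID;
  private long checkpointId;

  int getTaskID() {
    return taskID;
  }

  void setTaskID(int taskID) {
    this.taskID = taskID;
  }

  // Returns primitive long: the field can never be null, so the signature
  // should not suggest that callers need a null check.
  long getCheckpointId() {
    return checkpointId;
  }

  void setCheckpointId(long checkpointId) {
    this.checkpointId = checkpointId;
  }
}
```

With the primitive return type, callers can also compare ids with `==` without running into boxed reference comparison.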


hudi-agent · 2026-05-18T16:30:10Z

+
+  private Thread t;
+
+  public ExplicitClassloaderThreadFactory(


🤖 nit: the single-character field name t makes the guard check if (t != null) in newThread() harder to parse at a glance — it's not immediately obvious what t represents. Renaming it to thread would make the intent self-explanatory.
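To show how the rename reads in context, here is a small sketch of a thread factory that sets an explicit context classloader. It is not the PR's `ExplicitClassloaderThreadFactory`: the class name, constructor parameters, and the behavior of the guard (rejecting a second thread) are all assumptions made for illustration, since the diff excerpt does not show them.

```java
import java.util.concurrent.ThreadFactory;

/** Hypothetical sketch; constructor arguments and guard behavior are assumed. */
class SingleThreadFactorySketch implements ThreadFactory {
  private final String threadName;
  private final ClassLoader classLoader;

  // Named `thread` rather than `t`, so the guard below explains itself.
  private Thread thread;

  SingleThreadFactorySketch(String threadName, ClassLoader classLoader) {
    this.threadName = threadName;
    this.classLoader = classLoader;
  }

  @Override
  public synchronized Thread newThread(Runnable runnable) {
    if (thread != null) {
      // Assumed policy for this sketch: the factory backs a single-thread executor.
      throw new IllegalStateException("Thread '" + threadName + "' was already created");
    }
    thread = new Thread(runnable, threadName);
    // Make the thread resolve classes through the explicitly supplied loader.
    thread.setContextClassLoader(classLoader);
    return thread;
  }
}
```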


hudi-agent · 2026-05-18T16:30:12Z

🤖 Line 268: Could we log a WARN when this overrides a user-set policy? isBlockingInstantGeneration is false for MOR upserts without CDC (a very common case), so most users who explicitly configured EAGER will silently get LAZY here with no indication. A one-line WARN when the existing value differs from LAZY would make this much easier to debug.
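One possible shape for the suggested warning, as a self-contained sketch. The `Policy` enum is a stand-in for Hudi's failed-writes cleaning policy, `resolve` is a hypothetical helper rather than code from FlinkWriteClients, and `java.util.logging` is used only to keep the example dependency-free; the real code would use the project's own logger.

```java
import java.util.logging.Logger;

/** Hypothetical sketch of warning on a policy override; not the FlinkWriteClients code. */
class CleaningPolicyOverrideSketch {
  /** Stand-in for the failed-writes cleaning policy values discussed above. */
  enum Policy { EAGER, LAZY }

  private static final Logger LOG =
      Logger.getLogger(CleaningPolicyOverrideSketch.class.getName());

  /**
   * Returns the effective policy. Under non-blocking instant generation the
   * policy is forced to LAZY, and a warning is logged when that changes the
   * value the user configured.
   */
  static Policy resolve(Policy configured, boolean blockingInstantGeneration) {
    if (blockingInstantGeneration) {
      return configured;
    }
    if (configured != Policy.LAZY) {
      LOG.warning("Failed-writes cleaning policy " + configured
          + " is overridden to LAZY because instant generation is non-blocking");
    }
    return Policy.LAZY;
  }
}
```

Logging only when the configured value differs from LAZY keeps the message out of the default path while still surfacing the override to users who set EAGER explicitly.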


danny0405 and others added 4 commits May 18, 2026 13:59

perf: eliminate unnecessary timeline loading for Flink append only wr…

8392973

…ite path (apache#18264) (cherry picked from commit 69e24ea)

nsivabalan changed the title ~~feat: [DNM] Uber fixes~~ feat: [DNM] Async instant generation fixes over 0.14.1 May 18, 2026

hudi-agent reviewed May 18, 2026


danny0405 wants to merge 4 commits into apache:release-0.14.1 from danny0405:uber-fixes


4 participants
