Fix Claude analysis gating to handle co-failures correctly by iangmaia · Pull Request #25491 · wordpress-mobile/WordPress-iOS

iangmaia · 2026-04-08T11:56:52Z

Description

Follow-up to #25477. Fixes a logic bug in upload-claude-analysis.sh where the script exited early on the first non-essential failure (e.g. Danger), causing Claude analysis to be silently skipped even when essential jobs also failed in the same build.

Changes:

Fix co-failure logic: Count non-essential failures first, then query the Buildkite REST API for total failed job count. Only skip Claude when non_essential_failures == total_failures.
Fail-safe: If the API call fails, Claude runs anyway (safe default).
Fix shebang: #!/bin/bash -eu → #!/usr/bin/env bash + set -eu for portability.
Use Bash array: NON_ESSENTIAL_STEPS is now a proper array to avoid word splitting/glob issues.
YAML literal scalar: custom_prompt: > → custom_prompt: | to preserve numbered list and paragraph structure.
Include timed_out state: The jq filter catches both failed and timed_out job states.

Test Steps

Trigger a build where only Danger fails → Claude analysis should be skipped
Trigger a build where Danger and a real test both fail → Claude analysis should run
Trigger a build where only a real test fails → Claude analysis should run

I have considered if this change warrants user-facing release notes and have added them to RELEASE-NOTES.txt if necessary.

wpmobilebot · 2026-04-08T12:10:04Z

📲 You can test the changes from this Pull Request in WordPress by scanning the QR code below to install the corresponding build.

	App Name	WordPress
	Configuration	Release-Alpha
	Build Number	`32005`
	Version	`PR #25491`
	Bundle ID	`org.wordpress.alpha`
	Commit	`c530df9`
	Installation URL	166b11m92fe58

Automatticians: You can use our internal self-serve MC tool to give yourself access to those builds if needed.

wpmobilebot · 2026-04-08T12:10:25Z

📲 You can test the changes from this Pull Request in Jetpack by scanning the QR code below to install the corresponding build.

	App Name	Jetpack
	Configuration	Release-Alpha
	Build Number	`32005`
	Version	`PR #25491`
	Bundle ID	`com.jetpack.alpha`
	Commit	`c530df9`
	Installation URL	333jt4p48oo08

Automatticians: You can use our internal self-serve MC tool to give yourself access to those builds if needed.

iangmaia · 2026-04-09T19:28:26Z

@mokagio updated this PR as well to check for hard_failed on 683f6ca.

The previous script exited early when any non-essential step (e.g. Danger) failed, incorrectly skipping Claude analysis even when essential jobs also failed in the same build. Now counts non-essential failures first, then queries the Buildkite API for the total failed job count. Claude is only skipped when all failures are accounted for by non-essential steps. Fails safe: if the API call fails, Claude runs anyway. Also fixes shebang portability, uses a proper Bash array for step keys, uses YAML literal scalar to preserve prompt formatting, and includes timed_out jobs in the failure count. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Two bugs fixed: - buildkite-agent step get outcome returns "hard_failed", not "failed", so the non-essential check never matched - When all steps passed, the script fell through to uploading Claude analysis instead of exiting early Also restructures the script to query the API first, which simplifies the flow and avoids a redundant early-exit branch. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

mokagio

@iangmaia great job on this update across the apps.

Have you considered a next step where the script goes into CI toolkit with a default array of non-essential jobs comprising of Danger and SwiftLint, and the option for consumers to define a file in the repo (.buildkite/claude-analysis-non-essential-steps?) or an overriding env var in shared-pipeline-vars?

iangmaia · 2026-04-13T20:28:49Z

Have you considered a next step where the script goes into CI toolkit with a default array of non-essential jobs comprising of Danger and SwiftLint, and the option for consumers to define a file in the repo (.buildkite/claude-analysis-non-essential-steps?) or an overriding env var in shared-pipeline-vars?

That's not a bad idea! Though the Claude Build Analysis is probably going to change to adopt Buildkite Model Providers or a different solution given the plugin has been deprecated. As is, it is still interesting and we can likely migrate to other setup taking the learnings we had so far, but I wouldn't invest in tweaking the current setup that much.

mokagio · 2026-04-13T21:40:52Z

probably going to change to adopt Buildkite Model Providers or a different solution given the plugin has been deprecated

Good point, good point. I also wonder if this is actually useful for folks?

Do you have any feedback on it? But even then, AI analysis annotations might not be useful for devs but could be useful for us to get to a place where they can be useful? So definitely worth to keep investing on this, the question is how to measure usefulness/effectiveness.

iangmaia · 2026-04-14T21:24:33Z

Good point, good point. I also wonder if this is actually useful for folks?

Do you have any feedback on it? But even then, AI analysis annotations might not be useful for devs but could be useful for us to get to a place where they can be useful? So definitely worth to keep investing on this, the question is how to measure usefulness/effectiveness.

Yep, I'm still seeing it as an experiment. I often check it, sometimes it has useful insights, but often it is just noisy and verbose, runs when it shouldn't, etc so these changes are important to make it more reliable and relevant. The next iterations (hopefully already in a custom and more compact template) should keep improving it.

iangmaia added the Tooling Build, Release, and Validation Tools label Apr 8, 2026

iangmaia requested a review from mokagio April 8, 2026 11:58

iangmaia requested a review from twstokes April 8, 2026 15:41

iangmaia self-assigned this Apr 8, 2026

mokagio mentioned this pull request Apr 9, 2026

Update Claude build analysis prompt and model to Sonnet 4.6 #25477

Merged

iangmaia added this to the 26.9 milestone Apr 9, 2026

iangmaia and others added 2 commits April 10, 2026 19:19

iangmaia force-pushed the iangmaia/fix-claude-analysis-gating branch from 683f6ca to c530df9 Compare April 10, 2026 17:19

mokagio approved these changes Apr 13, 2026

View reviewed changes

iangmaia added this pull request to the merge queue Apr 13, 2026

Merged via the queue into trunk with commit 68ff59f Apr 13, 2026
24 checks passed

iangmaia deleted the iangmaia/fix-claude-analysis-gating branch April 13, 2026 20:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Claude analysis gating to handle co-failures correctly#25491

Fix Claude analysis gating to handle co-failures correctly#25491
iangmaia merged 2 commits intotrunkfrom
iangmaia/fix-claude-analysis-gating

iangmaia commented Apr 8, 2026 •

edited

Loading

Uh oh!

wpmobilebot commented Apr 8, 2026 •

edited

Loading

Uh oh!

wpmobilebot commented Apr 8, 2026 •

edited

Loading

Uh oh!

iangmaia commented Apr 9, 2026

Uh oh!

mokagio left a comment

Uh oh!

iangmaia commented Apr 13, 2026

Uh oh!

Uh oh!

mokagio commented Apr 13, 2026

Uh oh!

iangmaia commented Apr 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

iangmaia commented Apr 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Test Steps

Uh oh!

wpmobilebot commented Apr 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

wpmobilebot commented Apr 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

iangmaia commented Apr 9, 2026

Uh oh!

mokagio left a comment

Choose a reason for hiding this comment

Uh oh!

iangmaia commented Apr 13, 2026

Uh oh!

Uh oh!

mokagio commented Apr 13, 2026

Uh oh!

iangmaia commented Apr 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

iangmaia commented Apr 8, 2026 •

edited

Loading

wpmobilebot commented Apr 8, 2026 •

edited

Loading

wpmobilebot commented Apr 8, 2026 •

edited

Loading