Add mock Claude client support for E2E testing by mtsgrd · Pull Request #12220 · gitbutlerapp/gitbutler

mtsgrd · 2026-02-04T21:44:14Z

Introduces a testing framework for Claude integration that allows Playwright
tests to run with deterministic, pre-recorded Claude interactions instead of
the real Claude CLI. This enables reliable, fast E2E testing of permission
flows and user interactions without requiring an actual Claude API connection.

Key components:

Mock scenario loader (mock_scenario.rs) that reads session snapshots from
JSON files and creates mock transports for the Claude Agent SDK
Three initial test scenarios covering common permission flows:
- permission-bash-echo.json - Basic Bash tool approval
- permission-wildcard-test.json - Wildcard permission scoping
- ask-user-question.json - AskUserQuestion tool interaction
Integration in session.rs via testing feature flag and CLAUDE_MOCK_SCENARIO
environment variable
E2E test suite (claudePermissions.spec.ts) demonstrating permission approval
UI interactions
Improved test stability with process cleanup and RUST_LOG passthrough

The mock scenarios use the SDK's SessionSnapshot format and handle permission
callback timing automatically - when a control_request appears in the scenario,
subsequent messages are held until the SDK responds with control_response.

Each test specifies its own scenario by passing CLAUDE_MOCK_SCENARIO to
startGitButler(), which spawns a fresh but-server process with that env
var.

This is part 2 of 2 in a stack made with GitButler:

vercel · 2026-02-04T21:44:18Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
gitbutler-web	Ready	Preview, Comment	Feb 5, 2026 11:42am

estib-vega · 2026-02-05T09:58:34Z

oooh cool

Introduces a testing framework for Claude integration that allows Playwright tests to run with deterministic, pre-recorded Claude interactions instead of the real Claude CLI. This enables reliable, fast E2E testing of permission flows and user interactions without requiring an actual Claude API connection. Key components: - Mock scenario loader (mock_scenario.rs) that reads session snapshots from JSON files and creates mock transports for the Claude Agent SDK - Three initial test scenarios covering common permission flows: * permission-bash-echo.json - Basic Bash tool approval * permission-wildcard-test.json - Wildcard permission scoping * ask-user-question.json - AskUserQuestion tool interaction - Integration in session.rs via `testing` feature flag and CLAUDE_MOCK_SCENARIO environment variable - E2E test suite (claudePermissions.spec.ts) demonstrating permission approval UI interactions - Improved test stability with process cleanup and RUST_LOG passthrough The mock scenarios use the SDK's SessionSnapshot format and handle permission callback timing automatically - when a control_request appears in the scenario, subsequent messages are held until the SDK responds with control_response. Each test specifies its own scenario by passing CLAUDE_MOCK_SCENARIO to startGitButler(), which spawns a fresh but-server process with that env var.

mtsgrd mentioned this pull request Feb 4, 2026

Migrate Claude integration from binary to Rust SDK #12132

Merged

github-actions bot added rust Pull requests that update Rust code @gitbutler/desktop @gitbutler/ui labels Feb 4, 2026

vercel bot deployed to Preview February 4, 2026 21:46 View deployment

mtsgrd force-pushed the mg-branch-5 branch from 0cf94bf to 91ab4d3 Compare February 4, 2026 21:49

vercel bot deployed to Preview February 4, 2026 21:49 View deployment

mtsgrd force-pushed the claude-sdk branch from 20c1e66 to fc76f14 Compare February 4, 2026 22:05

mtsgrd force-pushed the mg-branch-5 branch from 91ab4d3 to 987f991 Compare February 4, 2026 22:05

vercel bot deployed to Preview February 4, 2026 22:06 View deployment

mtsgrd force-pushed the mg-branch-5 branch from 987f991 to b63ecd5 Compare February 4, 2026 22:08

vercel bot deployed to Preview February 4, 2026 22:08 View deployment

mtsgrd force-pushed the mg-branch-5 branch from b63ecd5 to 211e663 Compare February 4, 2026 22:09

vercel bot deployed to Preview February 4, 2026 22:10 View deployment

mtsgrd force-pushed the mg-branch-5 branch from 211e663 to e707569 Compare February 4, 2026 22:20

vercel bot deployed to Preview February 4, 2026 22:21 View deployment

mtsgrd force-pushed the claude-sdk branch from fc76f14 to f54ef78 Compare February 4, 2026 22:37

mtsgrd force-pushed the mg-branch-5 branch from e707569 to b2e3917 Compare February 4, 2026 22:37

vercel bot deployed to Preview February 4, 2026 22:38 View deployment

mtsgrd force-pushed the claude-sdk branch from f54ef78 to ca23747 Compare February 4, 2026 23:07

mtsgrd force-pushed the mg-branch-5 branch from b2e3917 to 0914d2b Compare February 4, 2026 23:07

vercel bot deployed to Preview February 4, 2026 23:07 View deployment

mtsgrd force-pushed the mg-branch-5 branch from 0914d2b to 16aeef8 Compare February 4, 2026 23:11

vercel bot deployed to Preview February 4, 2026 23:11 View deployment

mtsgrd force-pushed the claude-sdk branch from ca23747 to ee1249d Compare February 4, 2026 23:46

mtsgrd force-pushed the mg-branch-5 branch from 16aeef8 to 7499767 Compare February 4, 2026 23:46

vercel bot deployed to Preview February 4, 2026 23:47 View deployment

mtsgrd force-pushed the claude-sdk branch from ee1249d to 22e7c85 Compare February 5, 2026 00:12

mtsgrd force-pushed the mg-branch-5 branch from 7499767 to d66379c Compare February 5, 2026 00:13

vercel bot deployed to Preview February 5, 2026 00:13 View deployment

mtsgrd requested a review from estib-vega February 5, 2026 09:27

mtsgrd force-pushed the claude-sdk branch from 22e7c85 to 2ae45b5 Compare February 5, 2026 09:32

mtsgrd force-pushed the mg-branch-5 branch from d66379c to b15a068 Compare February 5, 2026 09:32

vercel bot deployed to Preview February 5, 2026 09:32 View deployment

mtsgrd force-pushed the claude-sdk branch from 2ae45b5 to f6526a7 Compare February 5, 2026 09:40

mtsgrd force-pushed the mg-branch-5 branch from b15a068 to c384b62 Compare February 5, 2026 09:40

vercel bot deployed to Preview February 5, 2026 09:41 View deployment

mtsgrd force-pushed the claude-sdk branch from f6526a7 to f2a3228 Compare February 5, 2026 11:21

mtsgrd force-pushed the mg-branch-5 branch from c384b62 to ddfbeb9 Compare February 5, 2026 11:22

vercel bot deployed to Preview February 5, 2026 11:22 View deployment

mtsgrd force-pushed the claude-sdk branch from f2a3228 to 506e2ed Compare February 5, 2026 11:23

mtsgrd force-pushed the mg-branch-5 branch from ddfbeb9 to 3ae86d2 Compare February 5, 2026 11:23

vercel bot deployed to Preview February 5, 2026 11:23 View deployment

Base automatically changed from claude-sdk to master February 5, 2026 11:35

mtsgrd force-pushed the mg-branch-5 branch from 3ae86d2 to 64f0f3f Compare February 5, 2026 11:42

vercel bot deployed to Preview February 5, 2026 11:42 View deployment

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add mock Claude client support for E2E testing#12220

Add mock Claude client support for E2E testing#12220
mtsgrd wants to merge 1 commit intomasterfrom
mg-branch-5

mtsgrd commented Feb 4, 2026 •

edited

Loading

Uh oh!

vercel bot commented Feb 4, 2026 •

edited

Loading

Uh oh!

estib-vega commented Feb 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

mtsgrd commented Feb 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

vercel bot commented Feb 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

estib-vega commented Feb 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

mtsgrd commented Feb 4, 2026 •

edited

Loading

vercel bot commented Feb 4, 2026 •

edited

Loading