Skip to content

chore(deps): widen agent-eval to ^0.91 + bump agent-runtime ^0.50#25

Merged
drewstone merged 1 commit into
mainfrom
chore/widen-agent-eval-0.91
Jun 14, 2026
Merged

chore(deps): widen agent-eval to ^0.91 + bump agent-runtime ^0.50#25
drewstone merged 1 commit into
mainfrom
chore/widen-agent-eval-0.91

Conversation

@drewstone

Copy link
Copy Markdown
Contributor

What

  • @tangle-network/agent-eval: >=0.77.0 <0.80.0^0.91.0
  • @tangle-network/agent-runtime: ^0.44.0^0.50.0

Why

The <0.80 ceiling on agent-eval blocked the fleet from a consistent agent-eval and risked a duplicate install of the substrate everyone depends on. 0.91 is what the rest of the fleet uses. After the bump, agent-eval resolves to a single 0.91.0 (agent-runtime 0.50 consumes it as a peer at the same version — no duplicate).

Substrate API change handled

agent-runtime 0.50 removed createFanoutVoteDriver from the /loops surface (it's gone from the package entirely, not renamed). multiHarnessResearcherFanout used it to build a "fanout of N harnesses, pick the best valid output" topology.

Reconstructed the identical semantics with the supported primitive: createDriver + a single-fanout-then-stop planner. The kernel round-robins agentRuns across the N branches and selects the winner across iterations via defaultSelectWinner — same behavior, current API. The driver's action type moved from the old 'pick-winner' | 'fail' to the kernel's DriverDecision ('continue' | 'done').

Two tests asserted the old driver's internal name ('fanout-vote') and decision strings ('pick-winner' / 'fail'); updated to the new createDriver contract ('dynamic' / 'done'). The winner-selection assertions — the actual behavior under test — are unchanged and still pass.

Verification

  • pnpm install — agent-eval 0.91.0 (single install), agent-runtime 0.50.0
  • pnpm run build — pass
  • pnpm run typecheck — pass (0 errors)
  • pnpm test — 110 passed / 5 skipped (live-network)
  • pnpm run lint — pass (no new warnings)

Widen the agent-eval pin from `>=0.77.0 <0.80.0` to `^0.91.0` (the
version the rest of the fleet uses) and bump agent-runtime `^0.44.0`
to `^0.50.0`. The old `<0.80` ceiling blocked a consistent fleet-wide
agent-eval and risked a duplicate substrate install; both now resolve
to a single agent-eval@0.91.0.

agent-runtime 0.50 removed `createFanoutVoteDriver` from the /loops
surface. Reconstruct the same single-fanout-then-stop topology with
`createDriver` + a planner; the kernel selects the winner across
iterations via `defaultSelectWinner`. Update the two tests that
asserted the old driver's name/decision strings to the new
DriverDecision contract (winner-selection behavior unchanged).

@tangletools tangletools left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

✅ Auto-approved PR — dcf27d24

Blanket team auto-approval is enabled for this reviewer service.
The full PR reviewer audit still runs separately and will publish findings if it detects issues.

tangletools · auto-approval · reason: blanket_auto_approve · 2026-06-14T00:55:35Z

@drewstone drewstone merged commit 56b1d1f into main Jun 14, 2026
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants