4 News Express: Show HN: Meta-agent: self-improving agent harnesses from live traces https://ift.tt/EAjXO24

Monday, April 6, 2026

Show HN: Meta-agent: self-improving agent harnesses from live traces https://ift.tt/EAjXO24

Show HN: Meta-agent: self-improving agent harnesses from live traces We built meta-agent: an open-source library that automatically and continuously improves agent harnesses from production traces. Point it at an existing agent, a stream of unlabeled production traces, and a small labeled holdout set. An LLM judge scores unlabeled production traces as they stream. A proposer reads failed traces and writes one targeted harness update at a time, such as changes to prompts, hooks, tools, or subagents. The update is kept only if it improves holdout accuracy. On tau-bench v3 airline, meta-agent improved holdout accuracy from 67% to 87%. We open-sourced meta-agent. It currently supports Claude Agent SDK, with more frameworks coming soon. Try it here: https://ift.tt/v8D0M3n https://ift.tt/v8D0M3n April 7, 2026 at 12:52AM

4 News Express

Monday, April 6, 2026

Show HN: Meta-agent: self-improving agent harnesses from live traces https://ift.tt/EAjXO24

No comments:

Show HN: Rudo - A small, elegant dock for Wayland https://ift.tt/BDuXye0

Footer Social Widget

Followers