Friday, May 15, 2026

Show HN: SwarmWright, structured multi-agent AI defined in markdowns https://ift.tt/RFCJ9ov

Show HN: SwarmWright, structured multi-agent AI defined in markdowns I had a bunch of custom AI pipelines and a growing folder of markdown files and Python scripts holding it together. Built this to give that chaos some structure. Agents are markdown files, topology is a JSON file the runtime enforces hard. The agents are still fully autonomous: they make their own decisions, but the graph they operate in isn't. You declare who can call whom upfront and the runtime holds that line. No auth yet, fine if you don't expose the port, i guess. Two Docker commands to run it. https://ift.tt/5WlybUh May 16, 2026 at 02:20AM

Show HN: Epiq – Distributed Git based issue tracker TUI https://ift.tt/4qAkUoY

Show HN: Epiq – Distributed Git based issue tracker TUI Issue trackers typically live outside of your workflow, with poor ergonomics. Epiq aims to solve that, bringing issue tracking into your terminal. Multi-user collaboration is achieved via git using user-scoped immutable event logs that converge in memory. Put my all into it. Let me know what you think. https://ljtn.github.io/epiq/ May 16, 2026 at 05:48AM

Show HN: Browser based sythesizer, drum machine and squencer https://ift.tt/1SZJhEP

Show HN: Browser based sythesizer, drum machine and squencer Inspired by the recent Boards Of Canada announcement, I've been in a low-fi electronica mood lately and was going back and forth with Claude on how to design similar instruments in the browser that fit the genre. One thing led to another and pretty soon I had a fully browser based polyphonic synthesizer / drum machine / sequencer. The interface and workflow was heavily inspired by the Rebirth338 application released back in the 90's, but with lo-fi synth voices rather than the original 303 & 808 emulation. I know there's a significant overlap of developers and musicians and I though some of you may enjoy playing with the app, or at least listening to the resulting album. I've also open sourced track 1 of the album via the performance script used to record it. It's in the repo. Bandcamp link to the resulting album: https://ift.tt/PrqJw5H... https://ift.tt/GQaPU79 May 16, 2026 at 01:37AM

Show HN: Claude Code vs. Codex Global Usage Leaderboard https://ift.tt/Jmj06ED

Show HN: Claude Code vs. Codex Global Usage Leaderboard https://ift.tt/nSi28wc May 16, 2026 at 12:48AM

Thursday, May 14, 2026

Show HN: Visualizing Tiny LLMs from OpenAI's Parameter Golf https://ift.tt/5xdpbrm

Show HN: Visualizing Tiny LLMs from OpenAI's Parameter Golf The two from parameter golf (one I trained, one was the baseline) are just 16MB each! They produce barely plausible English https://ift.tt/mfCbrL4 May 15, 2026 at 12:22AM

Show HN: 3D-Agent – AI that edits Blender scenes through the Python API https://ift.tt/sNUf67O

Show HN: 3D-Agent – AI that edits Blender scenes through the Python API https://ift.tt/zJ8fmu0 May 14, 2026 at 08:17PM

Wednesday, May 13, 2026

Show HN: Nibble https://ift.tt/fN5T23V

Show HN: Nibble An attempt at a single pass LLVM frontend in ~3000 lines of C without external dependencies, malloc, or an AST. Included are some graphical examples. The IR isn't perfect, and the README touches on one particular downfall https://ift.tt/eCyutia May 14, 2026 at 07:16AM

Show HN: Rotunda - A browser built for agents with simulated typing https://ift.tt/pQZihvo

Show HN: Rotunda - A browser built for agents with simulated typing Hi HN! Pierce here. Rotunda is a firefox fork primarily intended for agent use, which I’ve been hacking on nights/weekends. There was a [lengthy]( https://ift.tt/f9iTM4S ) discussion last week on how expensive computer use models are. The cost is going to drop eventually, but I think on some level it's still usually the wrong primitive. The web gives us access to beautiful structured formats, plaintext, etc... why throw that away if we don't have to? I realized at some point that for 99% of automations I just want agents to be able to control my Chrome instance. But that’s easier said that done: CDP (the Chrome automation protocol) leaks a ton of state about being programmatically controlled, either by toggling window attributes or by running `page.evaluate()` commands right in the page context. Plus if you look at an automation running it's pretty obvious what happens: the mouse jumps around, fields are filled instantly, etc. Rotunda tries to fix this. Its standout features: - Realistic simulation of mouse movements and keyboard commands, powered by a trained RNN on my own timing patterns from the last week. (still feel weird about opting-in to a key logger but whatever) - Doesn’t lie about its host specs, only fibs about some client side details. Stealth browsers are too easy to flag statistically when you’re adding noise to canvas pixels or audio pipelines. - It runs on your local device with a CLI or Playwright API accessible to Claude, Codex, or whatever your harness-de-jure today looks like. - Patches modern Firefox (150) with an agentic harness to keep this updated over time MPL-2.0 on GitHub: https://ift.tt/LVZlNIT Longer writeup on the design choices: https://ift.tt/OulikUp Also check out the demo on the site! https://www.rotunda.sh/ Pretty excited by how this turned out but we’re still super early. Give it a try and please flag any issues! https://ift.tt/LVZlNIT May 13, 2026 at 07:14PM

Show HN: Micromort Risk Visualizer https://ift.tt/vcJmMNA

Show HN: Micromort Risk Visualizer https://boxed.github.io/micromort/ May 14, 2026 at 12:09AM

Tuesday, May 12, 2026

Show HN: Duckflix, an open-source self-hosted media streaming platform https://ift.tt/CqBgR9T

Show HN: Duckflix, an open-source self-hosted media streaming platform I’ve been working on Duckflix, a self-hosted media streaming platform. It started as a full-stack project to combine a clean streaming UI with a Bun/Elysia backend, FFmpeg processing, SQLite, Docker deployment, and addon support. Website: https://duckflix.fun Demo: https://demo.duckflix.fun GitHub: https://ift.tt/L6rZ0Aa https://ift.tt/L6rZ0Aa May 13, 2026 at 02:53AM

Show HN: GIF Pile. a site to make piles of GIFs https://ift.tt/2VUMheC

Show HN: GIF Pile. a site to make piles of GIFs I'm quite fond of obnoxious looking gifs in a post-ironic way as a manner of shitposting and or injecting humor into a chat. The issue with this however is that, for no real good reason at all, the simple usecase of "Have image/gif background, bombard with garbage" had no real good tooling. There's gif editors out there, EZgif my beloved is probably my most used non-search-indexing-slash-social-media-site, but they're kinda clunky for my specific usecase of making digital eye-sandpaper bombastic garbage. Other options are bleak and gave me the mark of the beast via shitty watermarks. I just wanted a pile of gifs on top of each other, and thus far the "easiest" way was to bust open a video editor, muck around with it, mess up exporting as a gif directly, get mad, export it as a 4 second mp4, and then use ffmpeg to get it working. is this probably moronic? yes. am I likely to have missed a decent tool? yes. Did I give up looking after sending 4 dollars to some Indian guy for "No watermarks ever for 4$", only for that "ever" to be a year, and then the clunky weird af login process not work? absolutely. (Fuck you, you know who you are) This took me a few hours (most of which was dealing with the fact I don't do webshit normally and the clunk that one would expect from that), and is a minimal site for my personal minimal usecase. It's static because I'm not going to deal w/ hosting other people's shit and I don't want to deal with that can of worms. all processing is done locally on your browser. Yes, this means that using a 4k image as a base layer for your gif pile will make it take an age. It'll work eventually though. This will never have a watermark unless I'm bought out (total investment thus far has been 14 bucks, 4 of which was that one dude fucking me), in which case I probably earned it. at most I'll likely throw adsense on there at some point to scrape a few cents from the people who can't figure out adblock if it gets popular enough for me to warrant it. There's no timelines or anything like that. literally just a pile of gifs. thus far my primary usecase has been overlaying text gifs from the various fancy text generator sites onto glitter backgrounds with uncomfortable rat GIFs to call people poor on the internet. this makes me happy. There's likely to be obvious UI, UX, or other U-whatever fuckups. If you point them out and I deem it pedantic I'll probably laugh at you. if it's helpful I'll probably implement it when I get a bit. Surprisingly, works on mobile. CSS is exceedingly generic and souless atm, just went off vauge memories of ss13's TGUI. I'll likely scrap the CSS entirely and go full neocities at some point because that's more soulful. https://gifpile.com/ May 13, 2026 at 02:41AM

Show HN: I submitted 316 AI-generated PRs to open source https://ift.tt/xTCjNA4

Show HN: I submitted 316 AI-generated PRs to open source https://june.kim/speedrunning-open-source May 12, 2026 at 11:42PM

Show HN: Cook a Django project well, the agent-skill take on cookiecutter https://ift.tt/1lfahQ0

Show HN: Cook a Django project well, the agent-skill take on cookiecutter https://ift.tt/ejogyrp May 13, 2026 at 12:03AM

Monday, May 11, 2026

Sunday, May 10, 2026

Show HN: adamsreview – better multi-agent PR reviews for Claude Code https://ift.tt/MHScbfD

Show HN: adamsreview – better multi-agent PR reviews for Claude Code I built adamsreview, a Claude Code plugin that runs deeper, multi-stage PR reviews using parallel sub-agents, validation passes, persistent JSON state, and optional ensemble review via Codex CLI and PR bot comments. On my own PRs, it has been catching dramatically more real bugs than Claude’s built-in /review, /ultrareview, CodeRabbit, Greptile, and Codex’s built-in review, while producing fewer false positives. adamsreview is six Claude Code slash commands packaged as a plugin: review, codex-review, add, promote, walkthrough, and fix. I modeled it after the built-in /review command and extended it meaningfully. You can clear context between review stages because state is stored in JSON artifacts on disk, with built-in scripts for keeping it updated. The walkthrough command uses Claude’s AskUserQuestion feature to walk you through uncertain findings or items needing human review one by one. Then, the fix command dispatches per-fix-group agents and re-reviews the work with Opus, reverting any regressions before committing survivors. It runs against your regular Claude Code subscription (Max plan recommended), unlike /ultrareview, which charges against your Extra Usage pool. I would love feedback from Claude Code users, pro devs, and anyone with strong opinions about AI code reviews. Repo: https://ift.tt/W0ai3Iv Install: /plugin marketplace add adamjgmiller/adamsreview, /plugin install adamsreview@adamsreview https://ift.tt/W0ai3Iv May 11, 2026 at 07:36AM

Show HN: I trained a chess engine to play like humans https://ift.tt/DLHqNv4

Show HN: I trained a chess engine to play like humans I built 1e4.ai - a chess web app where you play against neural networks trained to mimic human Lichess players at specific Elo ranges. There's a separate model for each 100-point rating bucket from ~800 to 2200+, and the bots not only choose human-like moves but also burn clock time, play worse under time pressure, and blunder in human-like ways. Live demo: https://1e4.ai Code: https://ift.tt/Pef4nSL A few things that might be interesting: - Trained on almost a full year of Lichess blitz games, around 1B total games - Architecture is an a small (~9MM parameters) transformer-based network that takes the board, recent move history, the player's rating, and remaining clock time as input. Three separate models per rating bucket: move, clock-usage, and win probability. The clock model is what makes the bots feel humanish under time pressure rather than instant. Because the move model takes the clock as one input parameter, it also learns to blunder under time pressure like a human might. - Because the network is so tiny, no GPU is needed for inference - it runs easily on a local CPU - Downside of the tiny network is that it's a bit weak as you turn up the rating past around 1700. It can spot short tactics but not long multi-move combinations. - Initial training on a rented 8xH100 cluster, then fine-tunes on my local GPU for different rating ranges - Inspired by Maia-2 and DeepMind's "Grandmaster-Level Chess Without Search". On a held-out Lichess blitz benchmark, the it beats Maia-2 blitz on top-1 move prediction (56.7% vs 52.7%) and pretty substantially on win-probability calibration (Brier 0.176 vs 0.272). Numbers and code in https://ift.tt/Ro8ap75... - The data pipeline is C++ via nanobind, then training with Pytorch. Getting this right was actually the thing I spent the most time on. Pre-shuffling the dataset and then being able to read the shuffled dataset sequentially at training time kept the GPU utilization high. Without this it spent a huge percentage of time on I/O while the GPU sat idle. Happy to answer questions about the rating-conditioning, the clock model, or the data pipeline. May 11, 2026 at 04:01AM

Show HN: Hustler Bingo – a tiny bingo game about startup Twitter clichés https://ift.tt/GOmI4Ap

Show HN: Hustler Bingo – a tiny bingo game about startup Twitter clichés I built this after my brother started complaining that I got too much into brainrot culture. It's just for fun nothing serious, but was able to test vercel, tanstack start and convex without high stakes. Have fun! This is the game where lower score is goood for your mental health https://ift.tt/60HIz8J May 11, 2026 at 02:06AM

Show HN: Mosaic – arrange iOS icons by color using an evolutionary algorithm https://ift.tt/RxocrGH

Show HN: Mosaic – arrange iOS icons by color using an evolutionary algorithm It started out as a way for me to freshen up my C++ skills during COVID. But life got in the way and it was put on ice. Luckily, coding LLMs came to the rescue and allowed me to bring it to a point where I feel comfortable sharing it. https://ift.tt/HOcKJgS May 10, 2026 at 11:59PM

Saturday, May 9, 2026

Show HN: Free OSS transcription app I made and found it's faster than wispr flow https://ift.tt/2h9d6Kn

Show HN: Free OSS transcription app I made and found it's faster than wispr flow title doesn't let nuance, ofc it's not the app that's faster but the way you can use it with Groq inference for example. https://mumbli.app/ May 10, 2026 at 03:07AM

Show HN: I built a screen recorder that captures console logs, requests and more https://ift.tt/dPD6hjW

Show HN: I built a screen recorder that captures console logs, requests and more https://userplane.io/ May 17, 2026 at 01:04AM