Sunday, October 12, 2025

Show HN: I built a simple ambient sound app with no ads or subscriptions https://ift.tt/QbWEPlJ

Show HN: I built a simple ambient sound app with no ads or subscriptions I’ve always liked having background noise while working or falling asleep, but I got frustrated that most “white noise” or ambient sound apps are either paywalled, stuffed with ads, or try to upsell subscriptions for basic features. So I made Ambi, a small iOS app with a clean interface and a set of freely available ambient sounds — rain, waves, wind, birds, that sort of thing. You can mix them, adjust volume levels, and just let it play all night or while you work. Everything works offline and there are no hidden catches. It’s something I built for myself first, but I figured others might find it useful too. Feedback, bugs, and suggestions are all welcome. https://ift.tt/QZkAjUs... https://ambisounds.app/ October 12, 2025 at 08:19PM

Saturday, October 11, 2025

Show HN: Solving the cluster 1 problem with vCluster standalone https://ift.tt/HrxtRDk

Show HN: Solving the cluster 1 problem with vCluster standalone vCluster is an open source tool for Kubernetes multi-tenancy. Over the years it has matured to offer hosted control plane virtual clusters and shared virtual clusters, but the host cluster problem was always there. With vCluster standalone, you can now create the first cluster with the same developer experience and avoid the multiple-vendor problem. With this, you can use vCluster across the entire multi-tenancy spectrum. Feel free to discuss, happy to answer any questions. https://ift.tt/UQX0rGb October 8, 2025 at 10:20PM

Show HN: Sprite Garden - HTML Canvas 2D sandbox and farming https://ift.tt/P19qxuD

Show HN: Sprite Garden - HTML Canvas 2D sandbox and farming Sprite Garden: https://kherrick.github.io/sprite-garden/ A 2D sandbox exploration and farming game that runs entirely in the web browser. As a fully HTML, CSS, and JavaScript game, it is highly readable, hackable, and customizable. Included on "globalThis" is the "spriteGarden" global object with the game config and state readily available. Drawing with tiles is as easy as opening dev tools (use the menu in the browser as keyboard is captured), or entering the "Konami Code," for a full screen view and a map editor. - Share games from the world state manager - Explore unique procedurally generated biomes - Dig for resources like coal, iron, and gold - Use collected materials to place blocks and shape the world - Discover underground cave systems filled with resources - Plant and harvest different crops with "realistic" growth cycles Examples: - Preparing a QR Code to be mined: https://gist.github.com/kherrick/1191ae457e1f6e1a65031c38c2d... - Drawing a heart in the sky: https://gist.github.com/kherrick/3dc9af05bccc126b11cc26fb30a... - Entering the Konami Code (map editor / fullscreen): https://gist.github.com/kherrick/effbe1463d9b78da046f27c5d42... I'm unsure how the game should be taken further, or whether it should progress. Some potential ideas for the future include: - Input Box with JS Execution: Provide a safe, sandboxed input area in the game's UI where players can write small JS functions or scripts (instead of exposing it on globalThis). - API Exposure: Expose a controlled API or object representing game state and functions, like terrain manipulation, crop growth, player movement, to the user script so players can automate or modify behaviors. - Event Hooks: Allow players to register hooks into game events (e.g., world update, planting crops) where their custom code runs, enabling mods or custom automation. - Multiplayer: Use WebRTC to allow many players in the same world. 
- Actual gamification: Make reasons to play, health meter, powerups, plant combinations, enemies? - Better mobile controls: Currently on screen, no swiping for movement. - Easier building with blocks: Currently block position based on location of player. Also featured on: - Microsoft Store: https://ift.tt/ut48Uyk - Wayback Machine: https://ift.tt/SE5c28t.... Feedback is highly welcome, and source is available at: https://ift.tt/1DL7ikU https://kherrick.github.io/sprite-garden/ October 12, 2025 at 03:15AM

Friday, October 10, 2025

Show HN: Praxos – Webhooks for Your Life https://ift.tt/1Rcakjn

Show HN: Praxos – Webhooks for Your Life Hello HN, Lucas and Soheil here from Praxos ( https://mypraxos.com/ )! We’ve been working on an AI personal assistant for a while now, and today we're sharing about our new webhooks feature with you. You can now add webhooks and triggers by asking Praxos over text or voice to set them up for you. Webhooks listen for conditions that, when they happen, trigger another action. Webhooks can execute one time or be indefinite. They can execute any action currently supported by Praxos. They are implemented for email and calendar (Gmail, Outlook), Notion, Slack, Discord, Trello, Dropbox, Drive, iMessage, Whatsapp and Telegram. Examples include: –"When a new task is added to my Trello ‘Urgent’ list, create a two-hour block on my Google Calendar within my next work window, and send me a reminder as soon as it happens. " [this ties nicely to the next one] –"If my calendar says I’m in a focus block, auto-reply on Slack saying I’m working on the latest Trello Urgent task." –“When a meeting transcript is ready from Fireflies and lands on my email, extract decisions and next steps, then post a 5-bullet summary in #team-updates on Slack” [ties to next point]. –“When a meeting transcript from work finishes processing, summarize it, and post key decisions to Slack. But delay the notification until after my kid’s bedtime. If the summary includes tasks due tomorrow, block off my calendar in the morning and text me a reminder after I wake up.” –"My goal is to read 12 books this year. Every month, send me a list of 10 books and their links to Goodreads based on what I like. Every Saturday morning, ask me how far along I am with my reading. If you find I purchased a book (i.e.: from checking my email) then add it to my reading list, and ask me if I've started reading it." –"Every time I get a receipt from Uber Eats or Doordash, extract the date, bill, and meal and add those to my personal finances spreadsheet on Google Sheets." 
–“When a new transaction appears in my bank email or statement, extract the merchant, category, and amount, then log it in my ‘Spending Tracker’ Google Sheet. If total monthly spend crosses my budget limit, post a summary to my private WhatsApp chat with the top three categories that caused it.” –"Remind me to pay my credit card and bills each time a new email comes in. Ask me if I have done so one date later if I don't tell you I already have done this." –“At month-end, compile all financial logs from Sheets, receipts, and transcripts of my finance calls, then generate a single ‘Monthly Snapshot’ PDF. If my savings rate improved, add a green badge, and congratulate me. If it worsened, send me a summary with trends.” –"Every Sunday, check my latest additions to Google Photos and send a curation to my mom and grandma over Whatsapp." –"Review user feedback on the Praxos Discord channel and add it to our User Feedback page on Notion. Keep a counter for repeat requests." –"If I receive an email from Lucas, and I'm coding, respond to him and tell him I'm busy. Also remind him to check what I'm up to on Trello." Curious? Try it out for free for 7 days at https://ift.tt/JxMi5aj ! October 11, 2025 at 12:00AM
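As a rough illustration of the trigger model described in the post above (a condition that, when it happens, fires an action, either once or indefinitely), here is a minimal Python sketch. The names `Trigger` and `dispatch` and the event dictionaries are hypothetical and are not Praxos's actual API; the post does not describe how the feature is implemented.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List

Event = Dict[str, Any]


@dataclass
class Trigger:
    """Hypothetical trigger: a condition paired with an action."""
    condition: Callable[[Event], bool]
    action: Callable[[Event], None]
    one_time: bool = False  # one-time triggers are dropped after they fire


def dispatch(event: Event, triggers: List[Trigger]) -> List[Trigger]:
    """Run every matching trigger and return the triggers that stay active."""
    remaining = []
    for trigger in triggers:
        fired = trigger.condition(event)
        if fired:
            trigger.action(event)
        # Keep the trigger unless it was a one-time trigger that just fired.
        if not (fired and trigger.one_time):
            remaining.append(trigger)
    return remaining


# Example: log every Uber Eats receipt (indefinite) and greet once (one-time).
log: List[str] = []
triggers = [
    Trigger(lambda e: e.get("sender") == "Uber Eats",
            lambda e: log.append(f"receipt:{e['total']}")),
    Trigger(lambda e: True, lambda e: log.append("hello"), one_time=True),
]
triggers = dispatch({"sender": "Uber Eats", "total": 23.5}, triggers)
triggers = dispatch({"sender": "Lucas", "total": 0}, triggers)
```

After both events, only the indefinite receipt trigger is still registered; the one-time greeting ran on the first event and was removed.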

Show HN: Multiple choice video webgame experiment https://ift.tt/VezWH4T

Show HN: Multiple choice video webgame experiment Hey all, just wanted to share a little game experiment. It's a rooms & keys kind of adventure with a lot of random deaths. It plays a Veo3-generated video in response to clicks, with Gemini used for coding. Prompting the videos was fun, but trying to vibe code everything was not. In the future I'll go back to using LLMs more sparingly for isolated functions, or at least try not to have it create anything that requires seeing the output to debug. https://ift.tt/sVFEQHh October 10, 2025 at 10:39PM

Show HN: Iframetest.com https://ift.tt/P4iOsvX

Show HN: Iframetest.com https://iframetest.com/ October 6, 2025 at 03:25PM

Thursday, October 9, 2025

Show HN: Open-Source Voice AI Badge Powered by ESP32+WebRTC https://ift.tt/iL7u2CW

Show HN: Open-Source Voice AI Badge Powered by ESP32+WebRTC hi! video[0] The idea is that you carry around this hardware and ask it any questions about the conference: who is speaking, what they are speaking about, etc. It connects via WebRTC to an LLM and you get a bunch of info. This is a workshop/demo project I did for a conference. When I was talking to the organizers I mentioned that I enjoy doing hardware + WebRTC projects. They thought that was cool, so we ran with it. I have been doing these ESP32 + voice AI projects for a bit now. I started with an embedded SDK for LiveKit[1] in July 2024 and have been noodling with it since then. This code then found its way into pipecat/livekit etc. So I hope it inspires you to go build with hardware and WebRTC. It's a REALLY fun space right now. Lots of different cheap microcontrollers and even more cool projects. [0] https://www.youtube.com/watch?v=gPuNpaL9ig8 [1] https://ift.tt/VJARHqK https://ift.tt/pnzcEyu October 10, 2025 at 02:25AM

Show HN: Created macOS app to help you keep your distance from your screen https://ift.tt/RITlsb0

Show HN: Created macOS app to help you keep your distance from your screen Hey everyone, If you're anything like me, you spend a good chunk of your day (and night) on your computer. I often find that when I'm zoned in, my posture gets worse and worse and my face ends up really close to the screen. Over the course of a workday, by the time I finally unplug, my eyes have a hard time focusing on things that are far away. This became a big enough problem that I decided to create an app to help me keep my face far enough from the screen. Now, I could've gone with a simple notification with a timer built into it but, as with all reminder notifications, they soon become noise for me and I end up just dismissing them. I needed something to actively force me to move back. Which is where FarSight comes in. It uses your camera to gauge your distance and blurs the entire screen if it detects that you have been too close for a certain period of time. I made it so that it won't be extremely annoying and disruptive (e.g. blurring the screen every time you cross the line) but just enough of a nuisance to be helpful. I've been using it every day since creating it and it's definitely helped me with eye strain, double vision, and, surprisingly, my posture as well. I'm not sure if I'll keep it free forever but I wanted to release it first to ask for feedback. The app is only available on macOS for now, so if there is enough interest, I'll invest in making a Windows counterpart. https://ift.tt/45fepBr... Also, in case anyone is wondering, no data is collected and the snapshots taken during the app's usage are not saved but only used to calculate the distance. October 10, 2025 at 01:57AM
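The "blur only after being too close for a certain period" behavior described above can be sketched as a small debounce. This is a hedged Python illustration, not FarSight's code: the class name `ProximityMonitor`, the 50 cm threshold, and the 10 second grace period are all assumptions, and the distance readings are taken as given rather than computed from a camera.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ProximityMonitor:
    """Hypothetical debounce: blur only after sustained closeness."""
    min_distance_cm: float = 50.0  # assumed threshold, not from the post
    grace_seconds: float = 10.0    # assumed time allowed while too close
    _too_close_since: Optional[float] = None

    def update(self, distance_cm: float, now: float) -> bool:
        """Return True when the screen should be blurred."""
        if distance_cm >= self.min_distance_cm:
            # Far enough away: reset the timer and keep the screen clear.
            self._too_close_since = None
            return False
        if self._too_close_since is None:
            # First reading inside the threshold: start the timer.
            self._too_close_since = now
        return now - self._too_close_since >= self.grace_seconds


# (timestamp in seconds, estimated distance in cm)
readings = [(0.0, 40.0), (5.0, 42.0), (12.0, 38.0), (13.0, 65.0), (14.0, 40.0)]
monitor = ProximityMonitor()
decisions = [monitor.update(distance, t) for t, distance in readings]
```

Briefly crossing the line does not blur anything; only the third reading, after 12 seconds of continuous closeness, returns True, and moving back resets the timer.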

Wednesday, October 8, 2025

Show HN: Spica – OSS Tool to Generate Infinite Length Sora-2 Videos https://ift.tt/khVWpoz

Show HN: Spica – OSS Tool to Generate Infinite Length Sora-2 Videos https://ift.tt/O7pmuyS October 9, 2025 at 12:04AM

Show HN: KI Song Erstellen Kostenlos – AI Music Generator for German Music https://ift.tt/BVtwqv4

Show HN: KI Song Erstellen Kostenlos – AI Music Generator for German Music Free AI music generator for German songs. Text in → professional song in a few minutes. Built for content creators who need copyright-free music. https://ift.tt/UT7QzBO GitHub: https://ift.tt/bgTrk19 Try it out! https://ift.tt/UT7QzBO October 8, 2025 at 10:26PM

Tuesday, October 7, 2025

Show HN: Agentic Design Patterns – Python Edition, from the Codex Codebase https://ift.tt/hIUK7va

Show HN: Agentic Design Patterns – Python Edition, from the Codex Codebase While reading Agentic Design Patterns by Antonio Gulli, I wanted to see how these patterns look in real code. I cloned the OpenAI Codex repo (the open-source AI coding assistant that recently trended on HN), but it was in Rust. So I used Cursor to help me extract and translate 18+ agentic patterns from Codex’s codebase into Python. That small experiment turned into a full open-source guide: GitHub: Codex Agentic Patterns https://ift.tt/9upZHC7 Each pattern comes with: - A short explanation and code sample - A runnable exercise and agent snippet - A summary of how Codex used the pattern (e.g., prompt chaining, tool orchestration, reflection loops, sandbox escalation) - One full working Python agent that ties it all together If you’ve read the agentic design patterns book or explored Codex, this is a bridge between theory and practice, focused on runnable, open examples instead of abstract diagrams. It’s completely free and open-source. Would love feedback, ideas, or even new patterns from your own agent experiments. https://artvandelay.github.io/codex-agentic-patterns/ October 8, 2025 at 04:11AM

Show HN: DidMySettingsChange – A tool that checks changed windows settings https://ift.tt/t6SViur

Show HN: DidMySettingsChange – A tool that checks changed windows settings Microsoft has been under heavy scrutiny over the years for how it manages Windows, particularly concerning privacy and telemetry settings. Many users find that settings they disabled are mysteriously re-enabled after updates or for no apparent reason. DidMySettingsChange is a Python script designed to help users keep track of their Windows privacy and telemetry settings, so they stay in control of their privacy without the hassle of manually checking each setting. Features: - Comprehensive Checks: Automatically scans all known Windows privacy and telemetry settings. - Change Detection: Alerts users if any settings have been changed from their preferred state. - Customizable Configuration: Allows users to specify which settings to monitor. - Easy to Use: Simple command-line interface that provides clear and concise output. - Logs and Reports: Generates detailed logs and reports for auditing and troubleshooting. https://ift.tt/3bKGAHD October 6, 2025 at 04:19AM
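The change-detection idea above (compare each monitored setting against the user's preferred state) can be sketched in a few lines of Python. This is an illustrative assumption about the approach, not the project's code: `find_changes` and the setting names are hypothetical placeholders, and reading the real values from Windows is left out.

```python
from typing import Dict, Optional, Tuple


def find_changes(
    preferred: Dict[str, int],
    current: Dict[str, int],
) -> Dict[str, Tuple[int, Optional[int]]]:
    """Map each drifted setting to (preferred value, current value).

    A setting missing from `current` is reported with None as its value.
    """
    changes = {}
    for name, wanted in preferred.items():
        actual = current.get(name)
        if actual != wanted:
            changes[name] = (wanted, actual)
    return changes


# Placeholder setting names; a real tool would read these from the system.
preferred = {"telemetry_level": 0, "advertising_id": 0, "location_access": 0}
current = {"telemetry_level": 1, "advertising_id": 0}
changes = find_changes(preferred, current)
```

Only the settings listed in `preferred` are checked, which mirrors the "specify which settings to monitor" feature in spirit.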

Show HN: I'm building a browser for reverse engineers https://ift.tt/tyQYdjA

Show HN: I'm building a browser for reverse engineers https://ift.tt/b2yRU5C October 6, 2025 at 09:02PM

Show HN: Gotask, a simple task manager CLI built using Golang https://ift.tt/ZA9jVzt

Show HN: Gotask, a simple task manager CLI built using Golang Hey folks, Gotask is a simple golang CLI I made to explore some aspects of the Go programming language. https://ift.tt/lgN6uwI October 8, 2025 at 12:20AM

Monday, October 6, 2025

Show HN: TinqerJS– LINQ-inspired QueryBuilder for TypeScript + Postgres/SQLite https://ift.tt/7i8JCWk

Show HN: TinqerJS– LINQ-inspired QueryBuilder for TypeScript + Postgres/SQLite https://tinqerjs.org October 6, 2025 at 08:58PM

Show HN: I've built a platform for writing technical/scientific documents https://ift.tt/hucG4KP

Show HN: I've built a platform for writing technical/scientific documents https://ift.tt/0beoYHD October 6, 2025 at 04:28PM

Show HN: I Built a Transcription CLI Because Uploading 4GB Videos Was Killing Me https://ift.tt/hAY9QL2

Show HN: I Built a Transcription CLI Because Uploading 4GB Videos Was Killing Me https://ift.tt/bRZSUvg October 6, 2025 at 11:52PM

Show HN: Volant – spin up real microVMs in 10 seconds (Docker images or initramfs) https://ift.tt/EgG5ATX

Show HN: Volant – spin up real microVMs in 10 seconds (Docker images or initramfs) I’ve been building Volant, a modular microVM orchestration engine that makes running microVMs feel as simple as Docker. It supports cloud-init, GPU/VFIO passthrough (yes, you can run AI/ML workloads in isolated microVMs), booting Docker images via a plugin system, and Kubernetes-style deployments with replication, all from a single CLI (soon to be a web UI, see next). Coming soon: a built-in PaaS mode with snapshot-based cold start elimination, sort of like Dokploy, but designed for serverless workloads that boot from memory snapshots instead of containers. Volant is intentionally a bit opinionated to make microVMs more accessible, but it’s fully extensible for power users. Check out the README and the docs for more details. It’s free and open source (under BSL). Would love to hear feedback or thoughts from anyone! tl;dr: the 6-second GIF in the README shows the full flow: install → create VM → get HTTP 200. https://ift.tt/p3AQNum October 6, 2025 at 04:24AM

Sunday, October 5, 2025

Show HN: A Node.js CLI tool to generate ai.txt, llms.txt, robots.txt, humans.txt https://ift.tt/OfDnGeR

Show HN: A Node.js CLI tool to generate ai.txt, llms.txt, robots.txt, humans.txt https://ift.tt/oNZKUrc October 6, 2025 at 09:28AM

Show HN: High-fidelity, compact, and real time rendering of university campus https://ift.tt/ExjboKt

Show HN: High-fidelity, compact, and real time rendering of university campus Technical thread: https://ift.tt/X8UBZ4n https://hoanh.space/aalto/ October 6, 2025 at 05:21AM

Saturday, October 4, 2025

Show HN: An open-source, RL-native observability framework we've been missing https://ift.tt/ietSHwr

Show HN: An open-source, RL-native observability framework we've been missing The RL ecosystem is maturing: verifiers are standardizing how we build and share environments. However, as it grows, we need observability tooling that actually understands RL primitives. Running RL experiments without visibility into rollout quality, reward distributions, or failure modes is a waste of time. Monitor provides live tracking, per-example inspection, and programmatic access, so you can see what's happening during runs and debug what went wrong afterward. https://ift.tt/0Lz1VIO October 5, 2025 at 03:05AM

Show HN: World Amazing Framework: Like Django for Civilization https://ift.tt/cBZgEuj

Show HN: World Amazing Framework: Like Django for Civilization Any initial thoughts? This framework is meant to be a tool for construction, so if you want to play around with it for creating potential specific implementations, you can drop the contents of the website, the GitHub README, and the entire overview.md into an AI chat, and that should be enough to use the framework, at least conceptually. Would y'all want me to pre-prime a chat in Google AI Studio with the full context of the plan and some basic direction for discourse? I can share a link to a ready-to-go environment. The core documentation should answer most mechanical questions. And if you feed the docs into an AI chat, you can ask it any question you may have, or to simply ask it to explain something in different ways, or hypothesize solutions to any world issue, either systemic or regional. Gemini Pro 2.5 can take the full doc in one prompt, and its ability to co-create ideas is remarkable. I've been using it mostly through the AI Studio interface. Much of the overview is as much my work as it is a synthesis of my collaboration with Gemini Pro 2.5, ChatGPT-4o, and some early contributions from GPT-4 about a year ago. Before LLMs, I was building out pamphlet-style pages on a website (that are up at whomanatee.org, which is the base wrapper implementation of the framework), and I was planning to use them as talking points. I was anticipating that much of the deep thinking would have to happen in slow, public discourse. With LLMs, I've been able to stress-test these ideas from every possible angle, using any past event or theory to see if the framework could withstand scrutiny. At one point, a model argued that Adam Smith would have rejected this idea as fantasy. So I worked with it to develop an economic plan that "synthetic Adam" praised. It's incredible that we now have the ability to get synthesized thoughts from almost any perspective. You could ask it, "What would Barack Obama think of this plan? 
And using the framework, what would be your response to any hesitations he may have?" And it responds with incredible analysis, synthesis, and feedback. https://ift.tt/8QzGKWh October 5, 2025 at 03:44AM

Show HN: Run – a CLI universal code runner I built while learning Rust https://ift.tt/sM0NdnZ

Show HN: Run – a CLI universal code runner I built while learning Rust Hi HN — I’m learning Rust and decided to build a universal CLI for running code in many languages. The tool, Run, aims to be a single, minimal dependency utility for: running one-off snippets (from CLI flags), running files, reading and executing piped stdin, and providing language-specific REPLs that you can switch between interactively. I designed it to support both interpreted languages (Python, JS, Ruby, etc.) and compiled languages (Rust, Go, C/C++). It detects languages from flags or file extensions, can compile temporary files for compiled languages, and exposes a unified REPL experience with commands like :help, :lang, and :quit. Install: cargo install run-kit (or use the platform downloads on GitHub). Source & releases: https://ift.tt/TDCoZ2l I used Rust while following the official learning resources and used AI to speed up development, so I expect there are bugs and rough edges. I’d love feedback on: usability and UX of the REPL, edge cases for piping input to language runtimes, security considerations (sandboxing/resource limits), packaging and cross-platform distribution. Thanks — I’ll try to answer questions and share design notes. https://ift.tt/TDCoZ2l October 5, 2025 at 12:04AM
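To make the "detects languages from flags or file extensions" step above concrete, here is a small Python sketch of that kind of lookup. Run itself is written in Rust and its real logic is not shown in the post, so the table, the function names, and the rule that an explicit flag wins over the extension are all assumptions for illustration.

```python
from pathlib import Path
from typing import Optional

# Hypothetical extension table; the real tool may support more languages.
EXTENSION_TO_LANGUAGE = {
    ".py": "python",
    ".js": "javascript",
    ".rb": "ruby",
    ".rs": "rust",
    ".go": "go",
    ".c": "c",
    ".cpp": "cpp",
}

# Languages that need a compile step before the program can run.
COMPILED_LANGUAGES = {"rust", "go", "c", "cpp"}


def detect_language(path: str, lang_flag: Optional[str] = None) -> Optional[str]:
    """Prefer an explicit flag; otherwise fall back to the file extension."""
    if lang_flag:
        return lang_flag.lower()
    return EXTENSION_TO_LANGUAGE.get(Path(path).suffix.lower())


def needs_compile(language: str) -> bool:
    """Return True when the language is compiled to a temporary binary first."""
    return language in COMPILED_LANGUAGES
```

Returning None for unknown extensions leaves the caller free to ask the user for a language instead of guessing.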

Friday, October 3, 2025

Show HN: Beacon (open source) – Built after AWS billed me 700% more for RDS https://ift.tt/sRJM4H0

Show HN: Beacon (open source) – Built after AWS billed me 700% more for RDS I've been hosting my side project on AWS. I was paying an okay price for not managing infrastructure at all. I moved everything to AWS Lightsail after my startup credits ran out. The project was initially a success and made several thousand euros per month in revenue. Then came covid with new regulations, and suddenly my customers were nonexistent (the problem it solved was no longer there). After that it was not making me money; I was paying out of my own pocket to maintain it, thinking maybe it would come back. Then one day, after some AWS emails I had ignored as spam, I got a huge charge on my card, along with a bill from AWS. The charge was orders of magnitude higher than the previous charges. "WTF??" I said to myself while rushing to log into the dashboard to see what the issue was. No DDoS, no misconfiguration, nothing unusual. I logged into the root account to look at the billing page, and there it was: an RDS PostgreSQL legacy fee of ~€200 because I did not upgrade to Postgres 16 (from 13). I was baffled. I paid €25 a month (27% tax included) for the smallest RDS instance, and then I saw this monster fee for something I think should cost maybe €2. I mean, AWS just has to run it in a different environment. For €200 I could buy them a new server to run it for me. That's when I had the realization: "I have a spare Raspberry Pi 3, I'll just host everything on that. That will be free." But self-hosting came with its own challenges, especially on a resource-constrained device. I needed better tools to deploy and monitor my application. SSH-ing into the Raspberry Pi every time I wanted to deploy a newer version was a pain in the ass. So was debugging issues. Existing deployment and monitoring solutions were either too expensive, too complex, or didn't work well with resource-constrained devices like the Raspberry Pi. Examples: * Grafana/Prometheus for monitoring: Over-engineered for my needs. 
* OpenSearch/ELK for logs: A nightmare on low-resource devices. * Metabase for dashboards: A RAM-hungry monster that eats up more resources than if I hosted 100 applications. And to access the db remotely, opening a port and putting it behind Cloudflare Zero Trust is much easier than setting up Metabase. So I decided to build my own deployment and monitoring agent, and why not make it open source? The agent can currently deploy applications from GitHub by polling release tags, monitor device metrics, alert when thresholds are reached, and forward logs to a cloud dashboard. It's still in development, with features improving every week. If you are interested, give it a star on GitHub. https://beaconinfra.dev October 4, 2025 at 01:52AM
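Two of the agent behaviors listed above, deploying when a newer release tag appears and alerting when metrics reach a threshold, reduce to small comparisons. The Python sketch below is only an illustration under stated assumptions: it is not Beacon's code, the function names and metric names are hypothetical, tags are assumed to be plain numeric versions such as `v1.2.10`, and fetching tags or reading device metrics is left out.

```python
from typing import Dict, List, Tuple


def parse_tag(tag: str) -> Tuple[int, ...]:
    """Turn 'v1.2.10' into (1, 2, 10); assumes purely numeric version parts."""
    return tuple(int(part) for part in tag.lstrip("v").split("."))


def should_deploy(deployed_tag: str, latest_tag: str) -> bool:
    """Deploy only when the polled tag is strictly newer than the running one."""
    return parse_tag(latest_tag) > parse_tag(deployed_tag)


def breached(metrics: Dict[str, float], thresholds: Dict[str, float]) -> List[str]:
    """Return the names of metrics that reached or exceeded their threshold."""
    return sorted(
        name
        for name, limit in thresholds.items()
        if metrics.get(name, 0.0) >= limit
    )


# Hypothetical readings from a small device.
metrics = {"cpu_percent": 93.0, "ram_percent": 40.0}
thresholds = {"cpu_percent": 90.0, "ram_percent": 85.0}
alerts = breached(metrics, thresholds)
```

Comparing parsed tuples rather than raw strings matters because `"v1.2.10"` sorts before `"v1.2.9"` as text even though it is the newer release.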

Show HN: Was pissed about Google Docs, So I made a Text Editor myself https://ift.tt/VZdU4DR

Show HN: Was pissed about Google Docs, So I made a Text Editor myself It’s been a while since I started writing a book. The process has not been easy, first because I’m not a writer. I’ve written well-thought-out internet posts here and there, which ended up becoming my first book. It was a good experience, but then I started to think that a book that just gathered my online thoughts is not entirely “writing” a book; I needed more. So I opened Google Docs and started typing. Then I started to figure out what I wanted to write: should it be a fantasy story, an autobiography, or an observation of the world? I believe most writers have this figured out beforehand, but not me. I began writing pieces to see if they would fit together and make sense. I started gathering philosophical anecdotes based on my core beliefs and sensed something brewing. When I finally decided what the book would be about and the type of writing I wanted to do, I already had a sizable document with ideas scattered throughout it. That was good for me, as I could just join the pieces, but I didn’t want to be trapped in writing that could be repetitive. I wanted to have the ideas, the philosophy, the whole reason why the book is the way it is, stored in a place I could easily access. I planned to use AI as a memory dump, where I could add information during a conversation. Then, whenever I consulted it, I could check whether I had already written something and whether it reflected the tone and pace I wanted for my book. Everything seemed fine, but I ran into a few problems. First, the AI's writing was riddled with errors. I could get assistance and a sense of what to write, but the AI itself, due to our prolonged exchange, started to hallucinate and produce nonsense or "forget" our conversation. The second issue was that the AI couldn't consistently verify what was already written. 
As the text grew larger, the room left in the context window shrank, and the more I used the AI tool, the less helpful it became. So I decided to search for a tool that could do what I wanted. I found elements of it in each of the products I tried: some were extremely satisfying to write with, others had good features to enhance text, some allowed me to organize my book by laying out scattered ideas effectively, and still others used AI for correction and proofreading tasks. The solutions in this market are diverse and offer numerous approaches. I could easily switch between tools, but I wanted something unified to keep my writing process in one place. That’s why I created this text editor and called it SourcePilot. It’s a tool that identifies your writing style as you write, lets you add notes, sources, and videos, and uses them as context for the AI, enabling more nuanced outputs tailored to your writing. It was interesting to build, and I’m providing a link so you can try it. It’s a desktop app, and you can use it for free, depending on the hardware you have. I’m looking for people who can give me feedback on what's wrong with it: people who could not install it (I built it on Mac and could not test Linux and Windows) or who have problems logging in. I keep running into loads of problems myself because I’m using the tool right now as I write this text. I'm planning to launch a new version soon, featuring an anti-slop algorithm I’ve developed, along with document branching. I just want to see if there are people interested in using it at the moment. If there aren't users, that's fine. I think I’ve made something for myself anyway. :) Thank you for your attention if you made it this far. You’re greatly appreciated. Cheers! https://sourcepilot.co/ October 4, 2025 at 01:28AM

Show HN: FLE v0.3 – Claude Code Plays Factorio https://ift.tt/rRZOtnd

Show HN: FLE v0.3 – Claude Code Plays Factorio We're excited to release v0.3.0 of the Factorio Learning Environment (FLE), an open-source environment for evaluating AI agents on long-horizon planning, spatial reasoning, and automation tasks. == What is FLE? == FLE uses the game Factorio to test whether AI can handle complex, open-ended engineering challenges. Agents write Python code to build automated factories, progressing from simple resource extraction (~30 units/min) to sophisticated production chains (millions of units/sec). == What's new in 0.3.0 == - Headless scaling: No longer needs the game client, enabling massive parallelization! - OpenAI Gym compatibility: Standard interface for RL research - Claude Code integration: We're livestreaming Claude playing Factorio [on Twitch]( https://ift.tt/VJ1XEDr ) - Better tooling and SDK: 1-line CLI commands to run evaluations (with W&B logging) == Key findings == We evaluated frontier models (Claude Opus 4.1, GPT-5, Gemini 2.5 Pro, Grok 4) on 24 production automation tasks of increasing complexity. Even the best models struggle: - Most models still rely on semi-manual strategies rather than true automation - Agents rarely define helper functions or abstractions, limiting their ability to scale - Error recovery remains difficult – agents often get stuck in repetitive failure loops The performance gap between models on FLE correlates more closely with real-world task benchmarks (like GDPVal) than with traditional coding/reasoning evals. == Why this matters == Unlike benchmarks based on exams that saturate quickly, Factorio's exponential complexity scaling means there's effectively no performance ceiling. The skills needed - system debugging, constraint satisfaction, logistics optimization - transfer directly to real challenges. 
== Try it yourself == >>> uv add factorio-learning-environment >>> uv add "factorio-learning-environment[eval]" >>> fle cluster start >>> fle eval --config configs/gym_run_config.json We're looking for researchers, engineers, and modders interested in pushing the boundaries of agent capabilities. Join our Discord if you want to contribute. We look forward to meeting you and seeing what you can build! -- FLE Team https://jackhopkins.github.io/factorio-learning-environment/versions/0.3.0.html October 4, 2025 at 01:02AM

Thursday, October 2, 2025

Show HN: BetterBrain – Dementia prevention, covered by insurance https://ift.tt/UpQ6Pam

Show HN: BetterBrain – Dementia prevention, covered by insurance Hey all! I’ve been building BetterBrain for the past few months, which is the first dementia prevention program entirely covered by insurance. BetterBrain combines expert clinicians, comprehensive testing and state of the art AI - and for many insurance plans is $0. Research shows that dementia can be detected up to 20 years in advance. Despite this, many people at risk of dementia overlook regular brain health assessments. Many members of our founding team have family members affected by neurodegenerative disease. We’re also hiring aggressively if anyone is interested in changing the future of treating neurodegenerative disease. Would love to talk to anyone interested https://ift.tt/9KpFf6u https://ift.tt/9KpFf6u October 3, 2025 at 07:33AM

Show HN: Uber for Flights https://ift.tt/SYxwLE3

Show HN: Uber for Flights My friend and I built BookMyFlight to finally modernize flight search + booking. Why we built this: - Personalization. I fly the same route every month, and there’s no platform that knows my preferences so that I can open it, find and book my flight, and close it within a minute. - Booking is slow. I hate seeing a long clunky airline form each time I need to book. I want booking a flight to feel more like booking an Uber. How it works: 1. Optionally make an account and save your traveler preferences. Personally, I've specified my routine route as SFO to CLE and that I only want red-eye direct flights for this route. 2. Search for flights using chat or the search panel. Chat feels especially time-saving when you have preferences saved (e.g. I just say “search my routine trip"). 3. Once you find the flight you want, use the one-click book feature which books your flight directly with the airline. The first time you book a flight, you’ll have to fill out your traveler info, but you won't see that form after that. Notes: - Your booking is directly with the airline (this means when something goes wrong, you get direct support from the airline—not a third-party) - You can add your rewards numbers for each airline to keep earning points/status The ultimate goal is to create the best possible experience that every traveler wants, but that OTAs and airlines don’t care to create. Also very receptive to hearing pain points from frequent flyers; we think this space is really outdated and could use some innovation. Try it out and let us know what you think :) https://bookmyflight.ai October 3, 2025 at 01:29AM

Show HN: Enhance – A Terminal UI for GitHub Actions https://ift.tt/LvV3546

Show HN: Enhance – A Terminal UI for GitHub Actions I'm very excited to share what I've been working on lately! Introducing ENHANCE, a terminal UI for GitHub Actions that lets you easily see and interact with your PR checks. It's available under a sponsorware model. Get more info on the site: -> https://ift.tt/P0NGvyK This is an attempt to make my OSS development sustainable. Happy to hear feedback about the model as well as the tool! Cheers! https://ift.tt/IABamDu October 3, 2025 at 12:49AM

Show HN: Photo AI Editor – Edit, Transform and Enhance Photos with Text Prompt https://ift.tt/hMN64IH

Show HN: Photo AI Editor – Edit, Transform and Enhance Photos with Text Prompt https://ift.tt/6U5kwBW October 2, 2025 at 12:19PM

Wednesday, October 1, 2025

Show HN: Rostra is a P2P (f2f) social network https://ift.tt/URt16um

Show HN: Rostra is a P2P (f2f) social network A public instance is available at https://rostra.me/ . It will default to showing the interface from the perspective of my own identity, in a read-only mode. Click "Logout" and then "Random" to generate your own identity to play with. https://app.radicle.xyz/nodes/radicle.dpc.pw/rad%3AzzK566qFsZnXomX2juRjxj9K1LuF October 2, 2025 at 03:40AM

Show HN: Open-source project – HTTP cache and reverse proxy https://ift.tt/mjT5eEL

Show HN: Open-source project – HTTP cache and reverse proxy https://borislavv.github.io/advcache.dev/ October 1, 2025 at 01:11PM

Show HN: Ocrisp, One-Click RAG Implementation, Simple and Portable https://ift.tt/iehtcM0

Show HN: Ocrisp, One-Click RAG Implementation, Simple and Portable https://ift.tt/sjAngzL October 1, 2025 at 08:23PM
