Sunday, October 5, 2025

Show HN: A Node.js CLI tool to generate ai.txt, llms.txt, robots.txt, humans.txt https://ift.tt/OfDnGeR

Show HN: A Node.js CLI tool to generate ai.txt, llms.txt, robots.txt, humans.txt https://ift.tt/oNZKUrc October 6, 2025 at 09:28AM

Show HN: High-fidelity, compact, and real time rendering of university campus https://ift.tt/ExjboKt

Show HN: High-fidelity, compact, and real time rendering of university campus Technical thread: https://ift.tt/X8UBZ4n https://hoanh.space/aalto/ October 6, 2025 at 05:21AM

Saturday, October 4, 2025

Show HN: An open-source, RL-native observability framework we've been missing https://ift.tt/ietSHwr

Show HN: An open-source, RL-native observability framework we've been missing The RL ecosystem is maturing— verifiers are standardizing how we build and share environments. However, as it grows, we need observability tooling that actually understands RL primitives. Running RL experiments without visibility into rollout quality, reward distributions, or failure modes is a waste of time. Monitor provides live tracking, per-example inspection, and programmatic access—see what's happening during runs and debug what went wrong afterward. https://ift.tt/0Lz1VIO October 5, 2025 at 03:05AM

Show HN: World Amazing Framework: Like Django for Civilization https://ift.tt/cBZgEuj

Show HN: World Amazing Framework: Like Django for Civilization Any initial thoughts? This framework is meant to be a tool for construction, so if you want to play around with it for creating potential specific implementations, you can drop the contents of the website, the GitHub README, and the entire overview.md into an AI chat, and that should be enough to use the framework, at least conceptually. Would y'all want me to pre-prime a chat in Google AI Studio with the full context of the plan and some basic direction for discourse? I can share a link to a ready-to-go environment. The core documentation should answer most mechanical questions. And if you feed the docs into an AI chat, you can ask it any question you may have, or to simply ask it to explain something in different ways, or hypothesize solutions to any world issue, either systemic or regional. Gemini Pro 2.5 can take the full doc in one prompt, and its ability to co-create ideas is remarkable. I've been using it mostly through the AI Studio interface. Much of the overview is as much my work as it is a synthesis of my collaboration with Gemini Pro 2.5, ChatGPT-4o, and some early contributions from GPT-4 about a year ago. Before LLMs, I was building out pamphlet-style pages on a website (that are up at whomanatee.org, which is the base wrapper implementation of the framework), and I was planning to use them as talking points. I was anticipating that much of the deep thinking would have to happen in slow, public discourse. With LLMs, I've been able to stress-test these ideas from every possible angle, using any past event or theory to see if the framework could withstand scrutiny. At one point, a model argued that Adam Smith would have rejected this idea as fantasy. So I worked with it to develop an economic plan that "synthetic Adam" praised. It's incredible that we now have the ability to get synthesized thoughts from almost any perspective. You could ask it, "What would Barack Obama think of this plan? And using the framework, what would be your response to any hesitations he may have?" And it responds with incredible analysis, synthesis, and feedback. https://ift.tt/8QzGKWh October 5, 2025 at 03:44AM

Show HN: Run – a CLI universal code runner I built while learning Rust https://ift.tt/sM0NdnZ

Show HN: Run – a CLI universal code runner I built while learning Rust Hi HN — I’m learning Rust and decided to build a universal CLI for running code in many languages. The tool, Run, aims to be a single, minimal dependency utility for: running one-off snippets (from CLI flags), running files, reading and executing piped stdin, and providing language-specific REPLs that you can switch between interactively. I designed it to support both interpreted languages (Python, JS, Ruby, etc.) and compiled languages (Rust, Go, C/C++). It detects languages from flags or file extensions, can compile temporary files for compiled languages, and exposes a unified REPL experience with commands like :help, :lang, and :quit. Install: cargo install run-kit (or use the platform downloads on GitHub). Source & releases: https://ift.tt/TDCoZ2l I used Rust while following the official learning resources and used AI to speed up development, so I expect there are bugs and rough edges. I’d love feedback on: usability and UX of the REPL, edge cases for piping input to language runtimes, security considerations (sandboxing/resource limits), packaging and cross-platform distribution. Thanks — I’ll try to answer questions and share design notes. https://ift.tt/TDCoZ2l October 5, 2025 at 12:04AM

Friday, October 3, 2025

Show HN: Beacon (open source) – Built after AWS billed me 700% more for RDS https://ift.tt/sRJM4H0

Show HN: Beacon (open source) – Built after AWS billed me 700% more for RDS I've been hosting my side project on AWS. I was paying an okay price for not managing infrastructure at all. I moved everything to AWS Ligthsail after my startup credits run out. The project was initially a success and made several thousand euros per month in revenue. Then came covid with new regulations, and suddenly my customers were non existent (the problem it solved was no longer there). After that it was not making me money, I was paying it from my own pocket to maintain it, thinking maybe it will come back. Then one day, after some ignored spam AWS emails, I got a huge charge on my card, along with a bill from AWS. The charge was orders of magnitude higher than the previous charges."WTF??" I said to myself while rushing to log into the dashboard to see what the issue was. No DDoS, no misconfiguration, nothing unusual. I logged into the root account to look at the billing page, and there it was:RDS PostgreSQL legacy fee ~€200 because I did not upgrade to Postgres 16 (from 13). I was baffled. I paid monthly €25 (27% tax included) for the smallest RDS instance, then I see this monster fee for something I think should cost maybe €2. I mean AWS just has to run it in a different environment. For €200 I could buy them a new server to run it for me. That's when I had the realization: "I have a spare Raspberry Pi 3, I'll just host everything on that. That will be for free." But self-hosting came with it's own challenges, especially on a resource-constrained device. I needed better tools to deploy and monitor my application. SSH-ing into the Raspberry Pi every time I wanted to deploy a newer version was a pain in the ass. So was debugging issues. Existing deployment and monitoring solutions were either too expensive, too complex, or didn't work well with resource-constrained devices like Raspberry Pi. Examples: * Grafana/Prometheus for monitoring: Over-engineered for my needs. * OpenSearch/ELK for logs: A nightmare on low-resource devices. * Metabase for dashboards: A ram hungry monster that eats up more resources than if I hosted 100 applications. And to access the db remotely opening a port and putting it behind Cloudflare Zero Trust is much easier than setting up Metabase. So I decided to build my own deployment and montitoring agent, and why not make it opensource? The agent can currently deploy applications from github by polling release tags, monitor device metrics, alert when thresholds are reached, forward logs to cloud dashboard. It's still in development, with features improving every week. If you are interested, give it a start on Github. https://beaconinfra.dev October 4, 2025 at 01:52AM

Show HN: Was pissed about Google Docs, So I made an Text Editor myself https://ift.tt/VZdU4DR

Show HN: Was pissed about Google Docs, So I made an Text Editor myself It’s been a while since I’ve started to write a book . The process of creation of it has not been easy , first because I’m not a writer , I’ve created well though out internet posts here and there, which ended up creating my first book. It was a good experience , but then I’ve started to think that a book that just gathered my thoughts online it’s not entirely “writing” a book , I needed more. And than I’ve opened google docs and start typing. Then I started to figure out what I wanted to write: should it be a fantasy story, a self-biography, or an observation of the world? I believe most writers have this figured out beforehand, but not me. I began writing pieces to see if they would fit together and make sense. I started gathering philosophical anecdotes based on my core beliefs and sensed something brewing. When I finally decided what the book would be about, and what I wanted to write, the type of writing I wanted to do, I saw an already sizable document with ideas scattered throughout it. That was good for me, as I could just join the pieces, but I didn’t want to be trapped in writing that could be repetitive. I wanted to have the ideas, philosophy, the whole reason why the book is like this, stored in a place I could easily access. I'm planning to use AI as a memory dump, where I can add information during a conversation. Then, whenever I consult it, I can check if I've already written something and if it reflects the temper and pace I want for my book. Everything seems fine, but we encountered a few problems. First, the AI's writing was a conundrum of errors. I could gain assistance and a sense of what to write, but the AI itself, due to our prolonged interchange, started to hallucinate and produce nonsense or "forget" our conversation. The second issue was that the AI couldn't consistently verify what was already written. As the text grew larger, the context window began to shrink, and the more I used the AI tool, the less helpful it became. So I decided to search for a tool that could do what I wanted. I found elements in each of the products I've used: some were extremely satisfying to write with, others had good features to enhance text, some allowed me to organize my book by scattering ideas effectively, and still others used AI for correction and proofreading tasks. The solutions for this market are diverse and offer numerous approaches. I could easily transition between tools, but I wanted something unified to keep my writing process in one place. That’s why I created this text editor and called it SourcePilot. It’s a tool that identifies your writing style as you write, allowing you to add notes, sources, and videos, and to use them as context for the AI, enabling more nuanced outputs tailored to your writing. It was interesting to build, and I’m providing a link you can try. It’s a desktop app, and you can use it for free, depending on the hardware you have. I’m looking for people who could give me feedback on what's wrong with it. People who could not install it (I’ve built it on Mac and could not test Linux and Windows), or have problems logging in. I keep getting loads of problems because I’m using the tool right now as I write this text. I'm planning to launch a new version soon, featuring an anti-slop algorithm I’ve developed, along with document branching. I just want to see if there are people interested in using it at the moment. If there aren't users, that's fine. I think I’ve made something for myself anyway. :) Thank you for your attention if you made it this far. You’re greatly appreciated. Cheers! https://sourcepilot.co/ October 4, 2025 at 01:28AM

Show HN: FLE v0.3 – Claude Code Plays Factorio https://ift.tt/rRZOtnd

Show HN: FLE v0.3 – Claude Code Plays Factorio We're excited to release v0.3.0 of the Factorio Learning Environment (FLE), an open-source environment for evaluating AI agents on long-horizon planning, spatial reasoning, and automation tasks. == What is FLE? == FLE uses the game Factorio to test whether AI can handle complex, open-ended engineering challenges. Agents write Python code to build automated factories, progressing from simple resource extraction (~30 units/min) to sophisticated production chains (millions of units/sec). == What's new in 0.3.0 == - Headless scaling: No longer needs the game client, enabling massive parallelization! - OpenAI Gym compatibility: Standard interface for RL research - Claude Code integration: We're livestreaming Claude playing Factorio [on Twitch]( https://ift.tt/VJ1XEDr ) - Better tooling and SDK: 1-line CLI commands to run evaluations (with W&B logging) == Key findings == We evaluated frontier models (Claude Opus 4.1, GPT-5, Gemini 2.5 Pro, Grok 4) on 24 production automation tasks of increasing complexity. Even the best models struggle: - Most models still rely on semi-manual strategies rather than true automation - Agents rarely define helper functions or abstractions, limiting their ability to scale - Error recovery remains difficult – agents often get stuck in repetitive failure loops The performance gap between models on FLE correlates more closely with real-world task benchmarks (like GDPVal) than with traditional coding/reasoning evals. == Why this matters == Unlike benchmarks based on exams that saturate quickly, Factorio's exponential complexity scaling means there's effectively no performance ceiling. The skills needed - system debugging, constraint satisfaction, logistics optimization - transfer directly to real challenges. == Try it yourself == >>> uv add factorio-learning-environment >>> uv add "factorio-learning-environment[eval]" >>> fle cluster start >>> fle eval --config configs/gym_run_config.json We're looking for researchers, engineers, and modders interested in pushing the boundaries of agent capabilities. Join our Discord if you want to contribute. We look forward to meeting you and seeing what you can build! -- FLE Team https://jackhopkins.github.io/factorio-learning-environment/versions/0.3.0.html October 4, 2025 at 01:02AM

Thursday, October 2, 2025

Show HN: BetterBrain – Dementia prevention, covered by insurance https://ift.tt/UpQ6Pam

Show HN: BetterBrain – Dementia prevention, covered by insurance Hey all! I’ve been building BetterBrain for the past few months, which is the first dementia prevention program entirely covered by insurance. BetterBrain combines expert clinicians, comprehensive testing and state of the art AI - and for many insurance plans is $0. Research shows that dementia can be detected up to 20 years in advance. Despite this, many people at risk of dementia overlook regular brain health assessments. Many members of our founding team have family members affected by neurodegenerative disease. We’re also hiring aggressively if anyone is interested in changing the future of treating neurodegenerative disease. Would love to talk to anyone interested https://ift.tt/9KpFf6u https://ift.tt/9KpFf6u October 3, 2025 at 07:33AM

Show HN: Uber for Flights https://ift.tt/SYxwLE3

Show HN: Uber for Flights My friend and I built BookMyFlight to finally modernize flight search + booking. Why we built this: - Personalization. I fly the same route every month, and there’s no platform that knows my preferences so that I can open it, find and book my flight, and close it within a minute. - Booking is slow. I hate seeing a long clunky airline form each time I need to book. I want booking a flight to feel more like booking an Uber. How it works: 1. Optionally make an account and save your traveler preferences. Personally, I've specified my routine route as SFO to CLE and that I only want red-eye direct flights for this route. 2. Search for flights using chat or the search panel. Chat feels especially time-saving when you have preferences saved (e.g. I just say “search my routine trip"). 3. Once you find the flight you want, use the one-click book feature which books your flight directly with the airline. The first time you book a flight, you’ll have to fill out your traveler info, but you won't see that form after that. Notes: - Your booking is directly with the airline (this means when something goes wrong, you get direct support from the airline—not a third-party) - You can add your rewards numbers for each airline to keep earning points/status The ultimate goal is to create the best possible experience that every traveler wants, but that OTAs and airlines don’t care to create. Also very receptive to hearing pain points from frequent flyers; we think this space is really outdated and could use some innovation. Try it out and let us know what you think :) https://bookmyflight.ai October 3, 2025 at 01:29AM

Show HN: Enhance – A Terminal UI for GitHub Actions https://ift.tt/LvV3546

Show HN: Enhance – A Terminal UI for GitHub Actions I'm very excited to share what I've been working on lately! Introducing ENHANCE, a terminal UI for GitHub Actions that lets you easily see and interact with your PRs checks. It's available under a sponsorware model. Get more info on the site: -> https://ift.tt/P0NGvyK This is an attempt to make my OSS development something sustainable. Happy to hear feedback about the model as well as the tool! Cheers! https://ift.tt/IABamDu October 3, 2025 at 12:49AM

Show HN: Photo AI Editor – Edit, Transform and Enhance Photos with Text Prompt https://ift.tt/hMN64IH

Show HN: Photo AI Editor – Edit, Transform and Enhance Photos with Text Prompt https://ift.tt/6U5kwBW October 2, 2025 at 12:19PM

Wednesday, October 1, 2025

Show HN: Rostra is a P2P (f2f) social network https://ift.tt/URt16um

Show HN: Rostra is a P2P (f2f) social network A public instance is available at https://rostra.me/ . It will default to showing the interface from the perspective of my own identity, in a read-only mode. Click "Logout" and then "Random" to generate your own identity to play with. https://app.radicle.xyz/nodes/radicle.dpc.pw/rad%3AzzK566qFsZnXomX2juRjxj9K1LuF October 2, 2025 at 03:40AM

Show HN: Open-source project – HTTP cache and reverse proxy https://ift.tt/mjT5eEL

Show HN: Open-source project – HTTP cache and reverse proxy https://borislavv.github.io/advcache.dev/ October 1, 2025 at 01:11PM

Show HN: Ocrisp, One-Click RAG Implementation, Simple and Portable https://ift.tt/iehtcM0

Show HN: Ocrisp, One-Click RAG Implementation, Simple and Portable https://ift.tt/sjAngzL October 1, 2025 at 08:23PM

Tuesday, September 30, 2025

Show HN: Rust BPE tokenizer for Qwen models that's 12x faster than HuggingFace https://ift.tt/bRGp4zF

Show HN: Rust BPE tokenizer for Qwen models that's 12x faster than HuggingFace https://ift.tt/NXlcTM2 September 30, 2025 at 11:58PM

Show HN: Glide, an extensible, keyboard-focused web browser https://ift.tt/KQTCHsl

Show HN: Glide, an extensible, keyboard-focused web browser https://ift.tt/fYH90ne October 1, 2025 at 01:13AM

Show HN: Build AI Agents in WhatsApp (Beta) https://ift.tt/x2CKpXQ

Show HN: Build AI Agents in WhatsApp (Beta) https://ift.tt/LxI4nNJ October 1, 2025 at 12:55AM

Show HN: I got tired of spreadsheets, so I built a Python GUI to track invoices https://ift.tt/Q0p2t8S

Show HN: I got tired of spreadsheets, so I built a Python GUI to track invoices Hey HN, As a freelancer, I found myself spending time every month manually opening PDF and Word invoices, copying the details into a spreadsheet, and tracking payments. It was tedious and error-prone. I decided to build a simple desktop app to automate this. It's a GUI built with Python and Tkinter that points to a folder of invoices, parses the key details (invoice #, amount, date), and stores everything in a local SQLite database for tracking and analysis. It's been quite a time- and headache-saver for me. A Note on Simplicity & Caveats I'm sharing this in case it's useful to any other freelancers or businesses, but I want to be upfront about its limitations: The UI is very basic. It's built with vanilla Tkinter and is all about function over form. It's not the prettiest app, but it gets the job done. The core automation relies on a "patterns" feature that matches invoice prefixes to clients. This is super useful for my own workflow but might be a bit niche if your invoice naming is less consistent. You can still use manual entry if patterns don't work for you. The stats dashboard is Euro-centric right now and aggregates all currencies into a total shown in Euros (€). I plan to fix this later. It's a simple personal project that solves a personal problem. The code is on GitHub, and I'd love to hear any feedback or suggestions you might have. https://ift.tt/dUJMQ1Z September 30, 2025 at 08:33PM

Monday, September 29, 2025

Show HN: Resrap – A Parser but in Reverse https://ift.tt/WA72UMI

Show HN: Resrap – A Parser but in Reverse I built Resrap, a Go package that takes a grammar in ABNF format and generates infinitely long sequences of syntactically correct code...either completely randomly or with seeds for a deterministic generation. ABNF is a modified version of EBNF( https://ift.tt/BQdfirZ... ) I made for this project, basically means you can specify when generating code 20% of lines will be if{} blocks and 50% will be while{} blocks which allows for more natural code generation, plus support for infinite generation of code. It’s very fast...it generated ~40 million tokens of C syntax in about 26 seconds on my laptop and supports multithreading which actually saw boosts in performance since its very easy to parallelize. I originally made this for a typing-test project (I didn’t want to store code snippets manually), but it turned out to be useful in other contexts too, like: - Stress-testing parsers and linters - Creating non-copyrighted “lorem ipsum” code for tech demos - Generating those endless “hacker” code scenes you see in movies Curious what other cool things people might do with it! Github: https://ift.tt/F9uz1Gg Website: https://ift.tt/lbgzrK7 https://ift.tt/lbgzrK7 September 30, 2025 at 12:20AM

Show HN: Promptlet – Mac app to help you stop typing "ultrathink" over and over https://ift.tt/KSDsqyv

Show HN: Promptlet – Mac app to help you stop typing "ultrathink" over and over https://ift.tt/iRynIck October 13, 2025 at 02:29AM...