Monday, December 16, 2024

Show HN: NCompass Technologies – yet another AI Inference API, but hear us out https://ift.tt/6pvm7HW

Show HN: NCompass Technologies – yet another AI Inference API, but hear us out Hello HackerNews! I’m excited to share what we’ve been working on at nCompass Technologies: an AI inference platform that gives you a scalable and reliable API to access any open-source AI model — with no rate limits. We don't have rate limits as optimizations we made to our AI model serving software enable us to support a high number of concurrent requests without degrading quality of service for you as a user. If you’re thinking, well aren’t there a bunch of these already? So were we when we started nCompass. When using other APIs, we found that they weren’t reliable enough to be able to use open source models in production environments. To resolve this, we're building an AI inference engine that enable you, as an end user, to reliably use open source models in production. Underlying this API, we’re building optimizations at the hosting, scheduling and kernel levels with the single goal of minimizing the number of GPUs required to maximize the number of concurrent requests you can serve, without degrading quality of service. We’re still building a lot of our optimizations, but we’ve released what we have so far via our API. Compared to vLLM, we currently keep time-to-first-token (TTFT) 2-4x lower than vLLM at the equivalent concurrent request rate. You can check out a demo of our API here: https://ift.tt/VrfEi8y As a result of the optimizations we’ve rolled out so far, we’re releasing a few unique features on our API: 1. Rate-Limits: we don’t have any Most other API’s out there have strict rate limits and can be rather unreliable. We don’t want API’s for open source models to remain as a solution for prototypes only. We want people to use these APIs like they do OpenAI’s or Anthropic’s and actually make production grade products on top of open source models. 2. Underserved models: we have them There are a ton of models out there, but not all of them are readily available for people to use if they don’t have access to GPUs. We envision our API becoming a system where anyone can launch any custom model of their choice with minimal cold starts and run the model as a simple API call. Our cold starts for any 8B or 70B model are only 40s and we’ll keep improving this. Towards this goal, we already have models like `ai4bharat/hercule-hi` hosted on our API to support non-english language use cases and models like `Qwen/QwQ-32B-Preview` to support reasoning based use cases. You can find the other models that we host here: https://ift.tt/es9IrGi. We’d love for you to try out our API by following the steps here: https://ift.tt/u8bZMPc . We provide $100 of free credit on sign up to run models, and like we said, go crazy with your requests, we’d love to see if you can break our system :) We’re still actively building out features and optimizations and your input can help shape the future of nCompass. If you have thoughts on our platform or want us to host a specific model, let us know at hello@ncompass.tech. Happy Hacking! https://ift.tt/HlkJQLT December 16, 2024 at 05:37PM

Show HN: GitHub Stars Semantic Search - Find Your Starred Projects https://ift.tt/S5cxMFW

Show HN: GitHub Stars Semantic Search - Find Your Starred Projects https://ift.tt/0C8lTgj December 16, 2024 at 09:06AM

Sunday, December 15, 2024

Show HN: Dbine – Auxiliary tools related to databases https://ift.tt/ZrtRNiF

Show HN: Dbine – Auxiliary tools related to databases https://ift.tt/rnMTKio December 15, 2024 at 11:02PM

Show HN: SmartHome – An Adventure Game https://ift.tt/Qg6dIHw

Show HN: SmartHome – An Adventure Game SmartHome is a free, browser-based game written in vanilla JavaScript and no libraries. I don't want to spoil anything about the gameplay, but if you like text adventures, point-and-click adventure games, puzzle games, escape room games, art games, incremental games, cozy games, and/or RPGs, then this might be your speed. If you find it too hard and don't mind some mild spoilers, then check out the hints page: https://smarthome.steviep.xyz/help Enjoy! https://smarthome.steviep.xyz December 15, 2024 at 10:35PM

Saturday, December 14, 2024

Show HN: 31Memorize–Free vocab builder with FSRS-5 spaced repetition https://ift.tt/5FSNHip

Show HN: 31Memorize–Free vocab builder with FSRS-5 spaced repetition Mangoosh alternative, but cheaper and designed to maximize GRE prep efficiency through targeted learning. Free during beta. Your feedback is much appreciated to help polish the product. https://ift.tt/vyV4nMg December 15, 2024 at 07:47AM

Show HN: AI Powered Daily Budgeting https://ift.tt/KfkqjvL

Show HN: AI Powered Daily Budgeting https://ift.tt/smD4nTt December 15, 2024 at 06:34AM

Show HN: Library to replace box shadows on a webpage with ray traced shadows https://ift.tt/Py2evft

Show HN: Library to replace box shadows on a webpage with ray traced shadows https://ift.tt/fJUoLd3 December 15, 2024 at 02:54AM

Show HN: A simple web game to help learn chords and basic progressions https://ift.tt/otDGIXT

Show HN: A simple web game to help learn chords and basic progressions Hi Hacker News, I've created Chord Nebula, a simple web-based game designed to help users learn and practice piano chords, basic progressions, and harmony fundamentals. The game integrates with MIDI keyboards, allowing you to play chords in real-time and receive immediate feedback based on the key you choose. GitHub Repository: https://ift.tt/lJN7Y6U Live Demo: https://ift.tt/DI8zCV7 Requirements: To use Chord Nebula, you'll need a MIDI keyboard connected to your computer. Current Status: Chord Nebula is still a simple project. I'm committed to improving it based on user feedback and would greatly appreciate any support or contributions from the community. Looking for Feedback and Collaborators: I'm eager to hear your thoughts on Chord Nebula! Whether it's suggestions for new features, improvements, or bug reports, your feedback is invaluable. Additionally, if you're interested in collaborating to enhance the game, feel free to reach out or contribute directly via GitHub. Thanks for taking the time to check out Chord Nebula! https://ift.tt/tD4gpLJ December 14, 2024 at 04:35PM

Friday, December 13, 2024

Show HN: @smoores/epub, a JavaScript library for working with EPUB publications https://ift.tt/aeKqmjx

Show HN: @smoores/epub, a JavaScript library for working with EPUB publications Howdy! I've just written a blog post about this, and I figured I would share it here: https://ift.tt/UcabYhZ . As I've been working on Storyteller[1], I've been developing a library for working with EPUB files, since that's a large amount of the work that Storyteller does. After a friend asked for advice on creating EPUB books in Node.js, I decided to publish Storyteller's EPUB library as a standalone NPM package. I really love the EPUB spec, and I think the Node.js developer community deserves an actively maintained library for working with it! [1]: https://ift.tt/6iX7ou9 https://ift.tt/fv1x6o9 December 14, 2024 at 01:22AM

Show HN: Novel Minds, an AI Book Producer https://ift.tt/8mis7DQ

Show HN: Novel Minds, an AI Book Producer Read AI-illustrated versions of the classics for free at http://novelminds.ai . I hope this inspires people to read the classics and ultimately enriches all books so more people will read them. https://ift.tt/AoEp6ue December 13, 2024 at 10:25PM

Thursday, December 12, 2024

Show HN: I designed an espresso machine and coffee grinder from scratch https://ift.tt/fZgmUqb

Show HN: I designed an espresso machine and coffee grinder from scratch It was a lot of work as a solo project but I hope you guys think it’s cool. When I say “we” in the website it’s only in the most royal sense possible. I also did all the photo/videography. I started out designing a single machine for personal use, but like many things it sort of spiraled out of control from there. I felt like espresso machines were getting very large, plasticky, and app-integrated without actually improving the underlying technologies that make them work. The noisy vibratory pumps in particular are from 1977 and haven’t really changed since then. So I wanted to focus on making the most advanced internals I could and leaving everything else as minimalist as possible. The pump is, as far as I know, completely unique in terms of power density and price. Without spending several thousand dollars, it was difficult to find a machine with a gear pump, and adjustable pressure was also similarly expensive but this machine has those things and costs a normal amount to buy. You can also turn the pressure way down and make filter coffee. I also saw so many people (including myself) using a scale while making espresso, and even putting a cup below the group head to catch drips, entirely negating the drip tray, so I basically designed for that! The profile of the machine is much lighter on the eyes and doesn’t loom in the corner like my old espresso machine did. And for the grinder, basically everything on the market uses conical and flat burrs that have descended from spice grinders, and the same couple of standard sizes. Sometimes larger companies design their own burrs, but only within those existing shapes. There is sort of a rush to put larger and larger burrs into coffee grinders, which makes sense, but with cylindrical burrs, you can increase the cutting surface way more relative to the size of the grinder. When grinders get too big, maintaining alignment becomes mechanically cumbersome, but the cylindrical burr can be very well supported from the inside, and there is the added benefit of hiding the entire motor within the burr itself. The resulting grounds are just outright better than all the other grinders I have used, but obviously this is a matter of taste and my own personal bias. The biggest downside for the grinder is that it doesn’t work with starbucks style oily roasts, because the coffee expands so much while traveling down through the burrs and can sometimes clog up the teeth. It doesn’t hurt the grinder but it does require cleaning (which is tool-free!). Another downside for both machines is the fact that they run on DC power so it’s best if you have a spot in your kitchen to tuck away the power brick. I also made a kit that makes the gear pump a drop-in upgrade for other espresso machines, to reduce noise and add adjustable pressure. https://ift.tt/cPDHAKl The roughest part of this process were the moments midway through development where they weren’t working at all. When the grinder is just jamming itself instantly or the fourth factory in a row tells you the part you’re making is impossible or the pump is alternating between spraying water out the side and into your face and not pumping at all. And the default thought is “Of course it’s not working, if this was going to work someone else would have already made it like this”. The route you’ve taken is fundamentally different enough that there are no existing solutions to draw on. You’re basically feeling around in the dark for months on end, burning money, and then one day, every little cumulative change suddenly adds up to a tasty espresso. And it’s not perfect yet, but you at least can see the road ahead. Anyways, this is way more than I expected to write, thank you for reading! Tell me if you have any questions https://velofuso.com December 13, 2024 at 06:33AM

Show HN: AI-powered, open source LeetCode alternative https://ift.tt/3WXavrU

Show HN: AI-powered, open source LeetCode alternative https://ift.tt/IsveLh4 December 13, 2024 at 04:23AM

Show HN: A mobile app that generates mobile apps in 30 secs https://ift.tt/u5dX4qA

Show HN: A mobile app that generates mobile apps in 30 secs Hi HN! Daniel from YC S21 here, launching a project called aSim ( https://asim.sh/ ), which lets people generate/simulate usable native apps (called Sims). Describe an app you want and aSim will generate it. Then, edit and refine it to better suit your needs. Sims are also shareable via links, and basic app functionality is also available through web (though mobile is much more feature complete). A couple of my favorite Sims so far: - Doom demo: https://ift.tt/AXjQpL2 - Star wars idle game: https://ift.tt/tcoUjfS - Hotdog or Not Hotdog: https://asim.sh/s/3748 - Height guesser: https://ift.tt/pFT1iwg - 2048: https://ift.tt/uLVPs1I Would love feedback around the experience and additional functionality you'd like surfaced! December 12, 2024 at 11:06PM

Show HN: Made ready-to-use zip files with TOP animations on NextJS https://ift.tt/PCXQdIy

Show HN: Made ready-to-use zip files with TOP animations on NextJS I'm a solopreneur passionate about web dev and creating tools for developers. Over the years, I've built countless animations for projects and realized many people struggle with implementing complex visuals easily. So, I made BuildFast for two reasons: -To help developers and designers learn and use high-quality animations built with Next.js and Three.js. -To save time with ready-to-use ZIP files or access the entire collection with login-protected resources. Whether you're looking for inspiration, want to learn from reusable code, or need to quickly boost your project’s visuals, BuildFast is for you. Would love your feedback and thoughts! www.buildfast.es Cheers, Arthur(Faxraddin) https://ift.tt/N45FTcv December 12, 2024 at 09:56PM

Wednesday, December 11, 2024

Show HN: Lfi – a lazy functional sync, async, and concurrent iteration library https://ift.tt/ZEJqMuQ

Show HN: Lfi – a lazy functional sync, async, and concurrent iteration library Hey HN! Roughly 4 years ago I started building a lazy functional iteration library for JS/TS. I had a few goals for the library: - Supporting sync, sequential async, and concurrent async iteration - Limiting it to a small number of orthogonal concepts that compose beautifully to solve problems - Making it fully tree-shakeable I built it for myself and have (mostly) been its only user as I refined it. I've used it in lots of personal projects and really enjoyed it. I recently decided it would be nice to spread that enjoyment so I created a documentation website complete with a playground where you can try out the library. I hope you enjoy using it as much as I do! Looking forward to hearing your thoughts :) https://lfi.dev/ December 12, 2024 at 01:37AM

Show HN: Jp3g: a fast private bulk image to JPEG/WebP converter https://ift.tt/Wj7tKMo

Show HN: Jp3g: a fast private bulk image to JPEG/WebP converter https://jp3g.org/ December 11, 2024 at 09:23PM

Tuesday, December 10, 2024

Show HN: Power-assert for hierarchical data structures https://ift.tt/INCa2Si

Show HN: Power-assert for hierarchical data structures https://ift.tt/6fNYcsi December 11, 2024 at 03:01AM

Show HN: Gentrace – connect to your LLM app code and run/eval it from a UI https://ift.tt/LNFlAKU

Show HN: Gentrace – connect to your LLM app code and run/eval it from a UI Hey HN - Doug from Gentrace here. We originally launched Gentrace via Show HN in August of 2023. Since then, a million products have emerged in the LLM ops category. And what we've noticed is that almost none of them solve the core workflow: testing prompts, parameters, and other changes in your actual app, from a frontend where people can collaborate on the dataset, evals, or experiments to be run. So, we built that and are relaunching the company around that idea. Gentrace is the collaborative LLM app testing and experimentation platform that brings together engineers, PMs, subject matter experts, and more to run and test your actual end-to-end app. To do this, use our SDK to: - connect your app to Gentrace as a live runner over websocket (local) / via webhook (staging, prod) - wrap your parameters (eg prompt, model, top-k) so they become tunable knobs in the front end - edit the parameters and then run / evaluate the actual app code with datasets and evals in Gentrace We think it's great for tuning retrieval systems, upgrading models, and iterating on prompts. It's free to trial. Would love to hear your feedback / what you think. https://gentrace.ai/ December 11, 2024 at 02:05AM

Show HN: Chrome extension – See if HN discussed a website while you browse it https://ift.tt/8ZSu4R0

Show HN: Chrome extension – See if HN discussed a website while you browse it I built a Chrome extension that shows existing Hacker News posts for the current website. Background: Often when I read articles or websites, I find the associated HN posts and discussions at least as interesting as the content itself. I have been wishing for a solution to have all HN discussions for a page I am visiting directly to my fingertips, without searching. In fact, there are already extensions for it, but all of them (at the time of my search) were only developed with Manifest v2 and will therefore soon be deactivated. For this reason (and to get to know the development process of an extension better) I decided to develop a solution myself using the HN Algolia Search API and Manifest v3. The result is a simple little tool that solves my problem perfectly. Feel free to try it out - it's open source and available in Chrome Web Store. https://ift.tt/Aupd6ZS December 10, 2024 at 09:43PM

Monday, December 9, 2024

Show HN: FormML – A DSL for building complex web forms https://ift.tt/lBw3jiA

Show HN: FormML – A DSL for building complex web forms Hi everyone! I wrote a DSL (named Form Modeling Language) for modeling & building complex forms and am glad to share it with you now. Over the years, I’ve encountered many challenges while building complex forms from scratch—challenges that I believe are common, difficult, and yet often overlooked. These include managing interdependent fields, handling intricate validation rules, and maintain good collaboration between technical and non-technical people. FormML is my attempt to address these pain points. The project's README goes into more detail, but in short, FormML offers a model-first approach to form development (inspired by Prisma), focusing on ease of use for both developers and non-developers. Lastly, there is a design question I’d love your input: FormML has a primitive type called decimal , used for high-precision decimal numbers. Since one of FormML's design principles is to be as readable as possible to non-programmers, I’m considering renaming it to currency . However, currency might feel too narrow and not cover all applications. What do you think? - Stick with decimal ? - Switch to currency ? - Support both via aliases? Looking forward to your thoughts and feedback! https://ift.tt/QDTPYz9 December 10, 2024 at 12:16AM

Show HN: Kstack – Skill pack for monitoring/troubleshooting K8s in Claude Code https://ift.tt/GQauRgE

Show HN: Kstack – Skill pack for monitoring/troubleshooting K8s in Claude Code Hi All, Recently I've been using Claude Code a lot for de...