Thursday, February 8, 2024

Show HN: Open-source template for end-to-end streaming analytics https://ift.tt/H4wUZKe

Show HN: Open-source template for end-to-end streaming analytics To help my future self, I decided to build a repository in which I can quickly deploy an end-to-end modern analytics pipeline, from ingestion to fast analytics and business dashboards, including data exploration, time-series forecasting, and monitoring of the stack. Of course, all the components are open source, and you can use this template as a stepping stone for your near-realtime streaming analytics. What's the inspiration? I’ve been working with streaming analytics for a long time. I’ve done not-too-stale analytics with a RDBMs incremental query and a spreadsheet, gone over the micro-batch-looks-almost-like-real-time lambda analytics, and the near-real-time analytics since kappa and afterwards. The range and features of tools today is way better than what we had 15 years ago. What remains constant is the requirement for freshness of data, and for more advanced analytics. This means that you cannot really build a reliable data pipeline for near-realtime analytics at scale using a single component, and every time you start a new project you waste a lot of time just integrating the different moving parts. When the repository starts, the pipeline will collect public events from the GitHub API, send them to a message broker (Apache Kafka), persist them into a fast time-series database (QuestDB), and visualize them on a dashboard (Grafana). It will also provide a web-based development environment (Jupyter Notebook) for data science and machine learning. Monitoring metrics are captured by a server agent (Telegraf) and stored back into the time-series database (QuestDB). Hopefully others in the community find this useful! https://ift.tt/ebUNzvc February 9, 2024 at 01:22AM

Show HN: AI and AR 20 yrs in the future: A utopian/distopian RPG https://ift.tt/kQzGXj9

Show HN: AI and AR 20 yrs in the future: A utopian/distopian RPG https://ift.tt/FWBbetp February 9, 2024 at 12:11AM

Daily Price Tracking for Trader Joes https://ift.tt/jfy2Wr0

Daily Price Tracking for Trader Joes https://ift.tt/HRAy5gl February 8, 2024 at 10:07PM

Show HN: Audiocate – a Haskell library for combating audio deepfake misuse https://ift.tt/kpcSIUs

Show HN: Audiocate – a Haskell library for combating audio deepfake misuse Audiocate is a Haskell library for audio verification and source validation to attempt to combat AI generated audio deepfake misuse. It's currently just a MSc dissertation project but hoping to make it actually usable in the near future https://ift.tt/fLjyuh9 February 8, 2024 at 04:19PM

Show HN: Open-source code editor with autocomplete built-in https://ift.tt/wkflXJm

Show HN: Open-source code editor with autocomplete built-in https://ift.tt/fNMq2gy February 8, 2024 at 12:27AM

Wednesday, February 7, 2024

Show HN: Directory of All LLM Models(Closed and Open Source) https://ift.tt/aSJVOsp

Show HN: Directory of All LLM Models(Closed and Open Source) https://llmmodels.org/ February 8, 2024 at 05:10AM

Show HN: kbackup – No-config push/pull incremental backups built on top of rsync https://ift.tt/1UMPVHB

Show HN: kbackup – No-config push/pull incremental backups built on top of rsync https://ift.tt/ZiGYhyW February 7, 2024 at 11:32PM

Tuesday, February 6, 2024

Show HN: Client/server 2d "defender" game in TypeScript https://ift.tt/3M29Xko

Show HN: Client/server 2d "defender" game in TypeScript https://ift.tt/cOHZrzd February 7, 2024 at 02:11AM

Show HN: Geppetto, an open source AI companion for your Slack teams https://ift.tt/HBeU8wC

Show HN: Geppetto, an open source AI companion for your Slack teams Our team just published Geppetto. Geppetto is an open source Slack App to use ChatGPT inside your workspace. It is written in Python and super easy to tinker and fork: - GitHub: https://ift.tt/T73OyjZ - Our first public release: https://ift.tt/oFQTewN https://ift.tt/T73OyjZ February 7, 2024 at 02:18AM

Show HN: Logdy.dev – web based logs viewer UI for local development environment https://ift.tt/oaH2jnD

Show HN: Logdy.dev – web based logs viewer UI for local development environment https://ift.tt/eMup3YL February 6, 2024 at 10:41PM

Monday, February 5, 2024

Show HN: Name That Nation https://ift.tt/5npxgUt

Show HN: Name That Nation I made this map game. react, material ui, hosted on vercel cdn, no back end. https://ift.tt/1zFs5Wh February 6, 2024 at 04:34AM

Show HN: CLI for generating beautiful PDF for offline reading https://ift.tt/Ldoqhri

Show HN: CLI for generating beautiful PDF for offline reading I've always thought that extensive reading was best suited for the realm of paper. As a result, I've created a command-line interface (CLI) tailored for my own use and decided to make it open source. I welcome any feedback you may have. https://ift.tt/ImcBwys February 6, 2024 at 12:54AM

Show HN: Molerat – Small Web Protocol https://ift.tt/QBgJyG1

Show HN: Molerat – Small Web Protocol https://ift.tt/PMhtaUp February 5, 2024 at 11:46PM

Sunday, February 4, 2024

Show HN: Hookdeck Event Gateway https://ift.tt/Jhy0GrS

Show HN: Hookdeck Event Gateway Hey HN, I’m Maurice, co-founder and CTO at Hookdeck. You might remember Alex and me from August 4th, 2021, when we announced Hookdeck as a “webhook infrastructure”. https://ift.tt/FAvG4kU Since then, a lot has happened. We’ve onboarded thousands of developers, and through their cleverness, we eventually realized that we could serve a much bigger purpose in your event-driven stack. Events are now essential to system interoperability since we spend just as much time building as integrating software nowadays, so we’ve dedicated the last six months to building an Event Gateway. Yeah, I know, that’s a new concept. So, let’s define it - The Event Gateway is a superset of our original work on webhooks management. It builds on that foundation to introduce support for more use cases, such as outbounds webhooks, integrating 3rd party providers, asynchronous APIs (IoT, SDKs, customer-facing endpoints), and more. It’s an API gateway for asynchronous, stateful, and interoperable requests. Hookdeck handles security (handshake and signature verification), spike protection, queuing, observability, alerting, logs, transformations, filtering, and much more. I wrote a blog post introducing the event gateway: https://ift.tt/Am7LFoE I am eager to read your thoughts and feedback. Let’s find out what we can do better. https://hookdeck.com February 4, 2024 at 11:57PM

Show HN: Weekend art project Voronoi Virus https://ift.tt/KhRWE0U

Show HN: Weekend art project Voronoi Virus https://ift.tt/1OG2lD3 February 4, 2024 at 06:12PM

Show HN: Aidely is AI powered thread; AI and Humans cooperation https://ift.tt/qYVd1nj

Show HN: Aidely is AI powered thread; AI and Humans cooperation Discover the future of community engagement with our iPhone app. Seamlessly blend human creativity with AI prowess as users collaborate with ChatGPT, Bard, and Llama to spark captivating threads and discussions. Join us in shaping the next frontier of content creation. https://www.aidely.io/ February 4, 2024 at 08:18PM

Show HN: USD 0.99/TB/month cloud storage https://ift.tt/dHecE7J

Show HN: USD 0.99/TB/month cloud storage https://ift.tt/HEr3cwx February 4, 2024 at 07:46PM

Show HN: Unofficial Google Lens OCR API https://ift.tt/wAOETys

Show HN: Unofficial Google Lens OCR API Default OCR in ShareX is pretty bad, so I reverse-engineered Lens API and made a library to call unofficial Lens API and made a script for ShareX to OCR the captured region. URL points to library I've made, there's a tutorial for ShareX in separate file: https://ift.tt/BVhs3pl... https://ift.tt/9rV3pic February 4, 2024 at 05:38PM

Saturday, February 3, 2024

Show HN: A Python PDF Form Library https://ift.tt/H8iVNWf

Show HN: A Python PDF Form Library Hi HN! I have a project that I have been working on for three years that I’d love to show you today called PyPDForm ( https://ift.tt/vYz4puk ). It is a Python library that specializes in processing PDF forms, with the most outstanding feature being programmatically filling a PDF form by simply feeding a Python dictionary. I used to work at a startup company with Python as our backend stack. We were constantly given paper documents by our clients that we needed to generate into PDFs. We were doing it using reportlab scripts and I quickly found the process tedious and time consuming for more complex PDFs. This is where the idea of this project came from. Instead of writing lengthy and unmaintainable reportlab scripts to generate PDFs, you can just turn any paper document into a PDF form template and PyPDFForm can fill it easily. On top of the GitHub repo, here are some additional resources for this project: PyPi: https://ift.tt/zy8goG4 Docs: https://ift.tt/mwBNtkP A public speak I did about this project: https://www.youtube.com/watch?v=8t1RdAKwr9w I hope you guys find the library helpful for your own PDF generation workflow. Feel free to try it, test it, leave comments or suggestions, and open issues. And of course if you are willing, kindly give me a star on GitHub. https://ift.tt/vYz4puk February 4, 2024 at 06:55AM

Friday, February 2, 2024

Show HN: HN stories categorized by topics of interest https://ift.tt/suO98GP

Show HN: HN stories categorized by topics of interest Hi HN, TL;DR: I made a website that takes all the top stories on HN and categorizes them into one of ten topics of interest using LangChain and GPT-4. I've always liked the idea of getting a personal mix of news tailored to my interests. Hacker News is pretty close to that ideal, which is reflected in the time I spend on here every day. It's a great mix, but playing with LangChain, I got the idea for a weekend project. Roughly speaking most posts on HN fall into one of these categories: * Programming, Software & Computer Science * AI, Data Science & Analytics * Business & Entrepreneurship * Science & Research * Cybersecurity & Digital Safety * Design, User Experience & Creativity * Finance & Economics in Tech * Work Culture & Career Development * Media, Content & Communication * General & Diverse Interests Using langchain with GPT-4 Turbo (JSON mode), I sort every top submission into one of the above. Costs for using the API are currently at 15$ a month, with few optimizations. For ease of use I also added the possibility to consume the news topics via a responsive webpage, RSS and email digest (daily,weekly,monthly). https://hnto.pics/ February 3, 2024 at 03:30AM

Show HN: Nibble https://ift.tt/fN5T23V

Show HN: Nibble An attempt at a single pass LLVM frontend in ~3000 lines of C without external dependencies, malloc, or an AST. Included are...