Tuesday, February 18, 2025

Show HN: A GPU-accelerated binary vector index https://ift.tt/1eKhHJM

This is a vector index I built that supports insertion and k-nearest neighbors (k-NN) querying, optimized for GPUs. It operates entirely in CUDA and can process queries on half a billion vectors in under 200 milliseconds. The codebase is structured as a standalone library with an HTTP API for remote access. It’s intended for high-performance search tasks such as similarity search, AI model retrieval, or reinforcement learning replay buffers. The codebase is located at https://ift.tt/OrD5IbR .

https://ift.tt/s5bpAZV February 17, 2025 at 06:15AM
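To make the insert and k-NN operations concrete, here is a minimal CPU-only sketch in Python. It is not the project's code or API: the class name `BinaryVectorIndex`, its methods, and the choice of brute-force Hamming distance are all assumptions made for illustration, since the post does not say which distance metric or search strategy the library uses. Each binary vector is stored as a Python integer whose bits are the vector's components.

```python
import heapq


class BinaryVectorIndex:
    """Hypothetical stand-in for a binary vector index (not the project's API)."""

    def __init__(self, dim):
        self.dim = dim          # number of bits per vector
        self.vectors = []       # each vector is an int used as a bit field

    def insert(self, bits):
        """Store a vector and return its integer id."""
        if bits < 0 or bits >> self.dim:
            raise ValueError("vector does not fit in %d bits" % self.dim)
        self.vectors.append(bits)
        return len(self.vectors) - 1

    def query(self, bits, k):
        """Return the k nearest stored vectors as (distance, id) pairs.

        Assumption: distance is Hamming distance. XOR leaves a 1 wherever
        two vectors differ, and counting those 1 bits gives the distance.
        """
        scored = (
            (bin(stored ^ bits).count("1"), vec_id)
            for vec_id, stored in enumerate(self.vectors)
        )
        # Brute-force scan over every vector, keeping the k smallest.
        return heapq.nsmallest(k, scored)


index = BinaryVectorIndex(dim=8)
for vector in (0b00000000, 0b11110000, 0b11111111, 0b00000001):
    index.insert(vector)

nearest = index.query(0b00000011, k=2)
print(nearest)
```

A GPU version of the same idea would typically run the XOR and popcount for many stored vectors in parallel and then select the top k, but how this particular project organizes that work is not described in the post.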


Show HN: Open-sourced my prompt management tool for LLM-powered apps https://ift.tt/4vClhqD

https://ift.tt/9wY0mk7 August 3, 2025 at 01:42AM