This is an autopost blog, friends. We try to bring you the latest sports, news, and new updates.
Tuesday, December 26, 2023
Show HN: Made a batching LLM API for a project. Mistral 200 tk/s on RTX 3090 https://ift.tt/pDsLI9o
I was running into a vLLM bug that affected multi-GPU setups, and I needed a stand-in while it was being fixed: something that used the same API format but performed better than the API in text-generation-webui. It's very rough, and I'm not a coder by trade, but it's very fast once you have many simultaneous connections. https://ift.tt/OGxLYCk December 27, 2023 at 01:22AM
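To give a rough idea of why batching pays off with many simultaneous connections, here is a minimal sketch of request micro-batching with `asyncio`. It is not the code from the linked project: `MicroBatcher`, `fake_generate`, `max_batch`, and `max_wait` are hypothetical names chosen for illustration, and the "model" is a stand-in that just uppercases its prompts. The idea is that concurrent requests are collected for a short window and passed to the model in a single call, so one forward pass serves several clients.

```python
import asyncio


class MicroBatcher:
    """Collects concurrent requests and hands them to batch_fn in one call."""

    def __init__(self, batch_fn, max_batch=8, max_wait=0.01):
        self.batch_fn = batch_fn    # hypothetical: list[str] -> list[str]
        self.max_batch = max_batch  # largest batch passed to batch_fn
        self.max_wait = max_wait    # seconds to wait for more requests
        self.queue = asyncio.Queue()

    async def submit(self, prompt):
        # Each caller gets a future that is resolved when its batch finishes.
        fut = asyncio.get_running_loop().create_future()
        await self.queue.put((prompt, fut))
        return await fut

    async def run(self):
        loop = asyncio.get_running_loop()
        while True:
            # Block until at least one request arrives.
            items = [await self.queue.get()]
            deadline = loop.time() + self.max_wait
            # Keep collecting until the batch is full or the window closes.
            while len(items) < self.max_batch:
                remaining = deadline - loop.time()
                if remaining <= 0:
                    break
                try:
                    items.append(
                        await asyncio.wait_for(self.queue.get(), remaining)
                    )
                except asyncio.TimeoutError:
                    break
            # Assumption: batch_fn is fast here. A real GPU call would block
            # the event loop and should run in an executor instead.
            outputs = self.batch_fn([prompt for prompt, _ in items])
            for (_, fut), out in zip(items, outputs):
                fut.set_result(out)


batch_sizes = []


def fake_generate(prompts):
    # Stand-in for a batched model call; records how many prompts it received.
    batch_sizes.append(len(prompts))
    return [p.upper() for p in prompts]


async def demo():
    batcher = MicroBatcher(fake_generate, max_batch=4, max_wait=0.05)
    worker = asyncio.create_task(batcher.run())
    results = await asyncio.gather(
        *(batcher.submit(f"prompt {i}") for i in range(4))
    )
    worker.cancel()
    return results


results = asyncio.run(demo())
print(results, batch_sizes)
```

With a single connection the batcher adds up to `max_wait` of latency and gains nothing. The benefit only shows up when several requests arrive within the same window, which matches the behaviour described in the post.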