Friday, April 14, 2023

Show HN: Example POC to run the Dolly V2 12B model in 8bit mode https://ift.tt/xJ28lHU

I banged this out today because nobody seems to have figured this out yet (and I wanted to play with this model). I was inspired by the fact that Simon Willison was struggling to get this running[0], so I wanted to take a crack at it! This is by no means "clean", but it is a baseline for others to hack from. Feel free to send over PRs and I'll land them! (I'd love it if somebody used this to glue the model into this webui[1].)

0: https://ift.tt/h0p53gH
1: https://ift.tt/sPeqfQu

https://ift.tt/PCLIRlQ

April 15, 2023 at 04:16AM
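For a rough sense of why 8-bit mode matters for a model this size, here is a back-of-the-envelope sketch of the memory needed for the weights alone. It is not taken from the linked POC. The parameter count is the nominal "12B" from the model name, and the helper name `weight_memory_gib` is made up for illustration. Activations, framework overhead, and any layers kept in higher precision are ignored, so real usage will be higher.

```python
def weight_memory_gib(num_params: float, bits_per_param: int) -> float:
    """Approximate memory for the model weights alone, in GiB."""
    bytes_total = num_params * bits_per_param / 8
    return bytes_total / 2**30


# Assumption: nominal parameter count taken from the "12B" in the model name.
DOLLY_V2_PARAMS = 12e9

fp16_gib = weight_memory_gib(DOLLY_V2_PARAMS, 16)  # 2 bytes per weight
int8_gib = weight_memory_gib(DOLLY_V2_PARAMS, 8)   # 1 byte per weight

print(f"fp16 weights: ~{fp16_gib:.1f} GiB")
print(f"int8 weights: ~{int8_gib:.1f} GiB")
```

Under these assumptions the 8-bit weights take half the space of the 16-bit ones, which is roughly the difference between needing more than 22 GiB and needing a little over 11 GiB for weights.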


Show HN: WebGPU enables local LLM in the browser – demo site with AI chat https://ift.tt/3P1DTwH

Browser LLM demo working on JavaScript and WebGPU. WebGPU is alrea...