Tuesday, July 23, 2024

Show HN: Finetune Llama-3.1 2x faster in a Colab https://ift.tt/ojHv43E

Just added Llama-3.1 support! Unsloth https://ift.tt/oCArEX3 makes finetuning Llama, Mistral, Gemma & Phi 2x faster and uses 50 to 70% less VRAM, with no accuracy degradation. There's a custom backprop engine which reduces actual FLOPs, and all kernels are written in OpenAI's Triton language to reduce data movement. There's also a 2x faster inference-only notebook in a free Colab! https://ift.tt/A4QLxbC... https://ift.tt/GhqBKvy July 24, 2024 at 02:00AM

Show HN: WebGPU enables local LLM in the browser – demo site with AI chat https://ift.tt/3P1DTwH

Browser LLM demo working on JavaScript and WebGPU. WebGPU is alrea...