I'm super impressed that LLMs are scaling down and WebGPU is improving to the point that you can run a reasonable LLM in the browser! https://simonwillison.net/2023/Apr/16/web-llm/