2 weeks back Microsoft released the implementation of their 1 bit LLM transformer.
This can potentially change the world, by enabling LLM inference on CPU and mobile, without any internet connection.
In this hands-on video, you learn to run 1 bit LLM inference on your own laptop!