Quantization Fundamentals with Hugging Face
Generative AI models, like large language models, often exceed the capabilities of consumer-grade hardware and are expensive to run. Compressing models through...
By Younes Belkada on Coursera
About This Course
Generative AI models, like large language models, often exceed the capabilities of consumer-grade hardware and are expensive to run. Compressing models through methods such as quantization makes them more efficient, faster, and accessible. This allows them to run on a wide variety of devices, including smartphones, personal computers, and edge devices, and minimizes performance degradation. Join this course to: 1. Quantize any open source model with linear quantization using the Quanto library. 2. Get an overview of how linear quantization is implemented. This form of quantization can be applied to compress any model, including LLMs, vision models, etc. 3. Apply “downcasting,” another form of quantization, with the Transformers library, which enables you to load models in about half their normal size in the BFloat16 data type. By the end of this course, you will have a foundation in quantization techniques and be able to apply them to compress and optimize your own generative AI models, making them more accessible and efficient.
Topics Covered
Frequently Asked Questions
How much does Quantization Fundamentals with Hugging Face cost?
Visit the Quantization Fundamentals with Hugging Face course page for current pricing and available discounts.
Who teaches Quantization Fundamentals with Hugging Face?
Quantization Fundamentals with Hugging Face is taught by Younes Belkada, DeepLearning.AI.
What skill level is Quantization Fundamentals with Hugging Face for?
This course is designed for beginner learners.
Similar Courses
HTML & CSS Coding for Beginners: Build your own portfolio!
Chris Dixon
Maya for Beginners: Animation
Lucas Ridley
JavaScript for Beginners (includes 6+ real life projects)
Kalob Taulien
Beginner Bootstrap 4: Hand code beautiful responsive websites fast
Chris Dixon