
Darryl Fernandes
Senior Machine Learning Engineer
This Nanodegree program equips learners with advanced skills and practical experience in optimizing machine learning models for performance, scalability, and real-world application. Students will explore foundational principles and techniques such as quantization, pruning, and profiling, and apply these to both traditional machine learning models and large language models (LLMs). The program delves into advanced model compression methods, including low-rank compression and knowledge distillation, and covers the design of efficient architectures for hardware acceleration using tools like TensorRT and ONNX. Ultimately, graduates will be able to optimize inference pipelines for LLMs and deploy efficient models to meet specific performance and deployment requirements.

Subscription · Monthly
6 skills
6 prerequisites
Prior to enrolling, you should have the following knowledge:
You will also need to be able to communicate fluently and professionally in written and spoken English.
3 instructors
Unlike typical professors, our instructors come from Fortune 500 and Global 2000 companies and have demonstrated leadership and expertise in their professions:

Darryl Fernandes
Senior Machine Learning Engineer

Samantha Guerriero
AI Consultant

Rishabh Misra
Staff Machine Learning Engineer
Learn model optimization, compression, and hardware acceleration. Build scalable, efficient ML systems with hands-on projects in this Udacity program

Subscription · Monthly