Course curriculum

    1. How to use this course

      FREE PREVIEW
    2. Setup

    3. Project examples

    4. Before we begin ...

    1. Agenda

      FREE PREVIEW
    2. Why: Motivation

    3. Pre-serving

    4. Custom web server

    5. Inference server

    6. Practice

    7. Practice submission

    8. Practice implementation example

    9. Takeaways

    10. Feedback

    1. Agenda

      FREE PREVIEW
    2. Why: Motivation

    3. Serving platforms

    4. Serving patterns

    5. Serving LLMs

    6. Practice

    7. Practice submission

    8. Practice implementation example

    9. Takeaways

    10. Feedback

    1. Congrats! Here's what's next...

    2. Before you go...

About this course

  • 26 lessons
  • 1 hour of video content
  • > 5 hours a week
  • 8 weeks

Bring your ML journey to the next level!

FAQ

  • How is this mini-course related to the full "Machine Learning in Production" course?

    This mini-course, "Serving Basics," is the fifth module of the comprehensive "Machine Learning in Production" course. While the full course covers all aspects of deploying and managing ML projects end-to-end, this mini-course focuses specifically on the deployment and serving of ML models or LLMs. Mastering serving is crucial for making your models available and performant in production environments, making it an essential component for anyone aiming to operationalize ML models effectively.

  • Who is this course for?

    This mini-course is ideal for software engineers, ML professionals, tech switchers, and tech entrepreneurs interested in mastering the deployment and serving of ML models or LLMs using custom web servers and inference servers.

  • What will I achieve by the end of this mini-course?

    By the end of this course, you'll have a solid foundation in deploying and serving ML models using techniques such as pre-serving, custom web servers, and inference servers like TensorFlow Serving and TorchServe. You will also gain knowledge in serving platforms, patterns, and specific considerations for serving LLMs.

  • What prerequisites do I need?

    A basic understanding of programming concepts and familiarity with machine learning principles are recommended to get the most out of this course.

  • How long is the course?

    The course is designed to be completed in approximately 1.5 weeks. However, you can pace yourself according to your schedule and learning preferences.

  • Will I lose access to the course materials after finishing it?

    No, you have access to the course materials indefinitely.

  • How much time will I need to dedicate to the course each week?

    The course can be completed in about 8 hours total. You can choose to spread this out over a week or complete it in a single session, depending on your availability and learning style.

  • What if I get stuck on my homework?

    Don't worry! Our course comes with a supportive Discord community channel dedicated to homework help. Here, you can ask questions, seek guidance, and collaborate with fellow learners and the instructor. Additionally, for each practice task, we provide an example implementation to guide you through the process.

  • What if I want to enroll in the full course after this one? Will I get any discount?

    Yes, upon completing this course, you will receive a 70$ discount on the full "Machine Learning in Production" course. Use your email address as the promo code at checkout.