Next Steps for Machine Learning Mastery

Next Steps

Chapter 7 of 7 in the Data Science Primer

Welcome to the last chapter of our 7-part mini-course on data science and machine learning.

We've really cover a lot of ground.

  1. In Chapter 1, you saw a bird's-eye view of the entire machine learning workflow.
  2. Then, in Chapter 2, you learned our framework for fast, efficient, and decisive exploratory analysis.
  3. Chapter 3 was all about data cleaning, which is perhaps the most important step of all.
  4. Next, in Chapter 4, we shared our favorite heuristics for feature engineering.
  5. In Chapter 5, we discussed regularization and ensembles, and you learned about 5 algorithms that leverage those mechanisms.
  6. And in Chapter 6, we walked through a proven formula for training excellent models after the other steps have been completed correctly.

In this guide, we'll give you our best recommendations for where to go from here, including how to transform these concepts into invaluable, practical skills that can significantly impact your career.

You have two options...

Option #1

The first is to go at it alone, and begin applying these skills to projects that interest you.

The step-by-step blueprint you've learned will give you a huge head-start. But strike while the iron is hot! Pick a topic, find a dataset, and start practicing.

For tools, we strongly recommend the Python stack, including the following libraries:

  • NumPy for efficient numerical computations.
  • Pandas for data management.
  • Scikit-Learn for algorithms and model training.
  • Seaborn for easy/common visualizations.
  • Matplotlib to customize visualizations.

After you've mastered the core workflow, you can use the rest of this lesson as guideposts for continued study.

Our #1 tip for self-study is to skip the textbooks and jump into projects ASAP because it's much faster to learn in context, i.e. "learning by doing."

Plus, it will be easier to stay motivated and continue progressing.

Option #2

The second option is to allow us to provide over-the-shoulder mentorship.

We've lovingly crafted a Machine Learning Masterclass just for you. Hundreds of other happy students have been through the course, and the satisfaction rate is superb.

Let Us Help

Option #2: Let us help!

Of course, you may be wondering, if we've already given you the complete blueprint, what's there left to cover in the masterclass?

Well, quite simply, there's a lot that we couldn't squeeze into these 7 days... Here are your next steps for mastering machine learning:

Step 1: Learn Python

The first one is obvious. We couldn't include source code in this crash course because everyone comes from different experience levels.

However, thanks to our comprehensive Python Crash Course, you'll learn all of the programming skills needed for machine learning, even if you've never programmed before.

MLMC Contents

Masterclass Contents

In fact, we offer two versions of Python crash course so you can pick the one most suitable for your situation.

  • One for students experienced in other languages - you'll get a concise overview of Python syntax and best practices.
  • One for beginners with little to no coding experience - you'll get a detailed intro to Python and general programming concepts.

Both versions cover NumPy and Pandas, two essential libraries for data science:

  • 1

    Module 1: Python Basics

    First, we'll introduce the basics of Python, including data types, operations, and variables.

  • 2

    Module 2: Data Structures

    Then, we'll dive into data structures, Python objects that can store and organize other objects.

  • 3

    Module 3: Flow and Functions

    Next, we'll cover flow control and functions. Flow control allows conditional logic and functions keep your code clean and modular.

  • 4

    Module 4: NumPy

    NumPy is the premier Python library for numerical computing and many other data science libraries are built on top of NumPy.

  • 5

    Module 5: Pandas

    Finally, we'll learn all about Pandas, which will probably be your most used package because it helps you store, manipulate, and explore data.

Step 2: Clarify Essential Theory

Right from Project 1, you'll learn about Model Complexity, which is what we consider to be the "heart" of machine learning.

MLMC Contents

Masterclass Contents

And that's no exaggeration... This is perhaps the single most important concept that couldn't fit into this crash course.

Why? Well, for the simple reason that it's best taught alongside illustrative code examples.

We promise that this one vital concept will provide you so much clarity about the big picture of machine learning.

Overfit Polynomial

In addition, you'll find gentle, practical explanations of other essential concepts, including:

  • Mapping functions as a surprisingly effective learning tool.
  • Causes of overfitting and how to deal with it in diverse situations.
  • Cost functions and how they tie into important mechanisms such as regularization.
  • Classification algorithms and their key differences from their regression counterparts.
  • Clustering algorithms and the additional step you should take before training them.
  • And much more...

Everything is illustrated with code, examples, and visualizations.

K-Means

Step 3: Master Core Skills

Next, it's time to practice the core machine learning workflow, including:

  1. Exploratory Analysis
  2. Data Cleaning
  3. Feature Engineering
  4. Algorithm Selection
  5. Model Training

These are the fundamental building blocks you'll need for almost any project, and it's important to really master these skills.

In our masterclass, you'll progress on to carefully planned, end-to-end projects that will allow you to hone each of these skills in context.

MLMC Contents

Masterclass Contents

Plus... EACH project includes:

Each Project Includes
Each Project Includes
Each Project Includes

The learning curve is carefully planned so that you'll be challenged, but not frustrated.

In fact, the masterclass is structured after over-the-shoulder, on-the-job mentorship that gently yet quickly brings you up to speed.

Step 4: Build Situational Skills

In this crash course, we've covered the 5 core steps of applied machine learning.

However, there are also several important situational steps. When you master these steps, you'll open many more possibilities.

  • For example, some of the most interesting applications require you to creatively restructure the data first (Data Wrangling).
  • In addition, correctly transforming your features will boost your performance significantly (Preprocessing).
  • How to package your model into a script that can be called from a command line or run on the cloud (Project Delivery).
  • And others...

The 5 core steps glue everything together, and these situational steps will take your projects to the next level.

Just to be clear - Yes, you'll learn all of these skills in the masterclass!

Step 5: Practice Making Decisions

As you may have noticed, applied machine learning requires dozens of decisions for each step (i.e. a skilled chef).

  • When have you done enough exploratory analysis?
  • How can you spot potential fixes you'll need to make?
  • Do your visualizations suggest good features to engineer?
  • What are the most impactful hyperparameters to tune?
  • When should you pre-process your features?
  • Which steps should go inside the cross-validation loop?
  • Which performance metrics should you use?
  • And so on...

In the masterclass, you'll learn simple approaches for tackling these decisions, and you'll get plenty of practice making them.

Scatterplot Example

Step 6: Develop Advanced Skills

To train professional-grade machine learning models in the shortest time possible, you'll need to master a few additional advanced skills.

Our masterclass will teach each of these advanced skills, including:

  • How to fit Multi-Step Preprocessing Pipelines into cross-validation loops to ensure robust results.
  • How to deal with the Curse of Dimensionality.
  • How to implement Principle Component Analysis (PCA) and how to interpret your components.
  • How to deal with Unbalanced Classes.
  • How to use Probability Thresholds and ROC Curves to improve your classification models.
  • How to perform Multi-Layer Groupbys for data wrangling.
  • How to make Advanced Visualizations for presentations and reports.
  • And much more...

Best of all, you'll learn each of these important skills in context, which makes them easier understand, retain, and place into the big picture.

Step 7: Reinforce Key Concepts

By now, you'll have learned concepts for data science and machine learning.

You'll have a great understanding of where each piece fits into the big picture, and you'll have developed very valuable, practical skills.

This is a great time to circle back and review all that you've learned, truly solidifying all these skills and making them truly your own.

With the masterclass, review will be a breeze thanks to the free Companion E-Book.

Companion E-Book Screenshots

Conclusion

And that wraps up the 7-Part Data Science Primer mini-course!

By the way, even if you decide not to join the masterclass, you can use the list above as guideposts for further studying. You've learned the core blueprint, so you now have a very strong foundation to go forward.

The masterclass is meant for those who want over-the-shoulder mentorship and accelerated learning. If that sounds like something you'd be interested in, you can learn more about it here:

Learn more about the masterclass!

I was concerned whether the course would be accessible. But I found that it is very well thought out and the teaching strategy fit well with my learning style.

I've gotten a lot better at Python and I now know how to apply Machine Learning to business problems. I believe anyone interested in learning Machine Learning could definitely use this course as a door to accomplishing that.

Will M.
Will M. Analyst, USA

Before joining, I didn't know if this would really advance my knowledge of data science and machine learning to help get or start a job in the field.

So far, it has been really useful in learning applied machine learning. I recommend this course because the real world application of machine learning is hard to see, and this course cuts through the trees to see the forest.

Ryan T.
Ryan T. Entrepreneur and Data Analyst, USA
Learn more about the masterclass!
Share Data Science Primer