The First Rule of Machine Learning...


Data scientists love to jump straight to machine learning.

New problem? Throw data at a neural network. See what happens.

But there’s a foundational step that, in the right circumstances, can dramatically increase your chances of project success - and most data scientists skip right over it.

Mathematical modelling from first principles.

I’m talking about physics here.

Mass conservation. Energy conservation. Newton’s laws of motion.

The stuff you find in high school physics textbooks.

In this latest Value Boost episode of Value Driven Data Science, Dr Tim Varelmann joins me again to explain how building from first principles creates a low-cost scaffolding that makes your ML work more robust.

In just 11 minutes, you’ll learn:

  1. What mathematical modelling from first principles actually means [01:20]
  2. How to build modular models with different resolution levels [04:39]
  3. When to add machine learning to first principles models [08:18]
  4. The practical first step to incorporate this approach [09:23]

The first rule of machine learning? Don’t start with machine learning.

Listen now on Apple Podcasts or Spotify, or click the link below:

Episode 97: Mathematical Modelling as a Gateway to ML Success

Talk again soon,

Dr Genevieve Hayes

Data Science Impact Algorithm

Twice weekly, I share proven strategies to help data scientists get noticed, promoted, and valued. No theory — just practical steps to transform your technical expertise into business impact and the freedom to call your own shots.

Read more from Data Science Impact Algorithm

Around age 19, having just graduated high school, I got my first job in "data". I could barely use Excel and thought Python was something you'd find at the zoo, but a friend of the family hired me to tutor their teenaged son. My maths was good enough to help him get through high school maths. Around age 29, having just finished my PhD, I landed a role managing an insurance pricing and analytics team. I'd never heard the term "machine learning" back then, but I had spent a good chunk of the...

You’re already using optimisation every day as a data scientist - but you probably don’t think about it. Every time you train a machine learning model, you’re running an optimisation algorithm to find the best parameters. But there’s a second type of optimisation that most data scientists never even touch - and it’s where the real business value often lives. It’s called decision optimisation, and it can transform your ML predictions into actionable decisions. Here’s the difference: ML...

Data science exists to support decision making, but within any organisation, there is a hierarchy of decisions - low stakes, high volume decisions at the bottom; high stakes, low volume decisions at the top. Executives care most about the decisions at the top. There’s valuable work at the bottom of that hierarchy. But automating routine decisions is about clearing the path so executives can focus on the high-stakes decisions. It’s not about walking it with them. For years, data science has...