written by Eric J. Ma on 2018-11-13 | tags: data science insight data science
There's a quote by John Tukey that has been a recurrent theme at work.
It's better to solve the right problem approximately than to solve the wrong problem exactly.
Continuing on the theme of quoting two Georges:
All models are wrong, but some are more useful than others.
H/T Allen Downey for pointing out that our minds think alike.
I have been working on a modelling effort for colleagues at work. There were two curves involved, and the second depended on the first one.
In both cases, I started with a simple model, and made judgment calls along the way as to whether to continue improving the model, or to stop there because the current iteration of the model was sufficient enough to act on. With first curve, the first model was actionable for me. With the second curve, the first model I wrote clearly wasn't good enough to be actionable, so I spent lots more rounds of iteration on it.
But wait, how does one determine "actionability"?
For myself, it has generally meant that I'm confident enough in the results to take the next modelling step. My second curves depended on the first curves, and after double-checking multiple ways, I thought the first curve fits, though not perfect, were good enough when applied across a large number of samples that I could instead move on to the second curves.
For others, particularly at my workplace, it generally means a scientist can make a decision about what next experiment to run.
Going through Insight Data Science drilled into us an instinct for developing an MVP for our problem before going on to perfect it. I think that general model works well. My project's final modelling results will be the result of chains of modelling assumptions at every step. Documenting those steps clearly, and then being willing to revisit those assumptions, is going always a good thing.
@article{
ericmjl-2018-solving-actionably,
author = {Eric J. Ma},
title = {Solving Problems Actionably},
year = {2018},
month = {11},
day = {13},
howpublished = {\url{https://ericmjl.github.io}},
journal = {Eric J. Ma's Blog},
url = {https://ericmjl.github.io/blog/2018/11/13/solving-problems-actionably},
}
I send out a newsletter with tips and tools for data scientists. Come check it out at Substack.
If you would like to sponsor the coffee that goes into making my posts, please consider GitHub Sponsors!
Finally, I do free 30-minute GenAI strategy calls for teams that are looking to leverage GenAI for maximum impact. Consider booking a call on Calendly if you're interested!