Introducing ETL Markup Toolkit (EMT)
TL;DR – I developed an open source toolkit for writing Spark-native ETL using configurations in a highly sub-scriptable and transparent...
What is “real data science” anyway? tl;dr: most data scientists at Facebook are business analysts, and that’s perfectly fine. One...
“Hey, you got chocolate in my peanut butter!” “You got peanut butter in my chocolate!” “Delicious!” So goes the old...
Using an open-source dataset, I’ve written up a Jupyter notebook below that explores the performance of several commonly used decision...
Visualization (viz) is an incredibly hot topic in the business analytics/data science (DS) world right now. In every job description,...
There’s a term in engineering called a “1% Solution”. 1% solutions solve a problem that only 1% of the population...