Leo Reads The Internet

Implementing HyperLogLog in Redshift and Tableau

My Take In an ideal world, data is stored and queried in as raw a form as possible. Materialization...

A Brief Overview of the Regime Shift Detection Methods

My Take The most persistent challenge in time series analysis is correcting adequately for changes in trend. Mean stationarity is...

Engineers Shouldn’t Write ETL: A Guide to Building a High Functioning Data Science Department

My Take StitchFix is one of those companies where the algorithm is the product. Anyone can sell you clothes, but...

Visualizing Matrix Factorization Using Self-Organizing Maps

My Take This is a more detailed discussion of the matrix factorization approach that is usually used in recommender systems....

Introduction to Recommender Systems in 2018

My Take Recommender systems are a great example of how machine learning can capture incremental benefits in the eCommerce industry....

Learning Market Dynamics for Optimal Pricing

My Take When it comes to building multi-level hierarchical models in a business context, there is a persistent tension between...

Linear regression implemented four different ways

My Take Linear regression is one of the most versatile and fundamental tools for statistical modeling. It forms the basis...

An overview of proxy-label approaches for semi-supervised learning

My Take The relative expense and unavailability of labelled datasets is a major detractor from the utility of supervised learning...

Uber’s Big Data Platform: 100+ Petabytes with Minute Latency

My Take When it comes to doing analytics and data science at scale, it seems like you can either have...