Understanding Covid-19 transmission: what the data suggests about how the...
Covid-19 is turning the world upside down right now. One thing that’s extremely important to understand, in order to fight it as effectively as possible, is how the virus spreads and especially how...
View ArticleThe work-from-home episode
Many of us have the privilege of working from home right now, in an effort to keep ourselves and our family safe and slow the transmission of covid-19. But working from home is an adjustment for many...
View ArticlePutting machine learning into a database
Most data scientists bounce back and forth regularly between doing analysis in databases using SQL and building and deploying machine learning pipelines in R or python. But if we think ahead a few...
View ArticleChanging our formulation of AI to avoid runaway risks: Interview with Prof....
AI is evolving incredibly quickly, and thinking now about where it might go next (and how we as a species and a society should be prepared) is critical. Professor Stuart Russell, an AI expert at UC...
View ArticleKeeping ourselves honest when we work with observational health care data
The abundance of data in healthcare, and the value we could capture from structuring and analyzing that data, is a huge opportunity. It also presents huge challenges. One of the biggest challenges is...
View ArticleGaussian Processes
It’s pretty common to fit a function to a dataset when you’re a data scientist. But in many cases, it’s not clear what kind of function might be most appropriate—linear? quadratic? sinusoidal? some...
View ArticleThe Grammar of Graphics
You may not realize it consciously, but beautiful visualizations have rules. The rules are often implict and manifest themselves as expectations about how the data is summarized, presented, and...
View ArticleCausal Trees
What do you get when you combine the causal inference needs of econometrics with the data-driven methodology of machine learning? Usually these two don’t go well together (deriving causal conclusions...
View ArticleProtecting Individual-Level Census Data with Differential Privacy
The power of finely-grained, individual-level data comes with a drawback: it compromises the privacy of potentially anyone and everyone in the dataset. Even for de-identified datasets, there can be...
View ArticleStein's Paradox
This is a re-release of an episode that was originally released on February 26, 2017.When you're estimating something about some object that's a member of a larger group of similar objects (say, the...
View ArticleConvolutional neural networks
This is a re-release of an episode that originally aired on April 1, 2018If you've done image recognition or computer vision tasks with a neural network, you've probably used a convolutional neural...
View ArticleAn interstitial word from Ben
A message from Ben around algorithmic bias, and how our models are sometimes reflections of ourselves.
View ArticleRacism, the criminal justice system, and data science
As protests sweep across the United States in the wake of the killing of George Floyd by a Minneapolis police officer, we take a moment to dig into one of the ways that data science perpetuates and...
View ArticleCriminology and data science
This episode features Zach Drake, a working data scientist and PhD candidate in the Criminology, Law and Society program at George Mason University. Zach specializes in bringing data science methods to...
View ArticleRock the ROC Curve
This is a re-release of an episode that first ran on January 29, 2017.This week: everybody's favorite WWII-era classifier metric! But it's not just for winning wars, it's a fantastic go-to metric for...
View ArticleThe Data Science Open Source Ecosystem
Open source software is ubiquitous throughout data science, and enables the work of nearly every data scientist in some way or another. Open source projects, however, are disproportionately maintained...
View ArticleProcella: YouTube's super-system for analytics data storage
This is a re-release of an episode that originally ran in October 2019.If you’re trying to manage a project that serves up analytics data for a few very distinct uses, you’d be wise to consider having...
View ArticleA Data Science Take on Open Policing Data
A few weeks ago, we put out a call for data scientists interested in issues of race and racism, or people studying how those topics can be studied with data science methods, should get in touch to come...
View ArticleA reality check on AI-driven medical assistants
The data science and artificial intelligence community has made amazing strides in the past few years to algorithmically automate portions of the healthcare process. This episode looks at two computer...
View ArticleSo long, and thanks for all the fish
All good things must come to an end, including this podcast. This is the last episode we plan to release, and it doesn’t cover data science—it’s mostly reminiscing, thanking our wonderful audience...
View Article
More Pages to Explore .....