33: Katharine Jarmul - Testing in Data Science
0:00 -:--
Speed
++++++
PermalinkShare linkShare link with timestamp
November 30, 2017
A discussion with Katharine Jarmul, aka kjam, about some of the challenges of data science with respect to testing. Some of the topics we discuss: experimentation vs testing testing pipelines and pipeline changes automating data validation property based testing schema validation and detecting schema changes using unit test techniques to test data pipeline stages testing nodes and transitions in DAGs testing expected and unexpected data missing data and non-signals corrupting a dataset with noise fuzz testing for both data pipelines and web APIs datafuzz hypothesis testing internal interfaces documenting and sharing domain expertise to build good reasonableness intermediary data and stages neural networks speaking at conferences Special Guest: Katharine Jarmul.Sponsored By:Python Testing with pytest: 40% off eBook version now through Friday, Dec 1. Promo Code: turkeysale2017Test & Code Patreon Supporters: Thank you to Patreon SupportersLinks:@kjam on Twitter — Data Magic and Computer SorceryKjamistan: Data…