Towards Interpretable Reliable Models

Posted on So 29 Oktober 2017 in conferences

I presented a keynote at PyData Warsaw on moving toward interpretable reliable models. The talk was inspired by some of the work I admire in the field as well as a fear that if we do not address interpretable models as a community, we will be factors in our own …


Continue reading

GDPR & You: My Talk at Cloudera Sessions München

Posted on Mi 11 Oktober 2017 in conferences

Unless you have been avoiding all news, you have likely heard of the coming changes in European privacy regulations which go into effect in May 2018. The changes are covered under the General Data Privacy Regulation Directive, whose final text was made available in May 2016.

I presented a talk …


Continue reading

Algorithmic Art and "Künstliche Kunst"

Posted on Sa 07 Oktober 2017 in conferences

I was invited to give a talk at 404 Dublin, a really cool conference joining community groups w/ tech folks and art installations. When thinking of what topics might be of interest to the audience, I selfishly went to one of my (side) passions.. following artists who are doing amazing …


Continue reading

Comparing scikit-learn Text Classifiers on a Fake News Dataset

Posted on Mo 28 August 2017 in research

Finding ways to determine fake news from real news is a challenge most Natural Language Processing folks I meet and chat with want to solve. There is significant difficulty in doing this properly and without penalizing real news sources.

I was discussing this problem with Miguel Martinez-Alvarez on my last …


Continue reading

Data Unit Testing: EuroPython Tutorial

Posted on Fr 14 Juli 2017 in trainings

I gave a long and opinionated tutorial at EuroPython 2017 about how we should do unit testing and validation within a data science scope. The GitHub repository for the course (which is part of my O'Reilly Live Online training) is https://github.com/kjam/data-cleaning-101. I will continue editing and …


Continue reading

if Ethics is not None

Posted on Fr 14 Juli 2017 in conferences

This past Wednesday, I had the pleasure of giving a keynote at EuroPython 2017. I covered a historical view of ethics in computing. The slides are shared here, but it was also recorded so I will post a video when it is available. (Updated: video added!)

In addition, a series …


Continue reading

Practical Data Cleaning with Python Resources

Posted on Mi 03 Mai 2017 in trainings

Practical Data Cleaning Resources

(O'Reilly Live Online Training)

This week I will be giving my first O'Reilly Live Online Training via the Safari platform. I'm pretty excited to share some of my favorite data cleaning libraries and tips for validating and testing your data workflows.

This post hopes to be …


Continue reading

PyData Amsterdam Keynote on Ethical Machine Learning

Posted on Fr 07 April 2017 in conferences

I was kindly asked by the PyData Amsterdam organizers to keynote the conference. As a passionate fan of ethical machine learning and the great research being done by data scientists and academics around the world -- I am very enthused to present the topic to the conference.

My slides are currently …


Continue reading

Ten Tips for First-Time Conference Speakers

Posted on Sa 11 Februar 2017 in conferences

The saddest moment for me at conferences is when I'm in the middle of an interesting conversation with a bright person and I ask her when her talk is and she says, "Who me?"

The number of folks I speak with every year at conferences who have amazing stories to …


Continue reading

The Practice of Programming: 18 Years Later

Posted on Fr 20 Januar 2017 in programming

Over the new year holiday time I had a chance to get away from it all, and snuck up to Finland to sit in a lodge on the Gulf of Finland, sip coffee, take saunas and read. I brought along a few books, the only programming one being Brian W …


Continue reading