Version control on notebooks using pre-commit and Jupytext
Notebooks have a place and a time. They are suitable for sharing the insights of an exploratory data analysis, but not so convenient for collaborating with multiple people whilst having the notebook code under version control. Generally speaking notebooks do not promote good coding habits, for example because people tend to duplicate code by …
Initializing nested lists correctly
Regular expressions with optional starting or ending groups
Stemming and lemmatizing with sklearn vectorizers
Applying operations on grouped dataframes in Pandas
Flatten nested lists with a list comprehension
Wrong feature preprocessing is a source of train-test leakage
Masking with Boolean arrays in Numpy
Site update: Breadcrumbs, taxonomies, paginators
Hugo template snippets of new website features
Vacancy Recommender Hackaton with Spark
Getting a grip on programmer jargon (by Joran Welling)
See archives for more ...