Together you’ll learn better thanks to my workshop Data Science at the Command Line. Do you want to know more about this workshop? Curious how I can adapt it to your needs? Something else? Don’t hesitate to contact me.
Together you’ll learn better thanks to my workshop Data Science at the Command Line. Do you want to know more about this workshop? Curious how I can adapt it to your needs? Something else? Don’t hesitate to contact me.
The unix command line, although invented decades ago, is an amazing environment for efficiently performing tedious but essential data science tasks. By combining small, powerful, command-line tools (like parallel
, jq
, and csvkit
), you can quickly scrub and explore your data and hack together prototypes.
This hands-on workshop is based on the O’Reilly book Data Science at the Command Line, written by instructor Jeroen Janssens. You’ll learn how to build fast data pipelines, how to leverage R and Python at the command line, and how to quickly visualise data. No prior knowledge about the unix command line is required.
By the end of this workshop you will have a solid understanding of how to integrate the command line in your data science workflow. Even if you’re already comfortable processing data with, for example, R or Python, being able to also leverage the power of the command line can make you a more effective and efficient data scientist.
Day 1:
curl
cut
, paste
, grep
, and sed
jq
csvkit
pup
xmlstarlet
Day 2:
R
from the command lineParticipants are kindly requested to have the following items installed prior to the start of the workshop:
docker pull datasciencetoolbox/dsatcl2e
Stay up-to-date about new workshops, upcoming events, and other news about myself and Data Science Workshops.
Do you want to know more about this workshop? Curious how I can adapt it to your needs? Something else? Send an email to jeroen