A Tool for Collaborative and Reproducible Data Science

We've spent the last few months designing a system to improve collaboration, reproducibility and presentation - an all-inclusive tool that optimizes the entire workflow of a data scientist. We've spoken to hundreds of data scientists over the last year gathering feedback, and we are now excited to bring to you the latest version of Kyso.

Think of it like GitHub, but specifically for data science.

The result is a tool to run, publish and share Jupyter notebooks: somewhere you can build upon the courses and projects you've completed with DataCamp and create your own data science portfolio. It's a free tool to showcase and share your work, get feedback and find cool & interesting new projects.

For a more comprehensive guide, check out our announcement, Introducing Kyso 2.0, posted alongside our latest release. For now, here is a quick summary of what the platform offers:

  • Free JupyterLab workspaces to start and run notebooks.
  • Blog-style rendering of these notebooks with the option to show or hide your code.
  • A custom JupyterLab extension that allows users to publish to their Kyso profile from any JupyterLab environment.
  • A profile page well suited to building and hosting your data science portfolio.
  • Simple discovery for finding cool new projects to fork into your own workspace.
  • And many more features coming soon!

I felt the best way to demonstrate the mechanics and purpose of the platform would be to actually publish an example study with some cool and interesting data visualizations, as you'll see below. I've uploaded two cool datasets to play around with, but there is far more depth to the data than what I've plotted here. Sign up for free, fork this study (along with the attached data files) into your own JupyterLab environment on Kyso, extend the analysis & come up with some cool visualizations yourself. When you're ready, you can simply re-publish!

Note that I have set the code to hidden for this study, but you can toggle that at the top right-hand side!

Modern Slavery

The Global Slavery Index publishes a report each year with information on modern slavery. It covers forms of exploitation such as forced labour and human trafficking, the factors that make people vulnerable to them, government responses, and products in global supply chains that are at risk of being produced by modern slavery. I've uploaded the 2018 findings and generated some simple plots using plotly.

These interactive plots render nicely on Kyso!
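
Since the code is hidden by default, here is a minimal sketch of how a world map like this could be built with plotly's `graph_objects` API. It is not the exact code behind this post: the function names are hypothetical, and the ISO-3 codes and counts are made-up placeholders rather than values from the Global Slavery Index file.

```python
# Placeholder rows: (ISO-3 code, country name, estimated number of victims).
# These values are invented for illustration and are NOT from the 2018 report.
SAMPLE_ROWS = [
    ("AAA", "Country A", 1000),
    ("BBB", "Country B", 250),
    ("CCC", "Country C", None),  # missing estimate
]


def to_plot_columns(rows):
    """Drop rows without a value and split the rest into parallel lists."""
    kept = [row for row in rows if row[2] is not None]
    return {
        "locations": [iso for iso, _, _ in kept],
        "text": [name for _, name, _ in kept],
        "z": [value for _, _, value in kept],
    }


def make_choropleth(rows, title):
    """Build an interactive world map coloured by the value column."""
    import plotly.graph_objects as go  # lazy import: only needed for plotting

    cols = to_plot_columns(rows)
    fig = go.Figure(
        go.Choropleth(
            locations=cols["locations"],  # ISO-3 country codes
            z=cols["z"],                  # value that drives the colour
            text=cols["text"],            # country name shown on hover
            locationmode="ISO-3",
            colorscale="Reds",
            colorbar_title="Estimated victims",
        )
    )
    fig.update_layout(title_text=title)
    return fig
```

In a notebook cell, something like `make_choropleth(rows, "Modern slavery, 2018").show()` should display the map once `rows` holds real ISO-3 codes and counts from the uploaded file. The placeholder codes above will not match any country.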

One way to improve on the map above would be to plot the prevalence of modern slavery in each country, that is, the numbers above expressed as percentages of national populations.
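
As a rough sketch of that idea, prevalence is just the estimated number of victims divided by the population, times 100. The helper name and the figures below are assumptions for illustration only. They are not taken from the dataset, and the real column names will differ.

```python
def prevalence_percent(victims, population):
    """Return victims as a percentage of the national population."""
    if population <= 0:
        raise ValueError("population must be positive")
    return 100.0 * victims / population


# Invented placeholder figures: country -> (estimated victims, population).
SAMPLE = {
    "Country A": (1000, 200000),
    "Country B": (250, 10000),
}

prevalence = {
    name: prevalence_percent(victims, population)
    for name, (victims, population) in SAMPLE.items()
}

# Rank countries from highest to lowest prevalence.
ranked = sorted(prevalence.items(), key=lambda item: item[1], reverse=True)
```

With these placeholder numbers, Country B has fewer victims in absolute terms but a higher prevalence (2.5% against 0.5%), which is exactly the kind of difference a raw-count map hides.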

The Global Slavery Index Vulnerability Model maps 23 risk variables across five major dimensions, and assigns each country a score on each dimension based on these variables. One of the five dimensions that naturally has a part to play is Inequality, so let's map out the levels of global inequality.

The World's Religions

The World Religion Project aims to provide detailed information about religious adherence worldwide since 1945 and is hosted by Zeev Maoz, University of California-Davis, and Errol A. Henderson, Pennsylvania State University. It contains data about the number of adherents by religion in each of the states in the international system.

Pretty cool! How about generating a time series of the total number of adherents to all religions over time? I imagine we will see an overall decline in religiosity.
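
If you'd like to try this yourself, a minimal sketch of the aggregation might look like the following. The record layout, function name and numbers are all assumptions made up for illustration, not the actual columns or values of the World Religion Project file. One caveat: a raw total can keep rising simply because the population grows, so the sketch also computes adherents as a share of population, which is arguably the better measure of religiosity.

```python
from collections import defaultdict

# Invented placeholder records: (year, state, adherents, population).
SAMPLE_RECORDS = [
    (1945, "State A", 80, 100),
    (1945, "State B", 45, 50),
    (1950, "State A", 90, 120),
    (1950, "State B", 50, 60),
]


def totals_by_year(records):
    """Sum adherents and population per year across all states.

    Returns a list of (year, total_adherents, share_of_population),
    sorted by year.
    """
    adherents = defaultdict(int)
    population = defaultdict(int)
    for year, _state, count, pop in records:
        adherents[year] += count
        population[year] += pop
    return [
        (year, adherents[year], adherents[year] / population[year])
        for year in sorted(adherents)
    ]


series = totals_by_year(SAMPLE_RECORDS)
```

In the placeholder data the total rises from 125 to 140 while the share of the population falls, so the two series can tell different stories. Either one can be passed to plotly as the y-values of a line chart, with the years on the x-axis.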

That's it for this brief post, guys. There are, however, over 30 columns in the first dataset and over 70 in the second, so there is plenty of room for much deeper analysis. If you're new to plotly, there is a quick-fire guide here. Our explore page also lists the most recently published content if you'd like to discover other projects.

Try out the platform - feel free to reach out with feedback and/or ideas for future features - Kyso 2.0 is in beta & we take the feedback from our users very seriously. Don't hesitate to contact me directly at kyle@kyso.io.

Happy Coding!