Number of data-science related repos on Github by year

#Number-of-data-science-related-repos-on-Github-by-year

Search API docs

Growth rate of number of repos on Github

#Growth-rate-of-number-of-repos-on-Github
Loading output library...
Loading output library...

Searching the API

#Searching-the-API

We search where we sort descending by the number of stars.

Now lets process the data

#Now-lets-process-the-data

Lets group the data by the number of stars - pandas will automatically group for us.

Median stars per repo per year

#Median-stars-per-repo-per-year

Default search results:

Loading output library...
Loading output library...
Loading output library...
Loading output library...

Popularity adjusted number of repos per year

#Popularity-adjusted-number-of-repos-per-year

Assumption:

Repos made every year are of the same quality and relevance - thus the average stars per repo should tend to equality as time goes on

Since older repos have more time to get more stars - it will scew the search results. Lets normalise then the number of repos per year by the average stars per year.

Loading output library...
Loading output library...
Loading output library...
Loading output library...
Loading output library...
Loading output library...

Google trends data

#Google-trends-data
Loading output library...
Loading output library...