Github metadata for 78518 projects

Github metadata for 78518 projects

The projects were queried using the Github API, using the search qualifier stars:>=100 on 27.02.2017.

Dataset fields:
key = full Github project name (user ID/project ID)

If using our dataset, please refer to the following article under publication:
Zalán Bodó, Bipin Indurkhya. Software Categorization Using Low-Level Distributional Features. Accepted to SOMET 2017.