GitHub metadata for 78518 projects

The projects were retrieved through the GitHub API with the search qualifier stars:>=100 on 27.02.2017.
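
As an illustration of the query described above, the following minimal sketch builds a repository-search URL for the stars:>=100 qualifier against the public GitHub search endpoint. The helper name build_search_url, the sort order, and the paging parameters are assumptions made for this example; how the original collection was paged or partitioned is not stated here.

```python
from urllib.parse import urlencode

# Public GitHub repository-search endpoint.
API_ROOT = "https://api.github.com/search/repositories"


def build_search_url(min_stars=100, page=1, per_page=100):
    """Hypothetical helper: return a search URL for repositories
    with at least `min_stars` stars."""
    params = {
        "q": f"stars:>={min_stars}",  # the search qualifier named in the text
        "sort": "stars",              # assumption: order results by star count
        "order": "desc",
        "per_page": per_page,         # assumption: page size chosen for the example
        "page": page,
    }
    # urlencode percent-encodes ':', '>' and '=' inside the qualifier.
    return f"{API_ROOT}?{urlencode(params)}"


if __name__ == "__main__":
    print(build_search_url())
```

The sketch only constructs the URL; sending the request, authenticating, and handling rate limits are left out.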

Dataset fields:
key = full GitHub project name (user ID/project ID)
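
Since the key combines the user ID and the project ID with a slash, it can be split back into its two parts. The sketch below shows one way to do that; the function name split_key and the sample key are hypothetical and are not taken from the dataset.

```python
def split_key(key):
    """Hypothetical helper: split a 'user/project' key into (user, project)."""
    user, sep, project = key.partition("/")
    # Reject keys that lack the separator, have an empty part,
    # or contain more than one slash.
    if not sep or not user or not project or "/" in project:
        raise ValueError(f"not a 'user/project' key: {key!r}")
    return user, project


if __name__ == "__main__":
    # Illustrative key only.
    print(split_key("octocat/Hello-World"))
```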

If you use our dataset, please cite the following article:

Zalán Bodó, Bipin Indurkhya. Software Categorization Using Low-Level Distributional Features. New Trends in Intelligent Software Methodologies, Tools and Techniques. (Proceedings of the 16th International Conference on Intelligent Software Methodologies, Tools, and Techniques, September 26-28, Kitakyushu, Japan.) Frontiers in Artificial Intelligence and Applications, vol. 297, IOS Press, 2017, pp. 88-98.