Data analysis

We have seen so far that the GitHub API provides interesting sets of information about the code repositories and metadata around the activity of its users around these repositories. In the following sections, we will analyze this data to find out which are the most popular repositories through the analysis of its descriptions and then drilling down to the watchers, forks, and issues submitted on the emerging technologies. Since, technology is evolving so rapidly, this approach could help us to stay on top of the latest trending technologies.

In order to find out what are the trending technologies, we will perform the analysis in a few steps:

  • Detect the most trending topics/technologies based on descriptions
  • Identify the most popular programming languages globally
  • Find out what programming languages are used for the top technologies
  • What are the differences between technologies in terms of repository size, open issues, number of forks, and watchers
  • See what are the most popular projects and top technology in 2017
..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset