Predicting Popularity of Open Source Projects Using Recurrent Neural Networks
GitHub is the largest open source development platform, with millions of repositories on a variety of topics. The number of stars received by a repository is often considered a measure of its popularity. The literature has linked a repository's star count to its number of forks, commits, followers, documentation size, and programming language. We extend prior studies in terms of both input features and algorithm. We define six features from GitHub events corresponding to development activities, and six additional features that incorporate the influence of users (followers and contributors) on project popularity into those development activities. We propose a time-series forecast model using Recurrent Neural Networks to predict the number of stars received in k consecutive days. We assess the performance of the proposed model with varying k (1, 7, 14, and 30 days) and with varying input features. Our analysis of the five most-starred repositories in the data visualization area shows that the error rate ranges between 19.76 and 70.57 across the projects. The best performing models use either the features from development activities only, or all of the features together.
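To make the forecasting setup concrete, the sketch below shows one way a daily series could be framed as supervised samples for a recurrent model. It is an illustration under stated assumptions, not the authors' pipeline: the function name `make_windows`, the `lookback` window length, and the choice to use the total stars over the next k days as the target are all hypothetical, since the abstract does not specify them.

```python
from typing import List, Sequence, Tuple


def make_windows(
    daily_features: Sequence[Sequence[float]],
    daily_stars: Sequence[int],
    lookback: int,
    k: int,
) -> List[Tuple[List[List[float]], int]]:
    """Pair each `lookback`-day window of feature vectors with the
    number of stars gained over the following `k` days.

    Assumption: the target is the k-day total; the abstract does not
    say whether totals or per-day values are predicted.
    """
    if len(daily_features) != len(daily_stars):
        raise ValueError("features and stars must cover the same days")

    samples = []
    # The last usable window must leave k days of stars after it.
    last_start = len(daily_stars) - lookback - k
    for start in range(last_start + 1):
        end = start + lookback
        window = [list(day) for day in daily_features[start:end]]
        target = sum(daily_stars[end:end + k])
        samples.append((window, target))
    return samples


# Toy data: one feature per day (hypothetical values, not from the study).
stars = [1, 2, 3, 4, 5, 6]
features = [[float(s)] for s in stars]
samples = make_windows(features, stars, lookback=2, k=3)
```

Each resulting pair holds a sequence of daily feature vectors and a single target, which is the input shape a recurrent model typically consumes. Varying `k` over 1, 7, 14, and 30 would yield one dataset per forecast horizon.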
Sun 26 May
|14:00 - 14:30|
What Are the Perception Gaps between FLOSS Developers and SE Researchers? – A case of bug finding research
|14:30 - 15:00|
Does FLOSS in Software Engineering Education narrow the Theory-Practice Gap? A Study Grounded on Students' Perception
|15:00 - 15:30|