This page contains data and scripts used in the paper: "Developer See, Developer Do? Homophily and Influence in OSS Projects" by David Kavaler and Vladimir Filkov
ASF project data was initially processed and gathered by Mohammad Gharehyazie. Additional processing was done by us to gather metrics. The files above contain the post-processed data used in our work. All files are in csv format.
The scripts above were used to generate networks using a decay function and to run the SIENA model simulation. All scripts are written in R. Scripts require editing before use to specify file paths and model names (see all strings containing "PLACEHOLDER"). Control profile computations were performed using Python Zen (not included; the scripts are 4 lines long and can be easily reproduced using the Zen documentation).