"GitHub OSS Governance File Dataset", Yen et al., 2023
❓ How many #github repos have a governance.md?
➡️ ~1,600,000 🐘
❓ Of those, how many have the governance.md in their root directory? (I.e. excluding copies that only appear inside bundled dependencies)
➡️ 1,899 🐭
❓ Of those, how many have at least one issue/commit? (I.e. 'significant')
➡️ 710 👀
https://arxiv.org/abs/2304.00460 #gov #governance
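For anyone who wants to play with the funnel above, here is a minimal sketch of the two filters in Python. It is not the authors' script: the `Repo` record, its field names, and the sample repos are all hypothetical, and it assumes "at least one issue/commit" means either count is non-zero.

```python
from __future__ import annotations
from dataclasses import dataclass


@dataclass
class Repo:
    # Hypothetical record; field names are assumptions, not the dataset's schema.
    name: str
    governance_paths: list[str]  # paths of every governance.md found in the repo
    issues: int
    commits: int


def has_root_governance(repo: Repo) -> bool:
    # A root-level file has no directory component in its path.
    return any(p.lower() == "governance.md" for p in repo.governance_paths)


def is_significant(repo: Repo) -> bool:
    # Assumption: one issue OR one commit is enough to count as 'significant'.
    return repo.issues >= 1 or repo.commits >= 1


def funnel(repos: list[Repo]) -> tuple[int, int, int]:
    # Returns (all repos, repos with root governance.md, significant ones).
    root = [r for r in repos if has_root_governance(r)]
    significant = [r for r in root if is_significant(r)]
    return len(repos), len(root), len(significant)


# Made-up sample data, for illustration only.
repos = [
    Repo("a/app", ["GOVERNANCE.md"], issues=3, commits=40),
    Repo("b/site", ["node_modules/dep/governance.md"], issues=5, commits=12),
    Repo("c/empty", ["governance.md"], issues=0, commits=0),
]
counts = funnel(repos)
```

On the real numbers, the first filter keeps roughly 0.12% of repos (1,899 of ~1,600,000) and the second keeps about 37% of those (710 of 1,899).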
GitHub OSS Governance File Dataset
Open-source Software (OSS) has become a valuable resource in both industry and academia over the last few decades.
arXiv.org
Doug Webb •
1) The GitHub API is non-deterministic
2) There are many other places a project may keep its governance (e.g. in its README, under any other filename, not on GitHub...)
Doug Webb •
The 600 MB dataset and the scripts they used are here: https://zenodo.org/records/7530768
Open-source Software Governance Documentation Dataset on GitHub
Zenodo