Identifying Vulnerable GitHub Repositories and Users in Scientific Cyberinfrastructure: An Unsupervised Graph Embedding Approach

Altmetric Attention Score

This badge shows attention from news, blogs, social media, policy documents, and more. View details

๐Ÿ“ˆ Dimensions Citation Metrics

Dimensions tracks citations across scholarly literature, patents, clinical trials, and policy documents. View full metrics โ†’

In Plain Terms

Scientific research teams lean heavily on GitHub to share code, but public repositories can leak secrets and expose insecure code. This study uses social-network analysis and graph machine learning to map relationships among GitHub users and repositories and cluster those with similar vulnerabilities, finding that high-impact genomics projects were prone to leaked secrets and insecure coding.

Key Contributions

Key contributions will be added soon.

Artifacts

Citation

Ben Lazarine, Sagar Samtani, Mark Patton, Hongyi Zhu, Steven Ullman, Benjamin M. Ampel, & Hsinchun Chen (2020). Identifying Vulnerable GitHub Repositories and Users in Scientific Cyberinfrastructure: An Unsupervised Graph Embedding Approach . IEEE ISI https://doi.org/10.1109/ISI49825.2020.9280544
Benjamin M. Ampel
Benjamin M. Ampel
Assistant Professor in Computer Information Systems and Director, Center for CyberAI Research (CCAIR)

My research focuses on AI-enabled Cybersecurity, including Cyber Threat Intelligence, Large Language Models, and Phishing Detection.

Loading stats...