Mostra i principali dati dell'item
CrossVul: A cross-language vulnerability dataset with commit data
| dc.creator | Nikitopoulos G., Dritsa K., Louridas P., Mitropoulos D. | en |
| dc.date.accessioned | 2023-01-31T09:40:14Z | |
| dc.date.available | 2023-01-31T09:40:14Z | |
| dc.date.issued | 2021 | |
| dc.identifier | 10.1145/3468264.3473122 | |
| dc.identifier.isbn | 9781450385626 | |
| dc.identifier.uri | http://hdl.handle.net/11615/77188 | |
| dc.description.abstract | Examining the characteristics of software vulnerabilities and the code that contains them can lead to the development of more secure software. We present a dataset (∼1.4 GB) containing vulnerable source code files together with the corresponding, patched versions. Contrary to other existing vulnerability datasets, ours includes vulnerable files written in more than 40 programming languages. Each file is associated to (1) a Common Vulnerability Exposures identifier (CVE ID) and (2) the repository it came from. Further, our dataset can be the basis for machine learning applications that identify defects, as we show in specific examples. We also present a supporting dataset that contains commit messages derived from Git commits that serve as security patches. This dataset can be used to train ML models that in turn, can be used to detect security patch commits as we highlight in a specific use case. © 2021 ACM. | en |
| dc.language.iso | en | en |
| dc.source | ESEC/FSE 2021 - Proceedings of the 29th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering | en |
| dc.source.uri | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85116242734&doi=10.1145%2f3468264.3473122&partnerID=40&md5=037339cf4c3e85e85d0b5fd6b8ad84ab | |
| dc.subject | Commit message | en |
| dc.subject | Cross languages | en |
| dc.subject | Dataset | en |
| dc.subject | Machine learning applications | en |
| dc.subject | Secure software | en |
| dc.subject | Security patches | en |
| dc.subject | Software vulnerabilities | en |
| dc.subject | Source codes | en |
| dc.subject | Vulnerability | en |
| dc.subject | Codes (symbols) | en |
| dc.subject | Association for Computing Machinery, Inc | en |
| dc.title | CrossVul: A cross-language vulnerability dataset with commit data | en |
| dc.type | conferenceItem | en |
Files in questo item
| Files | Dimensione | Formato | Mostra |
|---|---|---|---|
|
Nessun files in questo item. |
|||