Skip to content
Manual CVE_R is the lead signal

SAST Benchmark Leaderboard

We cross-reference the 165-case Java CVE Benchmark with OWASP Benchmark F1 results. Manual Check (S_M-C) detection rates stay front and center, while Approx. FP and the gap between vendor claims and measured detections remain visible at a glance.

Manual Best

Horusec

12.7% CVE_R (Manual)

Avg. Manual CVE_R

6.2%

Across all 7 tools / 165 CVEs

Manual coverage

12.7

Manual findings / 165 CVEs

Filters
Switch between license, freshness, and indicator priorities.
Supported languages: Java (multi-language UI planned)

License

Indicator

Last updated

Languages

CVE_R x OWASP F1 bubble chart
X: Manual CVE_R% / Y: OWASP F1% / bubble size: speed tier (fast < mid < slow).
fast: insider, contrastslow: semgrep, codeql

Leaderboard (manual-first sort)

Sorted by Manual CVE_R counts, with a helper score (Manual 60% + F1 30% + inverse FP 10%). High Approx. FP or over-claim tools surface with badges so non-experts can spot risk quickly.

#1

Horusec
OSS
Manual 12.7%OWASP F1 49.0%Approx.FP 28.6%Over-claim 80.4%

Manual CVE_R (S_M-C after review)

12.7%21 / 165 CVEs
Composite score
29.5
Updated: 2023-08-15Roughly 5-15 min depending on project size
ScenarioCount%
S_F-A5734.5%
S_F-C3823.0%
S_M-A4627.9%
S_M-C2817.0%
Manual (S_M-C verified)2112.7%
Speed: mid / Roughly 5-15 min depending on project size
View details

#2

SpotBugs+FindSecurityBugs
OSS
Manual 10.3%OWASP F1 82.8%Approx.FP 94.1%Over-claim 86.7%

Manual CVE_R (S_M-C after review)

10.3%17 / 165 CVEs
Composite score
31.6
Updated: 2023-07-30Roughly 5-15 min depending on project size
ScenarioCount%
S_F-A3420.6%
S_F-C2615.8%
S_M-A3118.8%
S_M-C1911.5%
Manual (S_M-C verified)1710.3%
Speed: mid / Roughly 5-15 min depending on project size
View details

#3

CodeQL
OSS
Manual 6.7%OWASP F1 79.8%Approx.FP 45.5%Over-claim 92.1%

Manual CVE_R (S_M-C after review)

6.7%11 / 165 CVEs
Composite score
33.4
Updated: 2023-10-01Long runs (Semgrep 230-274s, CodeQL queries may reach 24h)
ScenarioCount%
S_F-A2414.5%
S_F-C159.1%
S_M-A2012.1%
S_M-C137.9%
Manual (S_M-C verified)116.7%
Speed: slow / Long runs (Semgrep 230-274s, CodeQL queries may reach 24h)
View details

#4

Semgrep
OSS
Manual 5.5%OWASP F1 29.7%Approx.FP 44.4%Over-claim 88.6%

Manual CVE_R (S_M-C after review)

5.5%9 / 165 CVEs
Composite score
17.8
Updated: 2023-12-01Long runs (Semgrep 230-274s, CodeQL queries may reach 24h)
ScenarioCount%
S_F-A6036.4%
S_F-C3118.8%
S_M-A3722.4%
S_M-C1710.3%
Manual (S_M-C verified)95.5%
Speed: slow / Long runs (Semgrep 230-274s, CodeQL queries may reach 24h)
View details

#5

SonarQube (CE)
OSS
Manual 5.5%OWASP F1 27.0%Approx.FP 55.6%Over-claim 90.8%

Manual CVE_R (S_M-C after review)

5.5%9 / 165 CVEs
Composite score
15.8
Updated: 2023-09-05Roughly 5-15 min depending on project size
ScenarioCount%
S_F-A2213.3%
S_F-C127.3%
S_M-A169.7%
S_M-C106.1%
Manual (S_M-C verified)95.5%
Speed: mid / Roughly 5-15 min depending on project size
View details

#6

Insider
Commercial
Manual 2.4%OWASP F1 13.9%Approx.FP 100.0%Over-claim 95.8%

Manual CVE_R (S_M-C after review)

2.4%4 / 165 CVEs
Composite score
5.6
Updated: 2023-09-20Scan completes ~<=5 min (LoC < 50k)
ScenarioCount%
S_F-A2917.6%
S_F-C116.7%
S_M-A1911.5%
S_M-C95.5%
Manual (S_M-C verified)42.4%
Speed: fast / Scan completes ~<=5 min (LoC < 50k)
View details

#7

Contrast Codesec Scan
Commercial
Manual 0.6%OWASP F1 84.4%Approx.FP 100.0%Over-claim 99.1%

Manual CVE_R (S_M-C after review)

0.6%1 / 165 CVEs
Composite score
25.7
Updated: 2023-11-10Scan completes ~<=5 min (LoC < 50k)
ScenarioCount%
S_F-A31.8%
S_F-C21.2%
S_M-A21.2%
S_M-C10.6%
Manual (S_M-C verified)10.6%
Speed: fast / Scan completes ~<=5 min (LoC < 50k)
View details