Scoring AI hackers when there is no answer key
AI models are increasingly solving offensive-cyber benchmarks, which makes those benchmarks less effective at measuring the models' capabilities. As models excel in these tests, they often rely on known bugs and publicly
Original source: Help Net Security