Jaden Norman@lemmy.world to Technology@lemmy.worldEnglish · 2 months agoAI agents wrong ~70% of time: Carnegie Mellon studywww.theregister.comexternal-linkmessage-square66linkfedilinkarrow-up1503arrow-down115cross-posted to: technology@lemmy.ml
arrow-up1488arrow-down1external-linkAI agents wrong ~70% of time: Carnegie Mellon studywww.theregister.comJaden Norman@lemmy.world to Technology@lemmy.worldEnglish · 2 months agomessage-square66linkfedilinkcross-posted to: technology@lemmy.ml
minus-squarelepinkainen@lemmy.worldlinkfedilinkEnglisharrow-up7arrow-down3·2 months agoWrong 70% doing what? I’ve used LLMs as a Stack Overflow / MSDN replacement for over a year and if they fucked up 7/10 questions I’d stop. Same with code, any free model can easily generate simple scripts and utilities with maybe 10% error rate, definitely not 70%
Wrong 70% doing what?
I’ve used LLMs as a Stack Overflow / MSDN replacement for over a year and if they fucked up 7/10 questions I’d stop.
Same with code, any free model can easily generate simple scripts and utilities with maybe 10% error rate, definitely not 70%