GPT-4.5 (openai.com)
Posted by cyrano@lemmy.dbzer0.com to Technology@lemmy.world · 13 hours ago · 4 comments
cygnus@lemmy.ca · 13 hours ago
Those charts are hilarious: wow, it gives the right answer 62.5% of the time and only makes up completely false answers 37.1% of the time! It’s like Russian roulette, but worse!
olympicyes@lemmy.world · 12 hours ago
If you play Russian roulette with two bullets like a real man, then this model is about the same outcome!
regrub@lemmy.world · 12 hours ago
Surely, people won’t use the slop generator in applications where being correct is important, right?