ylai@lemmy.ml to AI@lemmy.ml • 1 year agoChatGPT gets code questions wrong 52% of the timeexternal-linkmessage-square2 fedilinkarrow-up12
arrow-up12external-linkChatGPT gets code questions wrong 52% of the timeylai@lemmy.ml to AI@lemmy.ml • 1 year agomessage-square2 Commentsfedilink
minus-squareSirGolan@lemmy.sdf.orghexbear1·1 year agoGPT4 with reflexion prompting gets 90% correct (for HumanEval coding benchmark). The paper this is based on is misleading at best. linkfedilink
GPT4 with reflexion prompting gets 90% correct (for HumanEval coding benchmark). The paper this is based on is misleading at best.