@ModerateImprovement to [email protected]English • 2 months agoEveryone Is Judging AI by These Tests. But Experts Say They’re Close to Meaningless.themarkup.orgexternal-linkmessage-square26arrow-up1132arrow-down17
arrow-up1125arrow-down1external-linkEveryone Is Judging AI by These Tests. But Experts Say They’re Close to Meaningless.themarkup.org@ModerateImprovement to [email protected]English • 2 months agomessage-square26
minus-square@[email protected]linkfedilinkEnglish2•2 months agoThis is the way: https://chat.lmsys.org/?arena
This is the way:
https://chat.lmsys.org/?arena