@[email protected] to [email protected] • 7 months agoUnpacking the hype around OpenAI’s rumored new Q* modelwww.technologyreview.comexternal-linkmessage-square12fedilinkarrow-up137arrow-down18
arrow-up129arrow-down1external-linkUnpacking the hype around OpenAI’s rumored new Q* modelwww.technologyreview.com@[email protected] to [email protected] • 7 months agomessage-square12fedilink
minus-squareQ*Bert Reynoldslink14•edit-27 months agoIt’s probably based on Q learning, which has been around for 30+ years, and I’m guessing the star is a nod to A* because it’s an optimization of some kind.
It’s probably based on Q learning, which has been around for 30+ years, and I’m guessing the star is a nod to A* because it’s an optimization of some kind.