• FooBarrington@lemmy.world
    1 year ago

    The problem is that the rating carries no review score while reviewers vary in how certain they are, which makes it poor information for judging quality. If people are certain the game is a 7/10, that could produce a better rating than people being less certain about an 8/10, because the wider distribution (less certainty) could put more reviews below the positive/negative threshold.

    But what do you mean by “more certain” and “less certain”? Again, Steam doesn’t have reviews beyond boolean values.

    The following reviews: 6/10, 6.5/10, 7/10, 7.5/10, 8/10 will produce a 100% rating. More certain, less useful.

    And since Steam doesn’t have point-based reviews, the 100% rating is fully correct, as presumably each of these reviewers gave a positive review.

    The following reviews: 4/10, 6/10, 8/10, 10/10, 10/10 will produce an 80% rating. Less certain, more useful.

    How is it “less certain”?

    It’s only consistent if you assume all games follow the same distribution, which is not how reviews work in my opinion.

    Do you have statistical analyses that show this assumption to be wrong?

    • bogdugg
      1 year ago

      My entire argument is aimed at the idea that you can ascertain quality from these ratings, which I am refuting. The rating is “correct” in that it is measuring something, and as long as people keep in mind what that something is, there is no problem. But this article, for example, uses the flood of positive reviews to make the case that it is one of the best of the year, which I believe is faulty reasoning.

      What I meant by certain is that the reviews are more clumped together (again, assuming each review had a score; Steam doesn’t show one, but presumably you could attach one to each review), so there’s more agreement among different people about the quality of the product. If you don’t agree that games can be more or less polarizing, you won’t agree with this point unless I can back it up with data, which I’m not going to spend time doing. You could go through Rotten Tomatoes and compare Critic Score with RT Score, since they surface both values, and see how closely they track on different parts of the spectrum.
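
For anyone who wants to check the arithmetic on the two hypothetical review sets quoted earlier in the thread, here is a rough Python sketch. It is not how Steam computes anything: the 5/10 cutoff for a thumbs-up, the function name `percent_positive`, and the use of population standard deviation as a stand-in for "certainty" are all assumptions made for illustration.

```python
from statistics import mean, pstdev

# Assumption: a reviewer clicks "recommend" when their private score
# is at least 5/10. Steam itself only ever sees the boolean.
POSITIVE_THRESHOLD = 5.0


def percent_positive(scores, threshold=POSITIVE_THRESHOLD):
    """Collapse hypothetical x/10 scores into a percent-positive rating."""
    positives = sum(1 for s in scores if s >= threshold)
    return 100.0 * positives / len(scores)


# The two example sets from the thread.
clumped = [6, 6.5, 7, 7.5, 8]   # reviewers mostly agree
spread = [4, 6, 8, 10, 10]      # reviewers disagree more

for name, scores in (("clumped", clumped), ("spread", spread)):
    print(
        name,
        "rating:", percent_positive(scores),
        "mean score:", mean(scores),
        "std dev:", round(pstdev(scores), 2),
    )
```

Under that assumed cutoff, the clumped set gets the higher percent-positive rating even though its mean score is lower, and the standard deviation is one way to put a number on how "clumped" each set is. Whether real reviewers behave like this is exactly the open question in the thread.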