Better Lemmy Through Automated Moderation

auk@slrpnk.net · edit-2 3 months ago

Better Lemmy Through Automated Moderation

Monkey With A Shell@lemmy.socdojo.com · 4 months ago

So that second part makes for some considerations. I can see merit in both taking consideration of outside actions and to keeping it limited in scope.

Using the user profile as a source rather than the local community you can get an instant status on the person even if they’ve never been in the moderated space. If they exclusively or even mostly posted in ‘those’ places though then their extremely popular views in that space may well not be suited for the wider fedi but still leave them with enough rank as I’m interpreting it to be allowed room to put out their rhetoric.

While there’s certainly something to be said for engaging with dissenting views, there are also a fair number of spaces out there full of people that are just a general net negative to civil conversation, but very strongly agree with each other, usually they’re found under a bridge harassing goats. Wondering here if there’s a way to create a weighting to discount actions in specific communities without creating the echo chamber effect as a result.

auk@slrpnk.net · 4 months ago

We talked about that issue early on. As a matter of fact, Santa initially had some options to give different types of weight to different instances, so that the “wrong” people wouldn’t wind up controlling its output, but in the end it turned out that simply by giving it a good amount of data about the holistic picture of all the communities, it was able to figure out who the jerks were without needing guidance about it.

The thing is that “rank” circulates within the community and feeds on itself. It’s not just that upvotes give you positive rank. It’s contingent on the rank of the person voting for you, in a huge recursive expression, so trusted users have more weight than untrusted users. I can show you the math, it’s really neat.

In the PageRank version of the algorithm, you could model PageRank as being a finite physical substance flowing through a network between the different web pages. That means that if you can create fake nodes adding up to 1% of the network, you’ll have 1% of the rank at your disposal to be able to use for gaming the system. My version of the algorithm isn’t like that. Because of the way I decided to change things to add downvotes into it, rank can build up and multiply within feedback loops. So if you create a little cycle of 100 users that all upvote each other, they’ll have some rank, but it’s not going to be able to outweigh tens of thousands of users that all upvote each other and multiply their rank on each other. So if the two networks are coming into communication, any downvotes flowing from the big network to the small network are going to have a huge weight that’ll overwhelm the small network’s ability to game the system. As far as I can tell, adding downvotes to the math actually made it more resistant to some failure modes than PageRank was.

It’s hard to say, of course, without seeing specifics. And it’s a tricky balancing act. You don’t want a minority community to be subject to censorship, but you also don’t want an obnoxiously vocal group of trolls to be able to overcome the community’s disapproval because they all agree with each other in their obnoxiousness.

I can talk in specifics, if that makes it easier. If you point out somebody from one of those minority communities that you think is “escaping” into the wider community and causing problems, I can do some introspection and let you know how Santa views their user and why. I think the way I have the algorithm tuned now is pretty good, but it’s always good to check, because I keep finding stuff that I missed.

Getting back to what you were saying initially, if you want to try it out on some of your communities to ease the moderation load for you, I think it’s ready. I’ve been running [email protected] and doing almost no moderation of my own, and things have been working fine. It’s been a little too eager to delete comments from people it doesn’t have a clear picture of, and I think I fixed that now, but the fix hasn’t been in action for long enough to be confident. But I do think that it’s realistic that this could be in alpha release as a hands-off moderation tool for a real sizable community. There are some new features I wanted to add to make it useful for real moderation, but if you want my help to set it up to be useful for running on a while instance and vetting the new users and controlling the communities and things, I think it’s ready to do that.

Monkey With A Shell@lemmy.socdojo.com · edit-2 4 months ago

Very interesting, it sounds similar to the web-of-trust scoring system implemented on Freenet some while ago I’d done some reading on. For right now most of my curiosity is in an academic sense. My node is functionally a client at this point largely set up because I can and enjoy the technical aspects rather than creating an account elsewhere. If I did start actually having local communities and users though I expect something like this would be useful, often times I would trust the impartiality of an algorithm over the more emotional response of people for consistency.

auk@slrpnk.net · 4 months ago

I agree. It helps if you model the thing as implementing the will of the community, outsourcing moderation to the consensus instead of having one person making the best decisions they can come up with.

Better Lemmy Through Automated Moderation

Better Lemmy Through Automated Moderation

FAQ

What do you think?