Better Lemmy Through Automated Moderation

auk@slrpnk.net · edit-2 3 months ago

Better Lemmy Through Automated Moderation

6 months ago

There are so many problems with this.

It would be extraordinarily easy to bot it and just silence anyone you want.
I agree, moderation is absolutely necessary to maintaine civil discussion, but silencing people, because they have unpopular opinions, is a really bad idea.
I love lemmy because it is the ultimate embodiment of decentralised free speech. This destroys that.
If I were a bad actor, hypothetically, let’s just say lammy.ml or haxbear and I decided I wanted to silence anyone who disagrees with what I have to say. Then I could just make a fork of this project to only value my instances votes and censor anyone who doesn’t agree with what my community thinks.
This tool simply acts as a force multiplier for those who want to use censorship as a tool for mass silencing of descent.

Universal Monk@slrpnk.net · 3 months ago

deleted by creator

auk@slrpnk.net · 6 months ago

Oh no! It hadn’t occurred to me that excluding unpopular opinions might be a problem. If only I’d thought of that, I might have looped in some other people, talked extensively about the problem and carefully watched how it was working in practice and tweaked it until it seemed like it was striking the right balance. I might have erred heavily on the side of allowing people to speak to the point that I was constantly fielding complaints from people wanting me to remove something they said shouldn’t be allowed.

And furthermore, you’re right. If this catches on then lemmy.ml might be able to silence dissenting views. That would be terrible.

Monkey With A Shell@lemmy.socdojo.com · 4 months ago

So the talk of some of the more eccentric parts of the fedi got me thinking here. I run a currently single user instance largely because the state of mod tools is scary (the inability to easily look over the activities on my instance for example) but would potentially like to open the doors at some point. Tools like this could help that.

A couple edge situations that I wonder how it would respond.

I’ve a time or two relocated the instance in my lab just by rebuilding it because migrating DBs is a pain and I’m the only one here anyhow but used the same user and domain names, would the bot recognize those recreated users as separate entities or would any actions be based purely on the name?

In a couple cases I’ve run one of those subscriber bots and as a result found some communities in circle-jerk parts of the fedi. Posting in them with anything dissenting from their views ends up all kinds of negative. Does the bot take into consideration scoring based on the user profile including actions outside the moderated community, or just within its own territory?

auk@slrpnk.net · 4 months ago

I’ve a time or two relocated the instance in my lab just by rebuilding it because migrating DBs is a pain and I’m the only one here anyhow but used the same user and domain names, would the bot recognize those recreated users as separate entities or would any actions be based purely on the name?

If it’s the same user and domain name, but something’s been rebuilt behind the scene, it’ll identify it as the same user. The same user name on a different instance, as identified by hostname, shows up as a different user.

In a couple cases I’ve run one of those subscriber bots and as a result found some communities in circle-jerk parts of the fedi. Posting in them with anything dissenting from their views ends up all kinds of negative. Does the bot take into consideration scoring based on the user profile including actions outside the moderated community, or just within its own territory?

It’s an interesting question. It’s going to be different case by case, but I think most of the time, it shouldn’t affect you too much. Most of the users in those circle-jerk communities have very small rank values, or negative. I think any participation you’re doing in those communities won’t matter one way or another.

A while ago, the algorithm was so simplistic that a user with heavily negative rank would have their votes count backwards. If a bunch of the circle-jerk people were downvoting you, that would actually raise your rank, and upvotes would lower it. I left it that way for quite a while, both because it’s funny and because it does have a certain logic. If someone’s wrong more often than they’re right, and they’re upvoting you, that probably means you’re doing something wrong.

After a while, I got rid of it. On top of the obvious possibility it creates for abuse, it turns out that a lot of the low-rank people still give out some sensible votes sometimes, and those people would get penalized unreasonably. So now, anything that someone does in a community that the wider community doesn’t respect, just gets ignored.

Monkey With A Shell@lemmy.socdojo.com · 4 months ago

So that second part makes for some considerations. I can see merit in both taking consideration of outside actions and to keeping it limited in scope.

Using the user profile as a source rather than the local community you can get an instant status on the person even if they’ve never been in the moderated space. If they exclusively or even mostly posted in ‘those’ places though then their extremely popular views in that space may well not be suited for the wider fedi but still leave them with enough rank as I’m interpreting it to be allowed room to put out their rhetoric.

While there’s certainly something to be said for engaging with dissenting views, there are also a fair number of spaces out there full of people that are just a general net negative to civil conversation, but very strongly agree with each other, usually they’re found under a bridge harassing goats. Wondering here if there’s a way to create a weighting to discount actions in specific communities without creating the echo chamber effect as a result.

auk@slrpnk.net · 4 months ago

We talked about that issue early on. As a matter of fact, Santa initially had some options to give different types of weight to different instances, so that the “wrong” people wouldn’t wind up controlling its output, but in the end it turned out that simply by giving it a good amount of data about the holistic picture of all the communities, it was able to figure out who the jerks were without needing guidance about it.

The thing is that “rank” circulates within the community and feeds on itself. It’s not just that upvotes give you positive rank. It’s contingent on the rank of the person voting for you, in a huge recursive expression, so trusted users have more weight than untrusted users. I can show you the math, it’s really neat.

In the PageRank version of the algorithm, you could model PageRank as being a finite physical substance flowing through a network between the different web pages. That means that if you can create fake nodes adding up to 1% of the network, you’ll have 1% of the rank at your disposal to be able to use for gaming the system. My version of the algorithm isn’t like that. Because of the way I decided to change things to add downvotes into it, rank can build up and multiply within feedback loops. So if you create a little cycle of 100 users that all upvote each other, they’ll have some rank, but it’s not going to be able to outweigh tens of thousands of users that all upvote each other and multiply their rank on each other. So if the two networks are coming into communication, any downvotes flowing from the big network to the small network are going to have a huge weight that’ll overwhelm the small network’s ability to game the system. As far as I can tell, adding downvotes to the math actually made it more resistant to some failure modes than PageRank was.

It’s hard to say, of course, without seeing specifics. And it’s a tricky balancing act. You don’t want a minority community to be subject to censorship, but you also don’t want an obnoxiously vocal group of trolls to be able to overcome the community’s disapproval because they all agree with each other in their obnoxiousness.

I can talk in specifics, if that makes it easier. If you point out somebody from one of those minority communities that you think is “escaping” into the wider community and causing problems, I can do some introspection and let you know how Santa views their user and why. I think the way I have the algorithm tuned now is pretty good, but it’s always good to check, because I keep finding stuff that I missed.

Getting back to what you were saying initially, if you want to try it out on some of your communities to ease the moderation load for you, I think it’s ready. I’ve been running [email protected] and doing almost no moderation of my own, and things have been working fine. It’s been a little too eager to delete comments from people it doesn’t have a clear picture of, and I think I fixed that now, but the fix hasn’t been in action for long enough to be confident. But I do think that it’s realistic that this could be in alpha release as a hands-off moderation tool for a real sizable community. There are some new features I wanted to add to make it useful for real moderation, but if you want my help to set it up to be useful for running on a while instance and vetting the new users and controlling the communities and things, I think it’s ready to do that.

Monkey With A Shell@lemmy.socdojo.com · edit-2 4 months ago

Very interesting, it sounds similar to the web-of-trust scoring system implemented on Freenet some while ago I’d done some reading on. For right now most of my curiosity is in an academic sense. My node is functionally a client at this point largely set up because I can and enjoy the technical aspects rather than creating an account elsewhere. If I did start actually having local communities and users though I expect something like this would be useful, often times I would trust the impartiality of an algorithm over the more emotional response of people for consistency.

auk@slrpnk.net · 4 months ago

I agree. It helps if you model the thing as implementing the will of the community, outsourcing moderation to the consensus instead of having one person making the best decisions they can come up with.

Universal Monk@slrpnk.net · edit-2 3 months ago

So I just noticed that your reply has more downvotes than upvotes.

And also, you tone seems to be sarcastic and going straight against what you I thought you were actually advocating for, which is positive communication.

I like the idea/theory of your bot, but the tone of your response to that person totally caught me off-guard.

If the santa bot were modding this very community, with all the negative downvotes your posts have gotten, wouldn’t you be banned according to it’s programming?

auk@slrpnk.net · 3 months ago

Being friendly at a surface level with other people is not at all the same as bringing pleasant interactions and patterns to the conversation as whole.

You, of all people, should reflect on that. Making sure to smile while you’re shitting on the carpet doesn’t mean you’re welcome at the party.

For transparency’s sake, I didn’t intend to ban anyone from the Santabot meta community, but you and your alts obviously deserve an exception. I’m banning you and I’ll make a post recommending that the admins do something about your other ban-evading accounts.

millie@slrpnk.net · 7 months ago

What the hell dystopian meow meow beanz nonsense is this?

auk@slrpnk.net · 7 months ago

Oh no, my MeowMeowBeanz!

southsamurai · 5 months ago

This is the shittiest bot ever

admin@lemmy.fediboat.duckdns.org · 5 months ago

am i federating? get ready for me to test ur bot to the limmit

LibertyLizard@slrpnk.net · 8 months ago

As I posted in the other thread, I’m very interested to see how this works out. I am definitely curious to see what the bot thinks of some of my posting habits if you are able to share that.

TheObviousSolution@lemm.ee · 2 months ago

On the one hand, I feel like this is on brand for a Black Mirror episode. On the other, I just came here because I found a popular troll who seems to grift mod abuse got banned by it, so kudos.

auk@slrpnk.net · 14 days ago

Thanks. I’m happy with how it’s performing. I haven’t been paying much attention to it recently, but I do consider it ready for wider deployment at this point. It’s been running with minimal issues for quite a while in a busy community. It was harder than I expected to get it to work in a satisfactory way, not just making bad or random decisions or banning unpopular people, or otherwise acting like a lot of human Lemmy moderators.

auk@slrpnk.net · 3 months ago

I added an entry to the FAQ, at the end, about why the bot doesn’t notify for bans. I think the key thing to understand is that these aren’t permanent bans, and they apply 99% to people who will never care about the moderated community. If you have thoughts about it, I’m happy to talk, up to a point. The explanation is up there.

@[email protected] @[email protected]

givesomefucks@lemmy.world · 2 months ago

Can you turn this shit off yet?

It clogs up modlogs on every instance and it looks like your idea never took off.

But your bot is going to keep running and banning/unbanning people.

auk@slrpnk.net · 14 days ago

I drink your tears and they are delicious. It’s the number 4 community on slrpnk without me needing to do anything at all to feed it. I’m happy with that. I might take a look at deploying it in some non-test-bed scenarios, since it seems like it’s proven itself for long enough that it’s no longer in a “test” phase in any meaningful sense.

The spam in the modlog is a real concern. I may take a look at how to try to minimize it without impacting any design goals.

I’m also enjoying some of the reasons you got attention from human moderators:

Being an condescending asshole

Misinformation

Being aggressive, ridiculous, and insulting. Any further moderation issues from this user will result in a permanent ban from this community.

Aggressive, hostile, and ridiculous.

Gee, I can’t imagine to myself why you might be mad about automated moderation tools. Surely you’re just suddenly concerned about their long-term impact on Lemmy, and want to offer your input, to do your part to make the environment a better place.

The Quuuuuill@slrpnk.net · 8 months ago

How will this be audited to ensure fascists don’t game the down votes to quell pro-solarpunk, pro-liberation messaging?

auk@slrpnk.net · 8 months ago

Gaming the system is, I think, more unlikely than it might seem. In my auditing leading up to making it live, the problem was the opposite of that. The average fascist account, if it’s not banned outright, might have a “weight” of plus or minus single digits, whereas slrpnk admins might have a weight of several hundred. Some people were getting banned just because of a single downvote from one of the admins, applied to a reasonable comment, outweighed the whole community’s consensus.

I am watching the results, to some extent, and depending on good people who do receive moderation saying something if it seems unreasonable. I think it is possible to create a network of artificial votes to game the system, but you have to do a lot. It’s resistant to simply massively inserting fake votes from some random account to throw off the tally. You have to engineer artificial trust for yourself, and outweigh a community consensus of millions of votes. I think that, if it even takes off to the point that defeating it becomes a focal point, the level of voting that’s required to game the system will be large enough to be obvious during an audit.

keepthepace@slrpnk.net · 4 months ago

Hi! Nothing constructive to add right now, but I just wanted to counteract the negativity of comments here. I think it is a really interesting experiment and that we should embrace the possibilities that the fediverse give us in that respect, that may be actually eventually become the killer feature over centralized solutions.

auk@slrpnk.net · 4 months ago

Thank you. I’m not bothered by the negative comments. For a while, I was trying to demonstrate to those people that I’m working hard on making it resistant to the problems they’re talking about, but I eventually realized that they mostly have no interest in learning what’s going on, or a real exchange about real problems and solutions. I think they just want to yell. My explanation is there in the FAQ, to read if they want to, and if not, there’s not much to do.

Most of the people who have constructive concerns or criticisms phrase them in productive ways, and the conversation is fine. The people who are angrily denouncing my bot generally have no interest in finding out if their claims are true or worth worrying about, so I generally stopped paying attention to them.

Better Lemmy Through Automated Moderation

Better Lemmy Through Automated Moderation

FAQ

What do you think?