The steady stream of people who are telling me that the Santa moderation bot is going to delete anyone who’s downvoted or disagrees with the group, is continuing unabated.

Here’s an olive branch: You’ve got a point. It’s just a black box and I juggle the parameters to some secret process to ban the people who got some downvotes, I can understand how that comes across as toxic. I might or might not be lying about taking careful time to look over its judgements and make sure that I think the impact is more positive than negative, but at the end of the day, it doesn’t matter. You still have to trust my intentions and trust the bot to make good decisions, and trusting that to an automated system rarely works out well.

To me, delegating the moderation of the community to the segment of that community that’s trusted and consistently upvoted by the rest of us is better than giving it to a handful of people who wield unilateral power according to random rules. I like the bot’s judgements most of the time when I look at them. The question is simply whether this algorithm is actually doing that delegation effectively, or if it’s just banhammering anyone who gets a couple of downvotes. I’m confident that it’s doing the first thing almost all of the time.

In talks behind the scenes with other moderators, I’ve been going into a lot of detail about specific users and going back and forth about judgements. I also do a ton of checking behind the scenes. I don’t want to do that publicly. I think it would be deeply informative to post a list of the “top ten” and “bottom ten” users, and go into detail about why the low-ranked users got where they are, but that’s probably not a good idea.

What I would like to do is share that information on some level, so that people can see what’s going on, instead of it being me relaying that everything’s good. It’s tough because I can’t break down every level of detail without invading all kinds of people’s privacy. That said, I do think that there’s a way to be found to open up the process so people can see and give input to what’s going on.

One happy medium I could do would be to have the bot post its spot-check automatically about once a week. It could pick out one random user who’s barely on the borderline, and post a couple of the worst comments they made. Usually, when I’m messing around with its parameters, that’s what I am trying to do. There are some comments that are clearly toxicity that have no business anywhere. There are some comments that are clearly free speech, and even if they’re getting downvotes, they deserve to be heard. Then there are some comments that are on the borderline between. My goal is to set up the parameters so that the borderline rank value for a ban matches up with the users who are on that borderline.

I can see some upsides and downsides to posting that publicly. What do people think, though? What would you want to see, in order to make an informed decision about what you think of this whole approach?

  • Five@slrpnk.net
    link
    fedilink
    English
    arrow-up
    1
    ·
    edit-2
    3 months ago

    I think you’re right to be concerned with the trade-off between transparency and privacy. ActivityPub infrastructure technically exposes a lot of things that would be only shared between company employees and their advertising partners in other social media, but due to the discretion of the people implementing front-end software, most of that information is not exposed to the general public. While the Fediverse has technical transparency, it also has functional privacy. The developers of Lemmy frontends deserve a lot of credit for their caution and care.

    I think a spot-check on a member of the Fediverse would make the inner workings of SantaBot easier to understand. It may be difficult to do that though, without breaking some of the norms about member privacy that we have been carefully building as a culture.

    One solution is to allow members to opt-in to allow their Santabot analysis to be shared publicly. I think I might be one of your borderline cases; I give my consent if you’d like to use me as an example.

    • auk@slrpnk.netOPM
      link
      fedilink
      arrow-up
      1
      arrow-down
      160
      ·
      3 months ago

      I agree. I think spot-checking can do a lot to bring transparency into the picture, and if it’s done carefully, then it’ll be possible to avoid exposing too much about people who haven’t agreed to have it exposed about them.

      I thought about it for a while, and I think doing a weekly spot-check post for a handful of controversial users, showing a visualization of their rank and where it is coming from, might work. Here’s one quickly hacked-up example in the form of a bar code. Time goes from left to right, blue stripes are positive rank, and red stripes are negative rank. Here’s your breakdown for the last month:

      There are three big red stripes. From left to right, they are these threads:

      There is also plenty of blue, though, so you’re comfortably over the line as a nice person under the current parameter set. It’s worth mentioning that a lot of the blue stripes are “unpopular” opinions from the point of view of the average liberal, that are popular on Lemmy, or detailed takedowns of MBFC:

      My opinion is that most of the time, someone who’s garnering a healthy mixture of blue and red is probably showing good faith, and when someone is managing to garner mostly red, it’s more likely to be an issue of quality of engagement, not even necessarily that they’re trying to say something unpopular that the bot is then censoring. But, of course, the proof is in how it works in practice on real users and real content.

      I think doing some type of visualization, maybe automatically generated, and showing the progression over time of someone’s rank depending on particular comments, can help to inform the discussion. I’m sure it won’t stop people from accusing me of all kinds of malfeasance in the way the bot operates, but it can help to put more eyes on it from people who are open and interested in seeing how it’s working.

      • Five@slrpnk.net
        link
        fedilink
        English
        arrow-up
        1
        ·
        edit-2
        3 months ago

        I love this – Reddit used to do a yearly thing where they’d send you your top upvoted and downvoted posts and comments that was always nostalgic and fascinating to me as a user. Like canvas, I think it’s an idea worth copying with a more federated framework.

        Maybe you could write an action that allows Fediverse members to get a similar breakdown and visualization automatically generated and then delivered to them via direct message. People who are curious about how the bot works can message the bot and see how it views them, and then they can share the details publicly if they so choose. I think this could be really popular.