STEEM Contribution Score - From Concept to Model

paulag (73) 7 years ago

bit of a large shout out - the following people engaged with the previous post on this subject and I thought you might be interested in seeing the developments so far, sorry if I missed anyone.

@bashadow, @glenalbrethsen, @gniksivart, @jusipassetti. @tarazkp, @enjar, @algo.coder, @mrday, @jestemkioskiem, @paparodin, @revisesociology, @felix.herrmann, @aneukpineung78, @tfq86, @fullcoverbetting, @just2random, @cryptkeeper17, @curatorcat, @mattclarke, @mountainjewel, @gillianpearce

$0.11

4 votes

glenalbrethsen (68) 7 years ago

Hey, @paulag.

When you say this excel file is huge, you're right. I know I've never produced anything that's come close to taking 30 seconds to load. :) Good job just being able to do that!

All kidding aside, I like where this is headed. I think any form of engagement metric will need to measure both sides of the equation.

I guess for me, the question is, what's the ultimate goal with this. The idea behind the current reputation model was, via upvote you could discern those who have the best content. That seems to be the same idea behind UA, just involving followers but with the same idea that it will lead a person to the best content.

Is the idea of best content being expanded here to include comments or engagement, then? Is the idea to basically change the way we look at reputation or the best content? I don't know. Maybe I'm barking up the wrong tree here.

I'm for a different way to evaluate user interaction. The current reputation system has long since failed. I'm not completely sold on UA, but it seems to be a step in the right direction. This intrigues me a lot because it seems to be based more on what I do, and then what I might be able to influence others to do. I like that idea so far the best.

$0.05

paulag (73) 7 years ago

"One of the primary goals of Steem’s reward system is to produce the best discussions on the internet"

That is taken from the whitepaper. it takes more than just a good post for the best discussions to happen, people need to engage. These metrics try to find the people on steemit that are making the best discussion happen by looking at certain metrics.

$0.03

glenalbrethsen (68) 7 years ago

Very good. So, as you were saying to someone else, the contribution score would be something that would work in concert with, say, UA, and therefore give us all a better idea of who is potentially producing good content as well as producing the best discussions.

$0.00

paulag (73) 7 years ago

yes, they should be able to be used together and in isolation, it really depends on the needs. but the whole idea is as you say to give us a better idea of who is actively engaged in the best discussions

$0.00

mountainjewel (68) 7 years ago

Thanks for the shout out!

$0.00

aneukpineung78 (74) 7 years ago

I knew it. You can't sleep well before pushing some magics to a little bit higher level, Ma'am. And I know it won't end here. 😎

$0.00

curatorcat (67) 7 years ago

Thanks for the shoutout @paulag; appreciate the update! Can't actually look at the excel file here; my laptop is too puny to process it without locking up. However, from your description, I think you're onto something viable...

I have also been watching what the Steem UA guys have been doing, and I realize they are measuring something else that's less "activity specific" but it seems like there might be an opportunity for resource sharing and collaboration here.

"One of the primary goals of Steem’s reward system is to produce the best discussions on the internet"

It's slightly alarming to me just how FAR that is from current reality. Which, I suppose, just illustrates the Human condition; We have great ideals... and once money is involved, everything goes down the drain. So far, we have seen "Those with the most SP win" and "those who can buy the most votes win."

Anyway, I'm still holding some hope for that original ideal, and I'll be interested in seeing your ongoing efforts to quantify them!

=^..^=

$0.00

buggedout (68) 7 years ago

With a bit of heavy lifting support this could be a better indicator than both standard reputation and steem-au. Hope to see the idea progressed.

$0.09

2 votes

paulag (73) 7 years ago

A lot of heaving lifting and work is needed on this to make it a reality, but the good news is that I have spoken with a few talented people with the skills needed.

$0.00

abh12345 (76) 7 years ago

I missed the @excelclub post on Machine Learning, cool stuff!

While UA is leading the way as far as an alternative solution, I've seen numerous comments over the past week on their initial post, my post yesterday, and in @tarazkp's post this morning with regards to the scoring potentially lacking an engagement element.

I've just seen that in the steem-ua discord however that they are starting to look into engagement as a metric:

The information in/out sounds most interesting with regards to this concept and model. But there is obviously still lots of work to do for everyone looking at a better solution to reputation :)

$0.09

2 votes

paulag (73) 7 years ago

yes its good to see that UA are implementing ideas to include engagement, and they are open to feedback as this was a suggestion that I had made to them

However UA and CS are rather different and cater for different needs. I believe these two scoring systems could easily compliment each other.

$0.00

holger80 (72) 7 years ago

Thanks for your suggestion, it is improving the UA vote calculation. Engagement is my weak spot, now I have a score for it. This will help me to write more answers and comments on my posts, hopefully ;).

$0.05

paulag (73) 7 years ago

we all bring different things to the block. Some are awesome engagers, some are awesome developers, some are awesome at promoting steemit to the wider public. each quality has its own value and merit.

You already know I am a fan of UA, and we are all batting for the same team here, the steem team :-) I even have a much higher UA score than CS score which I find rather funny

$0.00

sammosk (71) 7 years ago

Interesting to see where this one leads! Great work @paulag. <3

$0.06

7 votes

meno (73) 7 years ago

oh i like this very much... makes a lot of sense to me to incorporate this asap actually...

$0.05

paulag (73) 7 years ago

wow 'asap' now I am under pressure lol. Glad you like the idea @meno, I know you are keen on the engagement side, its a metric many community leaders focus on.

$0.00

bait002 (72) 7 years ago

This is mind blowing. We need this to be fully developed. It's time we figure out true and organic reputation score not some artificial numbers. I am really looking forward to see you roll out the model @paulag. I see a project changing the way we look at the STEEM ecosystem.

paulag (73) 7 years ago

Cheers @bait002, lets hope we can make a difference 😃

Posted using Partiko Android

$0.00

llfarms (68) 7 years ago

Love this Paula, and am glad to see UA already open to want to implement something like this. It will make the scores more rounded, giving us a more complete picture. Thanks for this!

paulag (73) 7 years ago

UA are not implementing this from what i know but they have taken suggestions on bord which is cool

Posted using Partiko Android

$0.00

bashadow (65) 7 years ago

I am glad to see the response and responses to your your post. (I am still waiting for the file to download, slow connection for me). Using it in combo with other scores can give a user a little bit of an idea on an account. Even though Rep is broken, it is somewhat easy to see the Rep buyers not always but sometimes. With UA and now your CS scoring we can have three things to look at and decide. A combination of CS and UA would be useful the most I think.

I am not sure but I think in the UA post they talked about not showing most of the bots via their system, so with it and CS a mostly bot free system type score.

It was nice to read so many nice comments and to see there are people working on issues, which we do not get to see often enough.

paulag (73) 7 years ago

I think, but not sure UAs system was designed to score bots/spam poorly too.

What is important to remember that both systems score different metrics and cater to different needs. Therefore can be used in isolation or together.

Posted using Partiko Android

$0.00

crokkon (68) 7 years ago

This is very interesting, @paulag! The concept of information given and received sounds reasonable. I have no experience with ML, but I guess tuning the algorithm is key to find a reasonable metric? I had a quick look into the excel (close to 100 MB - wow :) ). I see a couple of bots in the current top 50. Some of them post more or less the same content repeatedly, which could somehow be ruled out by looking into the entropy across comments. Others echo user comments, which is probably harder to filter out. I also noticed a couple of steemhunt mods, giving dozens of comments daily. Overall great work, looking forward to updates!!

paulag (73) 7 years ago

a metric like word diversity, or count of 'unique' comments are on my list to test. Tuning and testing is key. This was a very raw model, a very early starting point. Glad you got the file open, many didn't lol, but I needed to do some sort of testing and well thats the tool I know how to use best. Delighted you came by to have a read, thanks for taking the time to review the idea.

$0.00

evolved08gsr (57) 7 years ago

Very interesting.. How will this account for things like that it's, who engage with enough intelligence to spam a few unique comments to various users with high engagement or reputation or other metric scores, simply to augment their own reputation by playing the numbers game of the algorithm. Almost "reverse calculating" who they need to interact with and how they need to interact as opposed to being human and interacting naturally?

Let's face it, we're playing a zero sum game here against the people who are building these bots for profit... There will always be an incentive to game the system or metrics that are in place to "protect the content". And bots will always win in the long-run based on the resources available to them vs the resources available to a valid and interesting content creator.

paulag (73) 7 years ago

reverse calculations would not be impossible but the information gain probability on each data point used for the weighting would change each day so it would mean working out which metric is the best on the day, rather difficult for the average person. You are so right there will always be an "incentive to game" and so we would need to continuously evolve and we would not be able to eliminate this,

$0.03

2 votes

evolved08gsr (57) 7 years ago

I'm not trying to say that your efforts are for naught, but this is something that I feel any "rating" efforts will need to consider. In my previous analysis of the blockchain, before SteemSQL was a for-purchase tool, there were many attributes that overwhelmingly identified bots/spammers/scammers, but also found valid users who were new and unaware of "acceptable" posting etiquette. The same will occur for providing rankings of valid users, except the "smart bots" will adapt to appear like valid users. The monetary incentive will always murky even the most complex and thoughtful algorithm.

I wish you all the best of luck, but I hope that expectations are tempered and considerable value is not weighted on any of these new scoring mechanisms until they are fully vetted. Looking solely at the top rank and bottom rank may be tempting, but sampling the middle ranks will be paramount to developing a truly reliable scoring method.

$0.00

paulag (73) 7 years ago

I value your input and my efforts my very well be for naught but at least I can say I tried to come up with solutions, and I will keep trying. Steemit allows us do that, help try shape the future of the platform. My intentions are good.

$0.00