STEEMIT 2017 Round UP – Analyzing the ‘STEEMIT’ Category

in utopian-io •  7 years ago  (edited)

There are many categories available on Steemit under which you can publish a post. Consistently in the top 10 categories is STEEMIT. Being such a popular category to post in, I felt it was important to document the data for this category in 2017.

Also when I started on Steemit in July, this was a very hot tag. I observed at the time that most of the posts on Trending were from the tag Steemit. What I did not realise at the time was the peak in new users the same time as I joined. It would make sense so that many people were looking to learn about Steemit and hence the popularity of the tag at the time.

Categories are set by the 'Tags' you select when publishing a post. The first 'Tag' that you select is the Category the post will be recorded in. Once a post has been published on the Steemit platform, tags can be changed but the category will no change.

The aim of this analysis is to

Get an overview and document of the posts made to STEEMIT in 2017
Look at the growth of this category over 2017
Look at the earnings of this category during 2017
Look at top authors of this category
Establish a trend based on previous 2017 tag analysis for the months of May and June for average earnings per Author. Investigate reasons for this trend.

The data and the query

I have connected to the Steemsql database held and managed by @arcange using Microsofts Power BI
The SQL query used for this exploration is

Select *
 FROM comments (NOLOCK)
 where   
 ( created >= CONVERT(datetime,'01/01/2017')
 AND created< CONVERT(datetime,'01/01/2018')) and
  Depth = 0
 And category  in ('steemit')

This query will pull all of the Posts made during 2017 in the category steemit. It will ignore any comments and will also ignore tags. Just to point out, the first tag that is selected on a post is the category in which a post is recorded.

After this I carried out further transformations on the data using M. First with this piece of code I removed any columns I did not want

   = Table.SelectColumns(Source,{"author", "title", "body", "created", "children", "total_payout_value", 
   "net_votes", "pending_payout_value", "total_pending_payout_value"})

Then with this piece of code I changed the data type on the date field

  = Table.TransformColumnTypes(#"Removed Other Columns",{{"created", type date}})

After this I loaded the data into the model and proceeded to model the data using DAX calculations.

The Analysis

3.png

In 2017 13KK posts were created in the STEEMIT category by 23.80K authors. These posts netted 2982K votes and 927K comments and also generated a total SBD pay-out value of 720.89K.
The average number of monthly posts to Life is 10,248 and the median is 10,979. From May to July there was a steady increase in the number of posts per month. This then declined slightly in September and October, increasing again in Dec. This trend is in line with previous tag analysis carried out for 2017

4.png

The visualisation above look at the SBD pay-outs. The bar chart shows the pay-outs over time. We can also see that 26% of posts earn between $1 and $50. Less than 3% of posts earned over $50. The balance of posts earned less than $1. This is also consistent with other 2017 tag analysis carried out. The table on the right shows the top preforming posts in STEEMIT for 2017.

Let’s take a look at some averages

5.png

23.8K authors produced on average 5.17 posts each to the STEEMIT category. The total of these 5.17 posts earned SBD 30.28 with the average post earning 5.86 SBD. This is higher than the reported averages on the categories ‘Blog’ ‘Life’ and ‘Cryptocurrency’. On average each post received 7.54 comments and 24.25 votes. The average vote value on posts in the STEEMIT category was 0.24 SBD

For comparison the averages on the BLOG category are shown below
6.png

We can also view these averages over the year of 2017, we can see some peaks in payments from May to June with a decline till December where we can see another rise. What is disappointing to see is the average number of comment per post declining, however this trend seems to be on all the tags analysed so far.

7.png

If we view the same information over months instead of days
9.png

We can see clearer now that the average SBD per post peaked in May and June, and then decreased again until another rise in December. It is also clear to see that the average number of comments left on a post also peaked in May and June and decreased from then with a small increase again in December.

It seems from this an previous analysis that May and June of 2017 have substantially larger earning. In May we seen an increase in the price of steemit from as low as $.15 in April to $1.42 in May. The price increased again in June to $2.80. The price then fell in July and remained bearish till December. This seems to be what is reflected in the charts above.

This leads to a question on December, as the price increased to over $7. So why is the average payment peak higher than that in May and June?

Looking at the chart below I have added the average number of votes per post. We can see that May has almost twice the average number of votes per post than Dec, but July and December are relatively consistent. There has to be another factor involved.

10.png

Further research brought me to Hardfork 19 which happened in June, and this changed how rewards and voting power were calculated.

Having a look at the authors that posted to the STEEMIT category, first I have sorted the data by the average number of comments per post. Comments are an awesome reflection on engagement within the post.

11.png

I then sorted the data by the average number of votes per post

12.png

And finally I have sorted the data by the total SBD pay out value on posts to cryptocurrency by author

13.png

The following cluster chart shows the Number of comments against the Post SBD pay out.

14.png

Conclusion

We can see clearly that posts on average in the STEEMIT category tend to do better in terms of votes, payouts and comments than posts made in all the categories reported so far. However the overall trends remain the same.

It would be interesting to see, had hardfork 19 not been implemented how the reward pool would be with the price of STEEM at all-time highs.

You can find a full analysis of the LIFE category here
https://utopian.io/utopian-io/@paulag/steemit-2017-round-up-analysing-the-life-category

You can find a full analysis of the BLOG category here
https://utopian.io/utopian-io/@paulag/steemit-2017-round-up-analyzing-the-blog-category

You can find a full analysis of the cryptocurrency category here
https://steemit.com/utopian-io/@paulag/steemit-2017-round-up-analyzing-the-cryptocurrency-tag

On saying that, this analysis does not take into consideration the quality of the post, however I do class comments as a good indicator of engagement. I am hopeful that the more quality the post, the more comments.

I am part of a Blockchain Business Intelligence community. We all post under the tag #BlockchainBI. If you have an analysis you would like carried out on Steemit or Blockchain data, please do contact me or any of the#BlockchainBI team and we will do our best to help you...

You can find #BlockchainBI on discord https://discordapp.com/invite/JN7Yv7j



Posted on Utopian.io - Rewarding Open Source Contributors

Authors get paid when people like you upvote their post.
If you enjoyed what you read here, create your account today and start earning FREE STEEM!
Sort Order:  

Well, the names that appear on this list are no strangers to me. Interesting and maybe coincidence the same group of names appear in many of your data research. It appears to always be the same group of names that appear together. I wonder if there is any reason for that.

My guess? Steemit is a very small posting community, especially when you start talking about content of any length or more than minor significance. On top of that, we have a very active "automated response" community which also dabbles in "automated content generation."

That's not to say that the people that appear regularly aren't producing good content, but they are also people who have learned – by and large – how to use the bot up voters and other means of blockchain interaction to maximize their ability to get on trending lists and thus in front of the rather large influx of new people.

Steemit in general seems to be a bit monomaniacal when it comes to cryptocurrency, so you combine that with a fairly small posting population and at least one significantly gameable aspect and you could definitely see the sort of results that we do.

I think at first glance that would make sense. After being here for a little while longer I do wonder if there is more than meets the eye.

Having been here a little longer, it still makes sense.

This is a system which is very clear about what it rewards. Upvotes, and only upvotes. The theoretical selling point is that good content will lead to more frequent upvotes – but I'm pretty comfortable in saying that no one actually believes that, because if they did there would be less argument every time someone claims someone else is "making too much." If it were commonly believed to be true, it would be impossible to assert that someone else is "making too much," because it would be mechanically impossible.

So we know what people believe and how they act. Let's apply that to the data as it comes.

We could look for some correlation between the names that appear together across multiple reports and then look at their behavior. If we find a commonality, then we can make a host of analyses from that observation.

But do we want to know? That's an important question. If we know, we might have to act, and if we feel that we must act – we have to think about the fact that there is very little action that we could take, save to do likewise. Is that a thing we want to know?

It's a little philosophical, but it's what's on my mind.

If people worried more about what they can do to improve their posts and less about people earning too much, we might find things start to level off a little. All the energy put into posting and commenting instead.

I hope when I am on the platform as long a some of these folk that my name appears in many reports too.

My name comes up a lot anyway, so when my post earning grow, will people give out about me earning too much too?

Dammit... still haven't got my name on there.

One day Paula... ONE DAY!

:D Cheers, nice work once again dear Numbersmith.

lol me too....the only way I will show up is if I do an analysis on blockchainbi

@paulag I love your analyses.

I tried to find two kinds of analyses but didn't discover it yet:

  • users, new users, activity per country (current data)
  • max, average and median payout per category per time (while this article looks at categories individually, I would love to see a category comparison)

Did you have a look into this yet?

Important.... Good

Thank you for the contribution. It has been approved.

Hi @paulag, thanks for going deeper into the May/June peaks, this is really interesting! So it seems that on one hand a factor >10 increase in Steem price made post rewards explode, while the switch to HF19 pulled them back. Did you check what was changed and if HF19 specifically addressed this effect?

You can contact us on Discord.
[utopian-moderator]

HF 19 introduced linear rewards and also reduced the power gap between whales and minnows. when this happened whales votes were worth less and minnow votes where worth more.

Hai @paulag! Good work , hopefully more successful. I like every post you have.

Cool Analysis.

GOOOD JOB PAULAG

i want to be his student @paulag, is it allowed? :D

@paulag am happy to follow you to have your lovely posts in my feed so i can upvote to get some reward, voted!

Hey @paulag I am @utopian-io. I have just upvoted you!

Achievements

  • Seems like you contribute quite often. AMAZING!

Community-Driven Witness!

I am the first and only Steem Community-Driven Witness. Participate on Discord. Lets GROW TOGETHER!

mooncryption-utopian-witness-gif

Up-vote this comment to grow my power and help Open Source contributions like this one. Want to chat? Join me on Discord https://discord.gg/Pc8HG9x

i enjoy your post! thanks

Another well calculated analysis Paulag. Thank you for doing this.

Hi, @paulag. These is an excellent statistical analysis. Keep up the good work.