[SentiSteem #1] Rise and fall of Conor McGregor. Sentiment analysis of tweets from 2013 to 2017

in science •  6 years ago  (edited)

sentiment.png

Hello world! Welcome to a report where I use machine learning to analyze tweets about a specified topic and present the results as a variety of easy-to-understand charts. The sentiment analysis algorithm was developed as part of my Master's thesis in 2017/2018.

This report is currently being published exclusively here on Steemit.

text10.png

Parameters

Today's analysis was run on tweets that contain the word "McGregor" and were published between 2013-01-01 and 2017-12-31. The detailed specification of the data is shown in the following list:

  • Keyword: McGregor
  • From: 2013-01-01
  • To: 2017-12-31
  • Number of analyzed tweets: 60000
  • Language: en
  • Geographical location: Not specified

text01.png

Results

Sentiment

After downloading 60000 tweets published between the specified dates, I ran sentiment analysis on each and every one of them. The sentiment scores were then aggregated over weeks and months to lower the granularity of the results on the time axis, and plotted as the following linechart.

sentiment.png
Sentiment of tweets for keyword "McGregor"
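
The weekly and monthly aggregation described above could look roughly like the sketch below. It is not the code behind this report: the tweet dates and scores are invented, and averaging per bucket is an assumption (the heatmap captions further down do speak of average sentiment). The helper names `aggregate`, `month_key` and `week_key` are hypothetical.

```python
from collections import defaultdict
from datetime import date
from statistics import mean

# Hypothetical scored tweets: (publication date, sentiment score between 0 and 1).
scored_tweets = [
    (date(2016, 10, 3), 0.80),
    (date(2016, 10, 5), 0.70),
    (date(2016, 10, 20), 0.75),
    (date(2017, 8, 26), 0.45),
    (date(2017, 8, 27), 0.55),
]


def month_key(day):
    """Bucket a date by calendar month."""
    return (day.year, day.month)


def week_key(day):
    """Bucket a date by ISO year and ISO week number."""
    iso = day.isocalendar()
    return (iso[0], iso[1])


def aggregate(tweets, key):
    """Average the scores of all tweets that fall into the same bucket."""
    buckets = defaultdict(list)
    for day, score in tweets:
        buckets[key(day)].append(score)
    return {bucket: round(mean(scores), 2) for bucket, scores in sorted(buckets.items())}


monthly = aggregate(scored_tweets, month_key)
weekly = aggregate(scored_tweets, week_key)
print(monthly)
print(weekly)
```

Passing the bucketing rule in as a function keeps the averaging logic in one place, so switching between weeks and months is a one-argument change.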

My subjective comment on the chart: I think there's no arguing that the sentiment of the tweets is declining steadily. I personally believe it's half envy, as Conor got rich and successful, and half his trash talk, which got really offensive and over the line in the last couple of years.

Aggregation using heatmaps


A linechart works great for showing the general trend/pattern in the sentiment. We can see the bigger timeframe and estimate the long-term direction. But if you're interested in a particular month or week, it's hard, and in the case of weeks actually impossible, to see the change. Has an athlete put in a great performance in a particular match? Has the brand/company released a new product line? To see such low-level changes, the following 2 heatmaps can be used.

heatMap.png
Chart shows average sentiment per month where 0.50 is the worst and 0.78 the best achieved score
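
To give an idea of what sits behind a heatmap like this, here is a small sketch that arranges monthly averages into a year-by-month grid and looks up the extremes. Only the two extreme values (0.50 in August 2017 and 0.78 in October 2016) come from this report; the other numbers are invented for illustration, and `to_grid` is a hypothetical helper, not the plotting code used here.

```python
# Monthly averages keyed by (year, month). Only 0.78 and 0.50 are taken from
# the report; the remaining values are made up for the example.
monthly_sentiment = {
    (2016, 2): 0.74,
    (2016, 7): 0.73,
    (2016, 10): 0.78,
    (2017, 6): 0.58,
    (2017, 8): 0.50,
}


def to_grid(monthly):
    """Arrange monthly averages as one row per year and one column per month."""
    years = sorted({year for year, _ in monthly})
    return {
        year: [monthly.get((year, month)) for month in range(1, 13)]
        for year in years
    }


grid = to_grid(monthly_sentiment)
worst = min(monthly_sentiment, key=monthly_sentiment.get)
best = max(monthly_sentiment, key=monthly_sentiment.get)

# Text rendering of the grid; months without data are shown as a dot.
for year, row in grid.items():
    cells = " ".join("  . " if value is None else f"{value:.2f}" for value in row)
    print(year, cells)
print("worst:", worst, "best:", best)
```

A plotting library would then colour each cell of `grid` by its value; the grid itself is all the heatmap needs.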

My subjective comment on the chart: Oh wow, I looove this chart. Look where the lowest and highest sentiment occurred. Exactly around his 2 legendary fights. The most positive score is in October 2016, ahead of the huuuge anticipated bout between him and Eddie Alvarez. It was the first time ever that a UFC fighter could potentially hold 2 belts simultaneously. The worst sentiment can be seen in August 2017. Why? Well, he fought Floyd Mayweather and there were probably many, many tweets saying he will "lose", "get his ass kicked" etc. This chart doesn't lie ;) Another interesting pattern I've noticed is that sentiment always rises one month before his fight, e.g. in 2016 he fought in March, August and November. Sentiment peaked 3 times: in February, July and October :) What's also worth noticing is that with his boxing bout, the opposite occurred. Sentiment was bad in the months leading up to the fight, but once it happened, sentiment rose higher.

heatMapWeekly.png
Chart shows average sentiment per week where 0.50 is the worst and 0.79 the best achieved score

My subjective comment on the chart: This chart basically "zooms" in. It might be that the granularity is a bit too fine. We can clearly see an increase when the May-Mac fight was announced, and the week of his fight with Aldo for the featherweight belt also has a very high sentiment.

Most frequently used words


Another very interesting aspect to look into is the repeatedly used words, shown as wordclouds. Even more interesting is to compare two wordclouds generated from different time periods, usually before and after some event/change happened. If you give this a second thought, the problem is that many short words (like "and", "or", "with" and so on) are used in almost every sentence and would also show up in the wordclouds. To mitigate this, I've removed a list of 153 so-called stopwords. Additionally, I've also removed words typical for this area, listed at the end of the report*.

CommonWords.png

Most often used words in tweets containing word "McGregor" before and after 2015-12-12.
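
The stopword filtering and word counting behind a wordcloud can be sketched as follows. This is an illustration only: `STOPWORDS` is a tiny stand-in for the 153-word list mentioned above, `DOMAIN_WORDS` is an assumed example of the area-specific words that were removed, and the tweets are invented.

```python
import re
from collections import Counter

# Tiny stand-in for the 153-word stopword list, which is not reproduced here.
STOPWORDS = {"and", "or", "with", "the", "a", "is", "to", "be"}
# Assumed examples of area-specific words; the real list is not shown.
DOMAIN_WORDS = {"mcgregor", "conor", "ufc"}


def top_words(tweets, n=3):
    """Count the words left after dropping stopwords and domain words."""
    words = []
    for tweet in tweets:
        for word in re.findall(r"[a-z']+", tweet.lower()):
            if word not in STOPWORDS and word not in DOMAIN_WORDS:
                words.append(word)
    return Counter(words).most_common(n)


# Invented example tweets.
tweets = [
    "McGregor will be the champion",
    "Aldo and McGregor is the fight to watch",
    "Conor McGregor will beat Aldo",
]
print(top_words(tweets))
```

The resulting (word, count) pairs are what a wordcloud library would turn into font sizes.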

My subjective comment on the chart: <3 I totally love data science :D We can very nicely see that before December 2015, two names were often used alongside Conor's: Aldo and Frankie (two featherweight kings). After 2015, when Conor left the division, Nate Diaz and Floyd Mayweather are the big names. It's also interesting to see the word "will" in the "before" wordcloud. I think it comes from tweets talking about Conor as a future champion.

Most frequently used UNIQUE words

As we can see in the previous wordcloud, many words are actually shared by both wordclouds. That makes sense, as there are many topics which will be forever connected with McGregor. But I went one step further and decided to create wordclouds which contain only unique words that don't appear in the opposite wordcloud.

UniqueWords.png
Most often used UNIQUE words in tweets containing word "McGregor" before and after 2015-12-12.
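
Keeping only the words that are unique to one period boils down to a set difference on the two word counts. A minimal sketch, with invented counts and a hypothetical helper `unique_words`; whether the comparison should run against the full vocabulary of the other period or only its most frequent words is an assumption left open here.

```python
from collections import Counter

# Invented word counts for the periods before and after the split date.
before = Counter({"aldo": 40, "will": 25, "fight": 60, "love": 12})
after = Counter({"mayweather": 55, "fight": 70, "boxing": 30, "diaz": 20})


def unique_words(own, other):
    """Keep only the words that never appear in the other period's counts."""
    return Counter({word: count for word, count in own.items() if word not in other})


unique_before = unique_words(before, after)
unique_after = unique_words(after, before)
print(unique_before.most_common())
print(unique_after.most_common())
```

Shared words such as "fight" drop out of both results, which is why the unique wordclouds look more amplified than the plain ones.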

My subjective comment on the chart: This chart shows similar results to the previous one, only a bit more amplified. I can clearly see the following points:

  • The biggest change is obviously two words: boxing & suck. There's no doubt these refer to Conor's bout with Mayweather and his boxing skills.
  • Positive words such as truly, love, star and genius are very popular during Conor's rise to stardom.
  • When he really became a household name because of the Mayweather boxing match, words like ESPN and Payout appeared, as they're closely connected to boxing and his big payday for the fight.
  • The word bottle shows how huge a topic his incident with Nate Diaz was. Conor threw a bottle and a Monster can at Nate.
  • Also huge is the word refuse, which definitely comes from tweets about Conor being pulled from UFC 200 after he refused to fly over to the US for a press conference.

text10.png

Get your report - Christmas present!


Twitter sentiment analysis reports are sold for quite some dollars in the world outside of Steemit. I'll be selling these reports here for a much more reasonable price in the near future; I just want to fine-tune the algorithm a bit more. You'll get the report, numbers and charts, and you can post it all on your account! But because it's Christmas time, I've decided to give away one report! To qualify, complete these 2 conditions:

  • Comment down the parameters of report you're interested in
  • Resteem this post
  • No upvote required :)
I'll choose the winning entry based on doability of the report as Tweets can get pretty tricky and don't always make sense :D

Thanks for reading! Matko.

text10.png



Awesome :) Don't you want to add the SteemStem tag? I have no experience with it myself, but I know the local science folks put it on this type of article :D

Oh right, you told me about that.. hmm, I'm wondering which tag to replace. I probably could have swapped out science, but that one can't be changed anymore because it's the first tag :D I'll play around with it somehow, thanks :D

Dude, this is so interesting! I’d like to know the sentiment for crypto or bitcoin. :D

Hah I planned on doing "steemit" next, but I guess not many people will join, sooo there's quite a chance it'll be a bitcoin :D Any preferences regarding time intervals or geo location of tweets?

Last 4 years could be interesting. As for the geolocation, I think it would be great if we could see it too on the reports, so we know, which locations on earth are really into btc or crypto in general :D

4 years noted. I actually looked into some geolocation python plotting but then thought I don't have enough spare time for that :D Maybe in SentiSteem 1.1 hah :D For now I can just specify the geolocation of tweets as a filter before downloading them. I guess I'll take a look on it in January.

oh this is so cool! Awesome analysis, and really interesting how much you can find out with it! Have you chosen a winner for your next analysis yet? I'd be curious to see the rise(hopefully) of popularity of veganism to get an idea about whether it's being adopted by more people. Maybe since 2015...I guess the keywords would be plant based, cruelty free, vegan. If you can't do it, no worries =)

Hmmm I like the topic a lot. I haven't chosen the winner yet, but so far there's just vegan and bitcoin. Dunno, will think about it. But I kinda like vegan over bitcoin ;) If no one else comes I guess I'll let the program run with "vegan" and will send you the report on Discord so you can post it and write some comments on the charts if you find something interesting :)

Soo the 2015 will be the date of before/after? Soo the tweets will be lets say from 2013 till 2018 included?

awesome =) yes a 5 year period seems good to measure progress, but I'll let you decide =)


How many times did things like Asshat and fuckwit come up in the analysis?

His mental warfare tactics may have worked for quite a while but the last time he took it way too far and dragged religion and politics into the shit slinging. It was arguably very well researched and designed to inflict maximum psychological impact but it was below the belt none the less.

The way he conducted himself at the pre-fight press conference was downright shameful. I wanted to see him get his face punched in after that performance (also the bus incident). He brought shame to his nation and everyone associated with him.

It was good to see Khabib beat the crap out of him, I just wish they'd let him "change his face" like he planned to do.

Honestly, I have noooo idea where people see him talking about religion. As a huge Conor fan since early 2015, I knew there was a possibility that he would use religion. In that very moment I'd have completely disliked him, probably forever. But he didn't mention religion even once! It was hard, well researched and even below the belt (father, family).. but he never mentioned religion. It makes me go nuts when people keep saying he talked crap about religion when it just never happened :D and they've just followed what YT comments say :D Altho I have to say Conor made it hard to stay on his side during the buildup.. but he never mentioned religion.

Posted using Partiko Android


Maybe I was mistaken there @matkodurko. Possibly Khabib's statement when he apologised for the brawl, where he said
“I’m a human, what about he talked about my religion, my country, my father, he come to Brooklyn and broke bus, almost killed a couple of people, what about this shit”
was what skewed my memory.

Even though Conor's Instagram post where he called Khabib's dad a coward could have been taken as a slur on Khabib's religion, it could easily be argued that it wasn't a direct slur on Islam. It almost certainly was taken as an insult to his religion by Khabib, as the insult made a reference to a mosque.

“The Chechen people know what I’m talking about when I called this man a coward,” McGregor said. “When I called his father a quivering coward. Him and Kadyrov were at a mosque together, and he posts a picture of Kadyrov on his Instagram. Kadyrov’s the Chechen dictator, a crazy man, don’t get me wrong. But, Khabib’s father, lick-ass O’Hoolihan, posts a picture of Kadyrov at his mosque and the caption is, ‘Together we are stronger.’”

So anyway, I still think McGregor is an Asshat. Our kids don't need arseholes for role models. A whole generation of kids is now growing up and becoming adults with the idea that being a disrespectful, rude, arrogant prick is the best way to get on in the world because of his antics.

Yeah that one was on the border..but still not directly talking about islam..I was afraid he'd spill the whiskey on Khabib, that would suck.

Well yeah, the new post Mayweather Conor is not the best role model. But I "got to know" him during rise where altho he was cocky, his interviews about hard work, believing in yourself, law of attraction etc etc etc were just suuuperinspiring.. and trashtalk was hard but funny and mostly related to fighting skills anyway...I deffo liked that conor more..at the end of the day, this analysis shows the same.


You have a minor misspelling in the following sentence:

What's also worth noticing is that with his boxing bout, the opposite occured.
It should be occurred instead of occured.

Thxx! Long time no see btw :D

Hi matkodurko,

This post has been upvoted by the Curie community curation project and associated vote trail as exceptional content (human curated and reviewed). Have a great day :)

Visit curiesteem.com or join the Curie Discord community to learn more.


This is quite a curious analysis! I read it a couple of times, but why did you choose McGregor as the keyword?


Very cool! It's amazing how much you can analyze these days in this social media-driven world we live in, and how useful that analysis can be when you are marketing a person/product. I agree with @evecab. It would be really interesting to see the analysis on vegan in the last few years! I know a lot of different types of vegan products have exploded in just the last year or two.