Net daily sentiment ranged from -32.21 on Wednesday the 22nd to 31.85 on Monday the 20th.
The sentiment for each tweet is scored from -1 (most negative) to +1 (most positive) using VADER sentiment analysis. Net sentiment is calculated by summing the sentiment across all tweets for a given day and/or category, then normalizing the score by the number of tweets.
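The calculation described above can be sketched as follows. This is a minimal illustration, not the code behind this page: it assumes that "normalizing" means dividing the daily sum by the number of tweets, and the `scale` parameter is a hypothetical multiplier, since the exact scaling behind the reported values is not stated here. The sample scores are made up.

```python
from collections import defaultdict


def net_sentiment(scored_tweets, scale=1.0):
    """Return {day: net sentiment} from (day, score) pairs.

    Assumption: the daily sum of scores is divided by the tweet count.
    `scale` is a hypothetical multiplier, not taken from the source.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for day, score in scored_tweets:
        # Per-tweet scores are expected to lie between -1 and +1.
        if not -1.0 <= score <= 1.0:
            raise ValueError(f"score out of range: {score}")
        totals[day] += score
        counts[day] += 1
    return {day: scale * totals[day] / counts[day] for day in totals}


# Made-up scores, for illustration only.
sample = [("Mon", 0.5), ("Mon", 0.25), ("Wed", -0.75), ("Wed", -0.25)]
daily = net_sentiment(sample)
```

The same function works for categories instead of days: pass `(category, score)` pairs.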
Medical conditions from the MedDRA dictionary
Anxiety was the most frequently mentioned medical condition. The term “Sleepiness, Sleepy” had the highest overall net sentiment (42.78). Pain had the lowest net sentiment this week (-44.19).
MedDRA is a standardized medical terminology developed by the International Council for Harmonization. Cross-referencing tweets against this list is a starting point for identifying medical conditions mentioned in tweets.
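As a rough sketch of what cross-referencing might look like, the snippet below matches tweets against a tiny term list using whole-word, case-insensitive search. The `TERMS` mapping is a hypothetical stand-in for the real dictionary, and the matching rule is an assumption; the actual lookup used for this page may differ.

```python
import re

# Hypothetical mini-dictionary: term label -> surface forms to look for.
TERMS = {
    "Anxiety": ["anxiety"],
    "Sleepiness, Sleepy": ["sleepiness", "sleepy"],
    "Pain": ["pain"],
}


def find_conditions(tweet, terms=TERMS):
    """Return sorted term labels whose surface forms occur as whole words."""
    lowered = tweet.lower()
    found = set()
    for label, forms in terms.items():
        for form in forms:
            # \b keeps "pain" from matching inside "painting".
            if re.search(r"\b" + re.escape(form) + r"\b", lowered):
                found.add(label)
                break
    return sorted(found)


matches = find_conditions("So sleepy today, but the pain is gone")
```

Whole-word matching cannot tell a medical sense from an everyday one (for example "fall" as a season), which is one reason this kind of lookup is only a starting point.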
Cross-referencing the MedDRA dictionary by sentiment and topic
There were 104 tweets with a strong positive sentiment. The top 10 most frequent medical conditions mentioned within these tweets were (1) Anxiety, (2) Sleepiness, Sleepy, (3) ADHD, (4) Pain, (5) Fall, (6) Fibromyalgia, (7) Addiction, (8) Insomnia, (9) Blindness, Blind, (10) Forgetfulness. Of these terms, Insomnia (n=4); Blindness, Blind (n=3); and Forgetfulness (n=3) were not in the top 10 most frequent terms across all tweets.
There were 164 tweets with a strong negative sentiment. The top 10 most frequent medical conditions mentioned within these tweets were (1) Pain, (2) Anxiety, (3) Crying, (4) Boil, (5) ADHD, (6) Stress, (7) Forgetfulness, (8) Inflammation, (9) Numbness, (10) Overdose. Of these terms, Forgetfulness (n=4), Inflammation (n=4), Numbness (n=4), and Overdose (n=4) were not in the top 10 most frequent terms across all tweets.
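The comparison between a sentiment subset and the overall ranking can be sketched as below. The function names and sample data are illustrative assumptions, and the cut-off that defines a "strong" sentiment is not specified here, so the sketch starts from term mentions that have already been split into a subset and a full list.

```python
from collections import Counter


def top_terms(mentions, k=10):
    """Return the k most frequent terms as (term, count) pairs."""
    return Counter(mentions).most_common(k)


def new_in_subset(subset_mentions, all_mentions, k=10):
    """Terms in the subset's top k that are absent from the overall top k."""
    overall = {term for term, _ in top_terms(all_mentions, k)}
    return [(t, n) for t, n in top_terms(subset_mentions, k) if t not in overall]


# Made-up mentions, for illustration only (k=2 keeps the example small).
all_mentions = ["Pain"] * 5 + ["Anxiety"] * 4 + ["Insomnia"] * 2
positive_mentions = ["Anxiety"] * 3 + ["Insomnia"] * 2 + ["Pain"]
extra = new_in_subset(positive_mentions, all_mentions, k=2)
```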
The 25 most important words within positive tweets (compared to negative and neutral tweets) are shown in the treemap below. The size of each box represents the weighted score of each word. The word “help” within the search for “Sertraline” had the highest overall weight. When the word weights are summed for each topic, “Tryptophan” had the highest overall weight.
The 25 most important words within negative tweets (compared to positive and neutral tweets) are shown in the treemap below. The word “depress” within the search for “Fluoxetine” had the highest overall weight. When the word weights are summed for each topic, “Sertraline” had the highest overall weight within negative tweets.
This analysis evaluates the stemmed version of each word, produced with the Snowball algorithm. Stemming groups words with similar meaning, such as pain, painful, and pained, together as simply “pain”.
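The grouping effect of stemming can be illustrated with the sketch below. Note that `toy_stem` is a crude suffix stripper used only as a stand-in so the example has no dependencies; it is not the Snowball algorithm, and both function names are hypothetical.

```python
from collections import defaultdict


def toy_stem(word):
    """Crude suffix stripper, a stand-in only; NOT the Snowball algorithm."""
    for suffix in ("ful", "ed", "s"):
        # Keep at least three characters so short words are left alone.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def group_by_stem(words, stem=toy_stem):
    """Map each stem to the original words that reduce to it."""
    groups = defaultdict(list)
    for word in words:
        groups[stem(word.lower())].append(word)
    return dict(groups)


groups = group_by_stem(["pain", "painful", "pained", "help", "helps"])
```

To use a real Snowball stemmer instead, and assuming NLTK is installed, `nltk.stem.snowball.SnowballStemmer("english").stem` can be passed as the `stem` argument.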
Beta: text classification
Tweets that describe adverse events/side effects (first person point of view)
| Tweet | Search topic(s) | Medical condition(s) mentioned |
This classifier was trained on a set of manually reviewed and tagged tweets, using pretrained GloVe word embeddings as the embedding layer.
Webpage created in R version 4.1.0 (2021-05-18) and RStudio (version 1.4.1717) using the following packages: plotly, kableExtra, formattable, treemap, and wordpressr.
- C. Sievert. Interactive Web-Based Data Visualization with R, plotly, and shiny. Chapman and Hall/CRC Florida, 2020.
- Hao Zhu (2021). kableExtra: Construct Complex Table with ‘kable’ and Pipe Syntax. R package version 1.3.4.
- Kun Ren and Kenton Russell (2021). formattable: Create ‘Formattable’ Data Structures. R package version 0.2.1.
- Martijn Tennekes (2021). treemap: Treemap Visualization. R package version 2.4-3.
- Simit Patel (2021). wordpressr: An API Wrapper for WordPress Site APIs. R package version 0.1.0.
- Hutto, C.J. & Gilbert, E.E. (2014). VADER: A Parsimonious Rule-based Model for Sentiment Analysis of Social Media Text. Eighth International Conference on Weblogs and Social Media (ICWSM-14). Ann Arbor, MI, June 2014.