Detecting Personal Medication Intake in Twitter: An Annotated Corpus and Baseline Classification System

Klein A, Sarker A, Rouhizadeh M, O’Connor K, Gonzalez G. Detecting personal medication intake in Twitter: an annotated corpus and baseline classification system. In BioNLP 2017, 2017 Aug (pp. 136-142).

Social media sites (e.g., Twitter) have been used for surveillance of drug safety at the population level, but studies that focus on the effects of medications on specific sets of individuals have had to rely on other sources of data. Mining social media data for this information would require the ability to distinguish indications of personal medication intake in these media. Toward that end, this paper presents an annotated corpus that can be used to train machine learning systems to determine whether a tweet that mentions a medication indicates that the individual posting has taken that medication (at a specific time). To demonstrate the utility of the corpus as a training set, we present baseline results of supervised classification.
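
To make the classification task concrete, the sketch below trains a small bag-of-words Naive Bayes classifier to separate tweets that report taking a medication from tweets that merely mention one. It is an illustration only and not the paper's baseline system: the choice of Naive Bayes, the two label names (`intake`, `no_intake`), the helper functions, and the six example tweets are all assumptions made up for this sketch, and none of the text comes from the annotated corpus.

```python
import math
import re
from collections import Counter, defaultdict

# Invented toy examples, NOT drawn from the annotated corpus.
# The two label names are hypothetical placeholders.
TRAIN = [
    ("just took two advil for this headache", "intake"),
    ("took my xanax an hour ago and feel calmer", "intake"),
    ("i took tylenol this morning", "intake"),
    ("my doctor says advil can upset your stomach", "no_intake"),
    ("is xanax safe during pregnancy", "no_intake"),
    ("new study on tylenol and liver damage", "no_intake"),
]


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z']+", text.lower())


def train(examples):
    """Count tokens per label for a multinomial Naive Bayes model."""
    word_counts = defaultdict(Counter)
    label_counts = Counter()
    vocab = set()
    for text, label in examples:
        tokens = tokenize(text)
        word_counts[label].update(tokens)
        label_counts[label] += 1
        vocab.update(tokens)
    return word_counts, label_counts, vocab


def predict(model, text):
    """Return the label with the highest log-probability for the text."""
    word_counts, label_counts, vocab = model
    total = sum(label_counts.values())
    best_label, best_score = None, float("-inf")
    for label in sorted(label_counts):
        # Log prior for the label.
        score = math.log(label_counts[label] / total)
        # Add-one (Laplace) smoothing over the training vocabulary.
        denom = sum(word_counts[label].values()) + len(vocab)
        for tok in tokenize(text):
            if tok in vocab:  # tokens unseen in training are skipped
                score += math.log((word_counts[label][tok] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


model = train(TRAIN)
print(predict(model, "i took advil an hour ago"))
print(predict(model, "is tylenol safe"))
```

On this toy data, first-person past-tense cues such as "took" push a tweet toward the hypothetical `intake` label, while question and reporting language pushes it toward `no_intake`. A real system trained on the corpus would need far more data and richer features than six invented sentences.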