Task 6 : Classification of COVID19 tweets containing symptoms

Identifying personal mentions of COVID19 symptoms requires distinguishing personal mentions from other mentions such as symptoms reported by others and references to news articles or other sources. The classification medical symptoms from COVID-19 Twitter posts presents two key issues: First, there is plenty of discourse around news and scientific articles that describe medical symptoms. While this discourse is not related to any user in particular, it enhances the difficulty of identifying valuable user-reported information. Second, many users describe symptoms that other people experience, instead of their own, as they are usually caregivers or relatives of people presenting the symptoms. This makes the task of separating what the user is self-reporting particularly tricky, as the discourse is not only around personal experiences. 

This task is considered a three-way classification task where the target classes are:
(1) self-reports,
(2) non-personal reports, and
(3) literature/news mentions.

  • Training data: 9,567 tweets
  • Test data: 6,500 tweets

Register your team here : https://forms.gle/1qs3rdNLDxAph88n6
Link to Codalab : Available Feb 1 2021

Evaluation Metric : Micro F1-score. 

Contact information: Juan Banda (juan@jmbanda.com)