Pregnancy and health in the age of the Internet: a content analysis of online “birth club” forums

Anna Wexler 1, Anahita Davoudi 2, Davy Weissenbacher 2, Rebekah Choi 1, Karen O’Connor 2, Holly Cummings 3, Graciela Gonzalez-Hernandez 2

1 Department of Medical Ethics and Health Policy, University of Pennsylvania

2 Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania

3 Department of Obstetrics and Gynecology, University of Pennsylvania


Background: Although studies report that more than 90% of pregnant women utilize digital sources to supplement their maternal healthcare, little is known about the kinds of information that women seek from their peers during pregnancy. To date, most research has used self-report measures to elucidate how and why women to turn to digital sources during pregnancy. However, given that these measures may differ from actual utilization of online health information, it is important to analyze the online content pregnant women generate.
Objective: To apply machine learning methods to analyze online pregnancy forums, to better understand how women seek information from a community of online peers during pregnancy.
Methods: Data from seven “birth club” forums (September 2018; January-June 2018) were scraped. Forum posts were collected for a one-year period, which included three trimesters and three months postpartum. Only initial posts from each thread were analyzed (n=262,238). Automatic natural language processing (NLP) methods captured 50 discussed topics, which were annotated by two independent coders and grouped categorically.
Results: The largest topic categories were maternal health (45%), baby-related topics (29%), and people/relationships (10%). While pain was a popular topic all throughout pregnancy, individual topics that were dominant by trimester included miscarriage (first trimester), labor (third trimester), and baby sleeping routine (postpartum period).
Conclusion: More than just emotional or peer support, pregnant women turn to online forums to discuss their health. Dominant topics, such as labor and miscarriage, suggest unmet informational needs in these domains. With misinformation becoming a growing public health concern, more attention must be directed toward peer-exchange outlets.

Quick Downloads

Dataset information