Although the abortion debate is often framed along strict proabortion or antiabortion stances (eg, prochoice versus prolife—terms common in the United States, Ireland, and other English-speaking countries; pro-elección versus provida and pro-aborto versus anti-aborto—terms used in Mexico, Argentina, and other Spanish-speaking countries), actual abortion beliefs are complex, contextual, and at times contradictory [-]. Notably, despite media characterizations of these 2 oppositional perspectives—for people ascribing to either proabortion or prochoice labels (ie, broad abortion support) or antiabortion or prolife labels (ie, broad abortion opposition)—there exist circumstances in which people’s views diverge from the dichotomy []. These circumstances include, for example, the gestation period of pregnancy [], the context for seeking abortion [], and whether people consider abortion as a legal versus moral issue []. In addition, attitudes toward abortion also vary across some demographic characteristics such as age, educational attainment, political affiliation, and race or ethnicity of a person or groups of people participating in a survey [,,].
Beyond context-specific or cultural considerations that may predict complex abortion views, personal accounts, narratives, and discussions about abortion may similarly reveal the extent to which abortion views depart from a support or opposition dichotomy, including extreme abortion circumstances or personal experience with an abortion. Evidence suggests that these considerations are not ethnocentric but shared globally. Research comparing abortion beliefs between English-speaking US residents and Spanish-speaking US residents of diverse nations of origin demonstrates that clear general differences exist in abortion beliefs. Following investigations of the abovementioned considerations, we suggest that further research may yield more precise insights into evolving views on abortion [].
Contextual, contradictory, and, in some cases, changing beliefs on abortion make it difficult to accurately assess global and US abortion climates beyond rote and dichotomized categories []. However, evidence strongly suggests the US and global populations hold views that depart from these 2 categories, reflecting abortion attitude complexity [,]. Although survey data have quantitatively supported the idea of abortion attitude complexity—qualitative data, broadly defined as any type of open-ended text, audio, visual, or language data, may add additional nuance to suggest where and how complexity may emerge. For example, interviews about abortion reveal specific circumstances that contribute to variability in people’s views on abortion [] or reveal how current events and news cycles, in turn, shape social beliefs and attitudes []. Qualitative data can also inform how people contextualize assistance-related resources such as those found on social media.
Social media posts, as a novel form of qualitative data, may similarly reveal how people view abortion and the associated complexity of belief systems at a population-level scale. Notably, the inescapable role of social media in the public lexicon has evolved over time into an outlet for community building and information dissemination that can connect users over shared interests disregarding location []. For example, the Pew Research Center contends that more than three-quarters of the US adult population regularly use at least 1 social media platform [], and half of all the users have actively maintained at least 1 account for more than a decade. Because social media data are part of the public domain, longitudinal tracking of such data can represent an open-access running diary of thoughts, perspectives, and affective indicators—particularly for issues deemed controversial or contentious, including COVID-19 vaccination status, marriage equality, transgender sports bans, and abortion [,]. Furthermore, social media data are also global, implying that shared languages, regardless of geographic constraint, can contribute to discourses about abortion and associated beliefs therein.
Research has documented that people use social media to share their opinions and views and engage in debates on various topics, as well as to seek help and information and solicit personal advice that pertains to their situation or to something they are going through in life [-]. These web-based interactions vary widely across social media platforms and topics but may include discussions about substance use disorders [], mental health [], sexual assault [], and managing HIV treatment [], among a wide range of other topics. Furthermore, some more limited research has explored social media users’ engagement and interactions as part of sharing personal experiences, soliciting help, and requesting information pertaining to abortion. This research has focused particularly on assessing how social media users rely on each other to discuss cost-related barriers to abortion care [], to discuss decision-making processes regarding abortion methods [], and to seek support to make abortion decisions when they may lack familial and medical support otherwise [].
Reddit (Reddit, Inc) is a social networking website, which is defined by its structure that allows users to subscribe to forums on diverse topics, both controversial and noncontroversial. Their approach to topic discussion is distinct from other social media platforms in that users can opt into conversations with variably different foci depending on needs and interests. For example, previous research has demonstrated that Reddit can serve as a social connection metric, information-sharing tool, and outreach resource [] for controversial or contentious social topics, including sexual assault [], abortion [], and addiction and recovery []. For most, Reddit forums are a source of information on these topics. However, many of these same topics, particularly those with political contexts, can also be discussed on different Reddit forums in more social commentary or debate-style perspectives. Abortion is one example of a contentious social topic with ranging subreddits pertaining to different aspects of abortion, including as a social connection and information-sharing tool and debate platform.
Analyzing different facets of the same topic through various subreddits could yield nuanced aspects regarding crucial health topics unique from other quantitative and qualitative abortion research. Notably, as of December 2022, Reddit was the 20th most accessed website globally (sixth in the United States), and 50% of all Reddit users reside in the United States, with Canada, Australia, and the United Kingdom comprising approximately 20% of the total Reddit users. Reddit data can principally serve as a window into views on abortion in the United States; however, because not all English language data originate in the United States, it is also possible to observe abortion attitude complexity in a more Westernized, but global context or global reactions to news related to abortion in the United States.
ObjectivesAdvances in computational data mining have made it feasible to extract, analyze, and interpret these data en masse. This study used natural language processing (NLP) and data mining methods to identify and visualize latent themes across 2 distinct subreddits specific to abortion: r/Abortion and r/AbortionDebate. As a comparative study, we aimed to compare the semantic and content differences across these subreddits to gain a comprehensive social media portrait of abortion dialogue on Reddit. This study was guided by three research questions:
What themes emerge in a corpus of Reddit posts in r/Abortion, the largest subreddit dedicated to abortion social support and outreach?What themes emerge in a corpus of Reddit posts in r/AbortionDebate?What do similarities and differences by subreddits implicate regarding social media–derived beliefs and ideologies on abortion?Data for this study were collected over 5 months (ie, from January 2020 to May 2022) from the social networking website Reddit. Reddit represents an open network of communities where users can engage and connect with others over shared interests, hobbies, or personal experiences. Unlike other popular social media websites used for computational analyses, including X (X Corp, formerly known as Twitter), Reddit is unique in that users can create specific channels to form communities with other interested parties on diverse issues or topics. These channels, otherwise known as subreddits, comprise people with shared identities who find, subscribe to, and post within these channels. For instance, people interested in gaming can join the r/Gaming subreddit and people with depression can join the r/Depression subreddit.
We leveraged the PRAW (Python Reddit Application programming interface Wrapper) [], a third-party application programming interface (API), to collect data for this study and specifically to isolate and download content posted into subreddits in English germane to abortion—we queried the API to allocate similar subreddits also spanning abortion-related topics. This query returned 1 additional subreddit: r/AbortionDebate. Given observable differences in framing (ie, people’s abortion experiences vs debates about abortion), we included this subreddit in our study as an additional but mutually distinct unit of analysis; that is, we collected and stored data for r/Abortion and r/AbortionDebate as separate corpora intended for separate analyses. All data collected for this study were in English, which we selected for 2 reasons: first, >70% of all Reddit users originated from English-speaking countries, and second, at the time of data collection, Reddit posts originating in languages other than English were insufficient for analysis. In Spanish, for example, r/Aborto contained only 5 members, with no activity since 2019; similarly, we observed <50 Spanish-language posts in either r/Abortion or r/AbortionDebate.
Once we identified our subreddits of interest, we queried the API to collect new posts and top posts from the r/Abortion and r/AbortionDebate subreddits. After filtering our data for duplicates and accounting for API data scraping limits, our final sample size comprised 4966 posts, divided into 2 corpora: 56.68% (2815/4966) of r/AbortionDebate posts and 43.31% (2151/4966) of r/Abortion posts.
AnalysesWe aimed to use NLP to identify salient categories in the r/Abortion and r/AbortionDebate subreddits. In numerous studies, latent Dirichlet allocation (LDA) topic models have been predominantly used for this purpose. LDA is a well-regarded unsupervised probabilistic model that evaluates word co-occurrence patterns using an iterative Gibbs sampling method []. Although LDA is often considered the gold standard within many academic and professional communities, advancements in NLP, artificial intelligence, and neural networks have introduced innovative topic modeling methods that can more closely approximate the potential meaning in these categories [].
For this study, we applied one such advancement, the Bidirectional Encoder Representations from Transformers (BERT) topic modeling tool, BERTopic. BERTopic is an NLP topic modeling approach used to identify latent themes or topics within a collection of interrelated documents []. Unlike LDA, which uses probabilistic modeling to identify latent topics, BERTopic leverages pretrained embeddings from one of many transformer models, a type of neural network architecture in which an input sequence is compared against large-scale language models to calculate embeddings []. Embeddings are used to convert unstructured data, including words and sentences, into fixed-length continuous vectors. These vectors enable mathematical operations to capture semantic meanings, relationships, and other properties related to natural human language.
The vectors calculated using this approach tend to be highly dimensional and difficult to interpret. To reduce dimensionality while maintaining the integrity of our data, we applied a principal component analysis, which is commonly applied in NLP approaches for general dimensionality reduction purposes []. This analysis allowed us to extract and more easily interpret a range of possible clusters or topics in both the r/Abortion and r/AbortionDebate subreddit data. Once we reduced the dimensionality of our vectors, we applied a Hierarchical Density-Based Spatial Clustering of Applications with Noise to identify latent clusters or topics [], CountVectorizer to tokenize each topic, and class term frequency–inverse document frequency to extract topic words for each cluster [].
Furthermore, to gauge the emotional tone or mood represented in each post from the studied corpora, we applied a Valence Aware Dictionary and Sentiment Reasoner (VADER), a rule-based sentiment analysis tool [], and Text2Emotion, a rule-based emotion analysis tool []. VADER sentiment analysis is an algorithm and analysis that examines the polarity of words within each social media post. Posts are fed through a lexicon or web-based dictionary, which is precoded with values for all positive and negative words in the English language. When posts are run through the VADER lexicon, they receive a composite score. Negative VADER values denote lower affect (ie, −0.99 to −0.01), and positive values denote higher affect (ie, 0.01 to 0.99). Although an older tool, VADER is commonly used to assess content affect and emotional affect. In contrast, the Text2Emotion tool for emotion analysis scans each entry for key phrases and terms denoting one of four base emotions: (1) happy, (2) surprise, (3) fear, and (4) sadness. Collectively, these 2 tools can identify potential tonal differences in each post, again implicating the different uses of each subreddit included in the analysis. Both tools have been applied extensively in computational public health studies owing to their ease of access, replicability, and numerous validation studies [,,,].
ProcedureOur workflow is depicted in . First, we queried the Reddit API to archive top and new posts from the r/Abortion and r/AbortionDebate subreddits. Data collected from the r/Abortion and r/AbortionDebate subreddits were saved as separate data files. After removing duplicate and non-English posts in either data file, we applied standard preprocessing steps to remove parts of speech that would detract from the clarity of our models, including articles, prepositions, punctuation, abbreviations, and numbers []. Once the data were cleaned, we tokenized our data at the sentence level before calculating embeddings. Once the data were preprocessed and tokenized, we proceeded with our BERTopic pipeline. First, to calculate embeddings in our data we applied all-MiniLM-L6_v2 [], a transformer-based model developed by Microsoft Corp. This model is designed to be a smaller and more efficient transformer model than larger models, including a generative pretrained transformer or T5, which may make it more appropriate for smaller data sets; however, more research is needed to confirm this notion. Once we calculated embeddings for all sentences in each corpus, we applied a principal component analysis to reduce dimensionality in our data, retaining 5 components. We then ran an iterative topic model ranging from 10 to 80 topics and calculated coherence scores [] to identify an optimal number of topics, retaining a topic solution with the highest coherence score. For both r/Abortion and r/AbortionDebate, the optimal solution was 10 topics, yielding a respective coherence score of 0.42 and 0.35, which indicates a marginal fit. After we extracted key terms per topic, we applied a sorting function to examine key terms in each entry. Each entry was then classified into one of 10 possible topics in either corpus. We lastly performed a VADER sentiment analysis and Text2Emotion emotion analysis for each entry in both corpora.
This study involved a secondary analysis of deidentified and anonymized Reddit posts collected between January and May 2022. As this was an observational study with no contact between human subjects and no possible way to trace posts to any individual author, this study was exempt from Institutional Review Board review.
Our study applied computational tools to collect and analyze subreddits specific to abortion. We aimed to examine how abortion was discussed on the social media platform Reddit, both as an information-sharing tool and as a platform for debating worldviews.
What Themes Emerged in a Corpus of Reddit Posts About r/Abortion, the Largest Subreddit Dedicated to Abortion Social Support and Outreach?Our coherence score analysis indicated a 10-topic solution for the r/Abortion subreddit. outlines each topic by keywords, the number of sentences belonging to each topic, and the percentage of each topic relative to the larger corpus. Names for each topic were derived by reviewing a small excerpt of Reddit data that were sorted into one of 10 topics by a sorting function using keywords.
The r/Abortion subreddit analysis revealed numerous ways in which abortion was discussed in a social support context. The most prominent topic of our study, topic 1: sharing support, comprised the bulk of the conversation with >18% total representation. Social support was commonly manifested by people sharing their own experiences with abortion or by friends and family members who may have experienced abortion. This was further evident by multiple topics containing information-sharing content: abortion experience (topics 3, 7, and 10). Beyond social support, several of our topics also appeared to discuss abortion in a neutral and educational information-sharing context: general abortion (topic 5) and general pregnancy (topic 8).
Further review of the topics added context to our findings. provides a summary of each topic and the key excerpts that denote additional meaning. As shown in , there was little content indicative of debate or questioning one’s position on abortion. Instead, we observed personal experience sharing, including narrative accounts of one’s experience with abortion generally, miscarriage, and medication abortion specifically.
Table 1. Topic names and key terms pertaining to r/Abortion subreddit posts (n=2151).IndexNameKey termsSentences (n=42,732), n (%)1Sharing supportwant, wanted, told, didnt, say, asked, said, talk, okay, dont7970 (18.65)2Postabortion emotionfeel, sadness, feeling, depressed, regret, depression, emotionally, guilt, grieving, emotional4502 (10.54)3Abortion experienceslept, pad, bed, hour, rest, test, waited, room, home, minute4361 (10.21)4Social supporttherapy, supportive, support, struggling, advice, experience, situation, care, family, talked4215 (9.86)5General abortionabortion, pregnancy, aborted, miscarriage, fetus, pregnant, abort, parenthood, birth, prochoice4093 (9.58)6Clinic experienceappointment, clinic, pill, prescription, sedation, patient, consultation, doctor, iud, obgyn3770 (8.82)7Abortion experienceanesthesia, sedation, anxiety, painkiller, pain, ibuprofen, relief, anxious, painful, uncomfortable3584 (8.39)8General pregnancypregnancy, pregnant, gestational, miscarriage, condom, positive, placenta, fetus, parenthood, pill3441 (8.05)9Abortion alternativespregnant, pregnancy, adoption, child, parent, selfish, baby, kid, mother, marriage3433 (8.03)10Abortion experiencecramp, crampy, cramping, bleeding, bleed, clot, ibuprofen, spotting, period, nausea3363 (7.87)Table 2. r/Abortion subreddit summaries with example excerpts.Topic nameSummaryExcerptsSharing supportThis cluster contained diverse content pertaining to lending and sharing abortion-related support.“I just wanted to say that I am thinking of you during this time”Perhaps one of the most recurring patterns in our data was frank discussions about postabortion feelings in a clinical setting, (“I felt so nauseous in that waiting room, I was not sure I could go through with it”), a postabortion setting (“It took me a few days to finally feel like myself again post-abortion”), or a medication abortion context (“The mifepristone caused some pretty intense clotting after I took the pill”). The medication abortion narratives were sometimes framed as someone explaining their decision (“I chose the pill because where I live you cannot have someone with you when getting an abortion due to COVID-19”).
What Themes Emerged in a Corpus of Reddit Posts in the r/AbortionDebate Subreddit?Our coherence score analysis indicated a 10-topic solution for the r/AbortionDebate subreddit. outlines each topic by keywords, the number of sentences belonging to each topic, and the percentage of each topic relative to the larger corpus. Names for each topic were derived by reviewing a small excerpt of Reddit data that was sorted into one of 10 topics by a sorting function based on keywords.
Unlike the r/Abortion subreddit, which we determined seemed to be used in a social support and information-sharing context, the r/AbortionDebate subreddit comprised conversations dedicated to critically assessing abortion from legal, moral, and ethical perspectives. The topic with the greatest representation was topic 1: Reddit forum rules and regulations. In topic 1, we observed several posts directly from moderators explicitly warning against outright attacks, misinformation, and vitriol targeted at people with opposing views on abortion; this topic was absent completely in the r/Abortion topic model. The second most prominent topic, topic 2: abortion morality, was centered on debating abortion from a moral perspective. The topic with the smallest representation was topic 5, pertaining to general pregnancy. At face value, we did not observe a great overlap in topic content in the r/AbortionDebate subreddit compared with the r/Abortion subreddit. However, we reviewed additional excerpts to ascribe a deeper meaning to these topics to examine precisely how abortion debates manifested on these forums.
outlines additional information about each topic, including a summary and key excerpts that implicate deeper meaning. This additional analysis allowed us to examine more precise moral, legal, and ethical arguments pertaining to people’s expressed views on abortion.
Table 3. Topic names and key terms pertaining to the r/AbortionDebate subreddit posts (n=2815).IndexTopic nameKey termsSentences (n=30,031), n (%)1Forum rulespost, rule, problem, posted, following, response, sorry, line, mod, issue5847 (19.47)2Abortion moralitymorality, debating, discussion, arguing, debate, argue, reasoning, argument, premise3253 (10.83)3Abortion legalitylaw, enforcement, violence, consent, crime, autonomy, consenting, freedom, murder, bodily3240 (10.79)4Personhoodabortion, parenthood, fetus, unborn, pregnancy, aborted, contraception, maternal, abort2839 (9.45)5General pregnancyprenatal, pregnancy, childbirth, fetus, maternal, pregnant, fetal, gestational, placenta2754 (9.17)6Abortion debateabortion, debate, fetus, prolife, argument, arguing, morality, advocate, prolifers2700 (8.99)7Humanity and lifehuman, humanity, biological, personhood, sentient, moral, consciousness, concept, brain2573 (8.57)8Euthanasiaeuthanasia, killing, death, dying, murder, lethal, suicide, kill, die, killed2325 (7.74)9Abortion alternativespregnancy, parenthood, contraception, abstinence, pregnant, contraceptive, fetus, unborn2320 (7.72)10General pregnancyfetus, foetus, unborn, fetal, conception, womb, embryo, pregnancy, childbirth, pregnant2185 (7.27)Table 4. r/AbortionDebate subreddit summaries with example excerpts.Topic nameSummaryExcerptsForum rulesThis cluster contained content from moderators reminding posters to mind rules and regulations.“As a courtesy, be respectful to others.”For example, we observed that the abortion morality topic typically contained content related to drawing lines about abortion permissibility (“Where do [people] draw the line between acceptable and not acceptable”). This style of discussion was mirrored in conversations about fetal personhood (“Who here honestly believes a zygote is a person with rights?”) and the role spirituality plays in moral arguments about abortion (“But what do Catholics really think on this issue?”). Discussions and arguments about abortion morality were notably similar to the content in topic 3: abortion legality. Content on this topic typically discussed new abortion-related laws, the merits of those laws, and opinions about their relative effectiveness (“Texas passed a very restrictive law and it will serve as a benchmark for other states, watch”). Importantly, and across topics, we observed that people declared their abortion views (“I am pro-choice and I will always be”) and, in some cases, discussed how their abortion views evolved over time (“I am pro-life, but we should be discussing the merits of abortion as a life-saving tool here”). Here, we observed more opinions than the outright support articulated in the r/Abortion subreddit.
What Do Similarities and Differences by Subreddit Implicate About Social Media–Derived Abortion Beliefs and Ideologies?visually represents data from each subreddit, where dense, overlapping clusters signify similar topics (or higher collinearity) and nonoverlapping circles indicate dissimilar topics (or lower collinearity).
In both the r/Abortion and r/AbortionDebate subreddits, the intertopic distance maps depict mutual exclusivity in general abortion and pregnancy topics, distinguished by basic sharing of language and specific information related to pregnancy and abortion (“Sometimes a pregnancy can end without warning or reason”; “abortion is a women’s health issue”). Beyond these statements, however, other conversations exhibit a richer and more nuanced discourse about abortion, overlapping between topics and offering deeper insights into an individual’s worldview on abortion, and portraying how various co-occurring factors influence one’s beliefs and worldviews (“Laws are one thing but have you considered the humanistic side of it all?”).
We used VADER and Text2Emotion tools to discern affective differences between r/Abortion and r/AbortionDebate subreddits. The r/Abortion subreddit displayed a compound VADER score of 0.10, reflecting overall neutral content, whereas the r/AbortionDebate subreddit displayed a score of −0.06, denoting neutral to slightly negative content. The emotion analysis findings for the r/Abortion subreddit were as follows: happy (mean 0.06, SD 0.19), angry (mean 0.20, SD 0.31), surprise (mean 0.12, SD 0.26), sad (mean 0.20, SD 0.31), and fear (mean 0.36, SD 0.39). The emotion analysis findings for r/AbortionDebate subreddit were as follows: happy (mean 0.12, SD 0.27), angry (mean 0.05, SD 0.18), surprise (mean 0.11, SD 0.25), sad (mean 0.22, SD 0.35), fear (mean 0.28, SD 0.35).
Furthermore, the Text2Emotion variable fear was prominent in r/Abortion, whereas happy was slightly more elevated in r/AbortionDebate. These observed differences are likely attributed to the differing nature and scope of the subreddits. For instance, the manifestation of fear may be more related to personal abortion narratives in r/Abortion, whereas happiness may arise from occasional friendly exchanges of views in r/AbortionDebate.
Despite their different foci, both subreddits contain myriad conversation topics, allowing for civil and enlightening discussions on evolving abortion views and ideologies. The discourse in these forums sometimes hints at the evolution of individual ideologies with time, reflecting the dynamic nature of personal beliefs and the influences shaping them (adapted excerpt: “I guess I just don’t know my views”; excerpt: “My opinion changed over time, growing up in a Christian household I was always against abortion...until I needed one myself”). This phenomenon underscores the essential role of such platforms in fostering understanding and dialogue on the multifaceted issue of abortion.
Our study leveraged Reddit data as a novel, big data form of qualitative data to examine abortion discourse on r/Abortion and r/AbortionDebate subreddits. We observed several important themes, including evidence of complexity in abortion-related social media posts, which warrant further discussion.
The r/Abortion Subreddit as an Information-Seeking or Information-Sharing Platform for People With Questions About Their Abortion ExperiencesWithin the r/Abortion subreddit, we noticed posters using this platform to discuss abortion in diverse, sometimes overlapping contexts. However, each topic emerging from r/Abortion typically involved a degree of information sharing, whether through the provision of available resources or sharing personal narratives and experiences with abortion. We primarily observed these types of posts in topic 1: sharing support; topic 2: postabortion emotions; topics 3, 7, and 10: abortion experience; and topic 5: clinical experience. The content within these topics typically involved direct sharing of one’s experiences related to abortion or posing highly specific questions about access (eg, excerpt: “Is abortion legal past 6 weeks gestation in Oklahoma?”) and medication abortion (eg, excerpt: “Abortion is legal here; can I get abortion pills by mail?”). Within the medication abortion topic, the content was both informative and supportive, with some posters sharing their experience in solidarity with others facing a similar choice. Notably, we did not observe any critiques against anyone’s abortion narratives; rather, the tone and structure, as also evident in this study’s VADER and emotion analysis, are largely informative and overall supportive of abortion. Given that the rules and guidelines established this subreddit as a place of nonconfrontational discussion, perhaps people advocating for other reproductive choices may have shared their perspectives in other subreddits, such as those related to adoption.
We acknowledge the possible connection between personal tendencies to share intimate information and the continually evolving role of the internet as a medium for social connection and information acquisition []. Notably, for the past 3 decades, the internet has become the most influential medium for information-seeking globally. The Pew Research Center indicates that approximately 80% of the adult population in the United States regularly use the internet to acquire general information or understand unfamiliar topics []. For example, an individual contemplating an abortion might opt to seek guidance in web-based forums to avoid potential ostracism from friends and family. Similarly, a friend or family member of someone considering an abortion might turn to web-based forums to secure advice or perspectives on assisting their loved one. Discourse on such platforms is crucial, especially when addressing sensitive topics that many may feel uneasy to discuss openly. This emphasizes the significance of the internet as a confidential and reliable resource for information and advice. Importantly, this also supports Reddit as a source of information for people needing abortion-related counseling.
These excerpts, and others in our composite sample, illustrate that social networking websites serve as a potentially crucial source of information for some [], offering insights and details that may be otherwise unavailable, including local and state resources for abortion. This finding becomes particularly salient in light of the overturning of Roe v. Wade, which marked the end of federal protections for abortion until viability []. In the wake of this decision, 24 states enacted bans with limited exceptions or additional restrictions on abortion—generally earlier in terms of weeks of gestation than previously occurring under Roe v. Wade []. For those residing in states where abortion transitioned from being broadly legal to almost entirely illegal, web-based resources may have played a pivotal role during instances of unplanned pregnancy, as observed previously []. Further research is imperative to assess the efficacy of Reddit and other social networking sites in offering support and resources on this and other health-related topics. Notably, this subreddit contained little to no expression about personal abortion beliefs and ideologies.
The r/AbortionDebate Subreddit and Discussions of Abortion Identity and Changing Views Over TimeWe did not observe much information or support sharing in the r/AbortionDebate subreddit. Rather, content in this subreddit discussed values and beliefs about abortion across many domains, including ethical, moral, legal, and humanistic. In several circumstances, we observed complex and nuanced abortion perspectives that do not correspond neatly to prochoice or prolife frameworks—2 commonly used but contested abortion identity labels used to outline personal abortion beliefs. For example, as many as half of the topics uncovered by r/AbortionDebate contained contradictory expressions regarding abortion and how the abortion debate was framed. These posts were broadly delineated as those deconstructing or debating prochoice and prolife movements and others explaining how circumstances contributed to moral and ethical shifts in abortion views, for example, in the following excerpts: “I was and will always be pro-choice, but my reaction was absolutely not [to abort a fetus with serious birth defects] even though I knew it was the right answer” and “I was pro-life and never thought I’d need Planned Parenthood until I did. My experience changed my opinion of them, but [I still wish] they didn’t primarily exist to perform abortions.” Here, the emphasis is far less on information or support sharing, rather the purpose is to articulate personal views about abortion and defend them accordingly. These findings align with ongoing abortion attitude research citing complex or nuanced abortion views that do not neatly fit into a singular label [-].
In addition to discussing and debating abortion values, we observed more combative content in the r/AbortionDebate subreddit. This is likely by design, namely to parse out people seeking information about abortion versus people looking to debate abortion []. Such differences between the r/Abortion and r/AbortionDebate subreddits were particularly evident in our sentiment and emotion analyses. For example, r/AbortionDebate yielded slightly more negative VADER affect scores and decreased emotion analysis scores for fear. We attributed more negative VADER scores to the often contentious exchanges among users (excerpt: “All these pro-choicers in here trying to lump as all as anti-women bigots”). We attributed lower fear scores to the apparent use of r/AbortionDebate as a forum to discuss abortion views and not for sharing information or narrative accounts about abortion. In other words, negative language was reflected via discourse in the r/AbortionDebate subreddit, as opposed to expressing personal fears or concerns about abortion, which may have surfaced more in the r/Abortion subreddit. In this context, the r/AbortionDebate subreddit may be more useful for mining insights into abortion ideologies, particularly when examining precise factors about abortion, including moral and legal arguments, gestational limits, and others. However, to gain insights into how abortion, as a medical procedure, is communicated from a decision-making perspective, r/Abortion may be more informative.
We identified 2 main implications from the content differences observed in r/Abortion and r/AbortionDebate. First, opting for the right Reddit forum is critically important. Reddit’s structure—where users select forums based on interests or needs—is different from other social networking sites. For people looking for ideally accurate, impartial information about abortion, r/Abortion or similar subreddits are suitable. Meanwhile, r/AbortionDebate is better for those wanting to discuss and ponder the ethical aspects of abortion. However, this choice is dependent on knowing how Reddit works. We project that a significant proportion of people may join the wrong forum and get exposed to unintended outcomes and viewpoints owing to a lack of preexisting knowledge about Reddit and its operations. Second, our observations support the idea that Reddit’s higher moderation levels make it a valuable tool for social science research. Historically, Reddit has carried the reputation of fostering trolls and hate speech. However, for health content, subreddits tend to be more effectively moderated by content experts. As evidenced in our data, both subreddits seemed relatively free from hate speech and trolling because of this moderation, which is unique to Reddit compared with other social media platforms. Therefore, Reddit remains a fairly reliable platform for both users and researchers, especially in the wake of recent changes in APIs and data access on other platforms, including X (X Corp, formerly known as Twitter).
Social Media as a Resource and Triangulation Tool to Support Ongoing Quantitative and Qualitative Research on AbortionOur findings, particularly those critiquing abortion identity labels or people explaining their contextual abortion beliefs, support extant research demonstrating that people’s attitudes toward abortion are complex. Notably, this larger body of research argues that abortion attitudes are not unidimensional or polar but rather vary along legal, moral, social, and other similar domains [,,,]. This work is composed of both quantitative (surveys) and qualitative (interviews) data collections, which collectively yield deep insights into social attitude formation in the United States and how beliefs vary based on context and other dimensions. Consistent with these studies, our results support the notion that abortion attitudes and abortion decision-making are not unidimensional but involve multiple co-occurring considerations.
The novel nature of social media as data adds additional validity to previous abortion attitude research. This is particularly salient regarding how our findings triangulate or corroborate previous research on abortion attitude complexity. Notably, by mining Reddit abortion forums, we observed at least two principal uses of these forums: (1) as a space to share narratives and resources about abortion and (2) as a dedicated channel to debate abortion views. For many, Reddit forums could be a place where some people feel comfortable sharing or debating abortion views, although we acknowledge that more research on this area is needed. Furthermore, Reddit offers a somewhat anonymous space where people can gather the information they need about abortion or inform their perspectives on abortion. These shared Reddit perspectives, which are generally top of the mind, spontaneous, and unprompted [], may provide a window into collective abortion beliefs that support or refute previous findings from other conventional forms of data collection. Similar uses of social media data, namely to corroborate findings on social issues, including gun control [], marriage equality [], and vaccination mandates [], have been similarly leveraged. Therefore, we argue that social media can be a valuable source of data to help elucidate people’s opinions on relevant social issues.
Furthermore, we argue that national surveys, strategic qualitative interviews, and mass social media scrapes as data sources yield specific outcomes that, when combined, provide a robust and comprehensive portrait of social issues. Survey data, which are strengthened when participants are identified via probability-based sampling protocols [], reveal nationwide associations between demographic variables and other variables of interest. Qualitative data can reveal insights into highly specific research questions, for example, whether changing auxiliary verbs leads to diverging responses about abortion beliefs []. Social media data scrapes can offer population-level insights that support or contradict findings from previous studies at the population-level scope and scale []. Our Reddit data support previous findings from surveys and qualitative research, demonstrating how social media data can serve as a triangulation tool. We contend that further strategic applications of social media mining with traditional quantitative and qualitative research can provide highly accurate portrayals of social views in the United States.
Limitations and Future ResearchThis study has several limitations that we hope to address in future research. First, although Reddit posts can be construed as qualitative data, we did not perform a formal qualitative analysis using these data. Owing to the scope of this study, we instead leveraged NLP algorithms to categorize and visualize all data simultaneously. In the future, researchers could perform detailed qualitative inquiries with these data, which can occur with the entire data set or among one or several clusters depending on the scope and research questions. Second, our study was limited to exploratory analyses. Although more refined algorithms could more effectively annotate and classify our data, we believed that these approaches would better serve as a follow-up to our exploratory approach to mining Reddit data. Future studies should consider using our data for more refined machine learning–driven or artificial intelligence-driven tasks. Finally, our study was limited by its relatively small timeframe (5 months). It is likely that collecting data for an even longer period may have yielded more nuanced findings.
ConclusionsWith the decision in Dobbs v. Jackson Women’s Health Organization overturning Roe v. Wade, there is renewed attention to abortion as a contentious political and social issue. Despite abortion being an exceedingly complex topic, political debate and discussions about abortion are generally framed dichotomously as a support or opposition, or prolife or prochoice issue. However, extensive research indicates that public opinion about abortion does not ascribe neatly to that dichotomy and that circumstances beyond a person’s control may lead to shifts in views of abortion over time. Our research corroborates such findings that detail the myriad ways in which abortion attitudes are complex and contextual, beyond simple information-seeking. Furthermore, our findings provide evidence that social media data can be a helpful triangulation tool for public opinion survey research.
The data are currently stored in a secure GitHub repository and are available for further analysis upon request.
None declared.
Edited by A Mavragani; submitted 18.03.23; peer-reviewed by L Jacques, T Zhang; comments to author 27.07.23; revised version received 27.09.23; accepted 20.12.23; published 14.02.24.
©Danny Valdez, Lucrecia Mena-Meléndez, Brandon L Crawford, Kristen N Jozkowski. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 14.02.2024.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.
Comments (0)