Showing 1–50 of 79 results for author: Blackburn, J

Searching in archive cs.
  1. arXiv:2505.11839  [pdf, ps, other]

    cs.AI

    On the Eligibility of LLMs for Counterfactual Reasoning: A Decompositional Study

    Authors: Shuai Yang, Qi Yang, Luoxi Tang, Jeremy Blackburn, Zhaohan Xi

    Abstract: Counterfactual reasoning has emerged as a crucial technique for generalizing the reasoning capabilities of large language models (LLMs). By generating and analyzing counterfactual scenarios, researchers can assess the adaptability and reliability of model decision-making. Although prior work has shown that LLMs often struggle with counterfactual reasoning, it remains unclear which factors most sig…

    Submitted 17 May, 2025; originally announced May 2025.

  2. arXiv:2502.10921  [pdf, other]

    cs.CL cs.SI

    Evolving Hate Speech Online: An Adaptive Framework for Detection and Mitigation

    Authors: Shiza Ali, Jeremy Blackburn, Gianluca Stringhini

    Abstract: The proliferation of social media platforms has led to an increase in the spread of hate speech, particularly targeting vulnerable communities. Unfortunately, existing methods for automatically identifying and blocking toxic language rely on pre-constructed lexicons, making them reactive rather than adaptive. As such, these approaches become less effective over time, especially when new communitie…

    Submitted 21 February, 2025; v1 submitted 15 February, 2025; originally announced February 2025.

  3. Exploring Climate Change Discourse: Measurements and Analysis of Reddit Data

    Authors: Smriti Janaswamy, Jeremy Blackburn

    Abstract: Social media is very popular for facilitating conversations about important topics and bringing forth insights and issues related to these topics. Reddit serves as a platform that fosters social interactions and hosts engaging discussions on a wide array of topics, thus forming narratives around these topics. One such topic is climate change. There are extensive discussions on Reddit about climate…

    Submitted 1 December, 2024; originally announced December 2024.

  4. arXiv:2410.22142  [pdf, other]

    cs.SI cs.CY cs.HC

    A Data-Driven Analysis of the Sovereign Citizens Movement on Telegram

    Authors: Satrio Yudhoatmojo, Utkucan Balci, Jeremy Blackburn

    Abstract: Online communities of known extremist groups like the alt-right and QAnon have been well explored in past work. However, we find that an extremist group called Sovereign Citizens is relatively unexplored despite its existence since the 1970s. Their main belief is delegitimizing the established government with a tactic called paper terrorism, clogging courts with pseudolegal claims. In recent years…

    Submitted 29 October, 2024; originally announced October 2024.

    Comments: 11 pages, 3 figures, 5 tables

    Journal ref: Workshop Proceedings of the 18th International AAAI Conference on Web and Social Media (ICWSM) -- Workshop: CySoc 2024: 5th International Workshop on Cyber Social Threats

  5. arXiv:2409.12842  [pdf, other]

    cs.RO cs.AI

    Vision Language Models Can Parse Floor Plan Maps

    Authors: David DeFazio, Hrudayangam Mehta, Jeremy Blackburn, Shiqi Zhang

    Abstract: Vision language models (VLMs) can simultaneously reason about images and texts to tackle many tasks, from visual question answering to image captioning. This paper focuses on map parsing, a novel task that is unexplored within the VLM context and particularly useful to mobile robots. Map parsing requires understanding not only the labels but also the geometric configurations of a map, i.e., what a…

    Submitted 19 September, 2024; originally announced September 2024.

  6. arXiv:2407.20987  [pdf, other]

    cs.CV cs.CY

    PIXELMOD: Improving Soft Moderation of Visual Misleading Information on Twitter

    Authors: Pujan Paudel, Chen Ling, Jeremy Blackburn, Gianluca Stringhini

    Abstract: Images are a powerful and immediate vehicle to carry misleading or outright false messages, yet identifying image-based misinformation at scale poses unique challenges. In this paper, we present PIXELMOD, a system that leverages perceptual hashes, vector databases, and optical character recognition (OCR) to efficiently identify images that are candidates to receive soft moderation labels on Twitte…

    Submitted 30 July, 2024; originally announced July 2024.

  7. arXiv:2407.18098  [pdf, other]

    cs.CY cs.SI

    Unraveling the Web of Disinformation: Exploring the Larger Context of State-Sponsored Influence Campaigns on Twitter

    Authors: Mohammad Hammas Saeed, Shiza Ali, Pujan Paudel, Jeremy Blackburn, Gianluca Stringhini

    Abstract: Social media platforms offer unprecedented opportunities for connectivity and exchange of ideas; however, they also serve as fertile grounds for the dissemination of disinformation. Over the years, there has been a rise in state-sponsored campaigns aiming to spread disinformation and sway public opinion on sensitive topics through designated accounts, known as troll accounts. Past works on detecti…

    Submitted 25 July, 2024; originally announced July 2024.

    Journal ref: International Symposium on Research in Attacks, Intrusions and Defenses (RAID 2024)

  8. arXiv:2406.14460  [pdf, other]

    cs.SI

    Podcast Outcasts: Understanding Rumble's Podcast Dynamics

    Authors: Utkucan Balci, Jay Patel, Berkan Balci, Jeremy Blackburn

    Abstract: Podcasting on Rumble, an alternative video-sharing platform, attracts controversial figures known for spreading divisive and often misleading content, which sharply contrasts with YouTube's more regulated environment. Motivated by the growing impact of podcasts on political discourse, as seen with figures like Joe Rogan and Andrew Tate, this paper explores the political biases and content strategi…

    Submitted 23 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  9. arXiv:2405.10233  [pdf, other]

    cs.SI cs.CY cs.IR

    iDRAMA-Scored-2024: A Dataset of the Scored Social Media Platform from 2020 to 2023

    Authors: Jay Patel, Pujan Paudel, Emiliano De Cristofaro, Gianluca Stringhini, Jeremy Blackburn

    Abstract: Online web communities often face bans for violating platform policies, encouraging their migration to alternative platforms. This migration, however, can result in increased toxicity and unforeseen consequences on the new platform. In recent years, researchers have collected data from many alternative platforms, indicating coordinated efforts leading to offline events, conspiracy movements, hate…

    Submitted 16 May, 2024; originally announced May 2024.

  10. arXiv:2403.09254  [pdf, other]

    cs.SI cs.CY physics.soc-ph

    Gun Culture in Fringe Social Media

    Authors: Fatemeh Tahmasbi, Aakarsha Chug, Barry Bradlyn, Jeremy Blackburn

    Abstract: The increasing frequency of mass shootings in the United States has, unfortunately, become a norm. While the issue of gun control in the US involves complex legal concerns, there are also societal issues at play. One such social issue is so-called "gun culture," i.e., a general set of beliefs and actions related to gun ownership. However, relatively little is known about gun culture, and even less…

    Submitted 18 March, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  11. arXiv:2401.13248  [pdf, other]

    cs.CY cs.SI

    "Here's Your Evidence": False Consensus in Public Twitter Discussions of COVID-19 Science

    Authors: Alexandros Efstratiou, Marina Efstratiou, Satrio Yudhoatmojo, Jeremy Blackburn, Emiliano De Cristofaro

    Abstract: The COVID-19 pandemic brought about an extraordinary rate of scientific papers on the topic that were discussed among the general public, although often in biased or misinformed ways. In this paper, we present a mixed-methods analysis aimed at examining whether public discussions were commensurate with the scientific consensus on several COVID-19 issues. We estimate scientific consensus based on s…

    Submitted 7 June, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: Accepted for publication at 27th ACM Conference on Computer Supported Cooperative Work and Social Computing (ACM CSCW 2024). Please cite accordingly

  12. arXiv:2312.08394  [pdf, other]

    cs.CR cs.CY cs.SI

    From HODL to MOON: Understanding Community Evolution, Emotional Dynamics, and Price Interplay in the Cryptocurrency Ecosystem

    Authors: Kostantinos Papadamou, Jay Patel, Jeremy Blackburn, Philipp Jovanovic, Emiliano De Cristofaro

    Abstract: This paper presents a large-scale analysis of the cryptocurrency community on Reddit, shedding light on the intricate relationship between the evolution of their activity, emotional dynamics, and price movements. We analyze over 130M posts on 122 cryptocurrency-related subreddits using temporal analysis, statistical modeling, and emotion detection. While /r/CryptoCurrency and /r/dogecoin are the m…

    Submitted 12 December, 2023; originally announced December 2023.

  13. arXiv:2308.05247  [pdf, other]

    cs.SI cs.CR

    TUBERAIDER: Attributing Coordinated Hate Attacks on YouTube Videos to their Source Communities

    Authors: Mohammad Hammas Saeed, Kostantinos Papadamou, Jeremy Blackburn, Emiliano De Cristofaro, Gianluca Stringhini

    Abstract: Alas, coordinated hate attacks, or raids, are becoming increasingly common online. In a nutshell, these are perpetrated by a group of aggressors who organize and coordinate operations on a platform (e.g., 4chan) to target victims on another community (e.g., YouTube). In this paper, we focus on attributing raids to their source community, paving the way for moderation approaches that take the conte…

    Submitted 22 June, 2024; v1 submitted 9 August, 2023; originally announced August 2023.

    Comments: Accepted for publication at the 18th International AAAI Conference on Web and Social Media (ICWSM 2024). Please cite accordingly

  14. arXiv:2307.06981  [pdf, ps, other]

    cs.SI cs.CY

    Roll in the Tanks! Measuring Left-wing Extremism on Reddit at Scale

    Authors: Utkucan Balcı, Michael Sirivianos, Jeremy Blackburn

    Abstract: Social media's role in the spread and evolution of extremism is a focus of intense study. Online extremists have been involved in the spread of online hate, mis- and disinformation, and real-world violence. However, most existing work has focused on right-wing extremism. In this paper, we perform a first-of-its-kind large-scale measurement study exploring left-wing extremism. We focus on "tankies,…

    Submitted 13 June, 2025; v1 submitted 13 July, 2023; originally announced July 2023.

  15. arXiv:2304.05874  [pdf, other]

    q-bio.NC cs.AI cs.LG cs.NE eess.SP q-bio.QM

    Adaptive Gated Graph Convolutional Network for Explainable Diagnosis of Alzheimer's Disease using EEG Data

    Authors: Dominik Klepl, Fei He, Min Wu, Daniel J. Blackburn, Ptolemaios G. Sarrigiannis

    Abstract: Graph neural network (GNN) models are increasingly being used for the classification of electroencephalography (EEG) data. However, GNN-based diagnosis of neurological disorders, such as Alzheimer's disease (AD), remains a relatively unexplored area of research. Previous studies have relied on functional connectivity methods to infer brain graph structures and used simple GNN architectures for the…

    Submitted 27 September, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

    Comments: 16 pages, 16 figures

  16. arXiv:2303.07099  [pdf, other]

    cs.CY cs.SI

    Beyond Fish and Bicycles: Exploring the Varieties of Online Women's Ideological Spaces

    Authors: Utkucan Balci, Chen Ling, Emiliano De Cristofaro, Megan Squire, Gianluca Stringhini, Jeremy Blackburn

    Abstract: The Internet has been instrumental in connecting under-represented and vulnerable groups of people. Platforms built to foster social interaction and engagement have enabled historically disenfranchised groups to have a voice. One such vulnerable group is women. In this paper, we explore the diversity in online women's ideological spaces using a multi-dimensional approach. We perform a large-scale,…

    Submitted 13 March, 2023; originally announced March 2023.

    Journal ref: Published in the Proceedings of the 15th ACM Web Science Conference 2023 (ACM WebSci 2023). Please cite the WebSci version

  17. arXiv:2303.02182  [pdf, other]

    cs.LG cs.AI

    CoRL: Environment Creation and Management Focused on System Integration

    Authors: Justin D. Merrick, Benjamin K. Heiner, Cameron Long, Brian Stieber, Steve Fierro, Vardaan Gangal, Madison Blake, Joshua Blackburn

    Abstract: Existing reinforcement learning environment libraries use monolithic environment classes, provide shallow methods for altering agent observation and action spaces, and/or are tied to a specific simulation environment. The Core Reinforcement Learning library (CoRL) is a modular, composable, and hyper-configurable environment creation tool. It allows minute control over agent observations, rewards,…

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: for code, see https://github.com/act3-ace/CoRL

  18. arXiv:2301.05777  [pdf]

    cs.LG eess.IV q-bio.TO

    Lung airway geometry as an early predictor of autism: A preliminary machine learning-based study

    Authors: Asef Islam, Anthony Ronco, Stephen M. Becker, Jeremiah Blackburn, Johannes C. Schittny, Kyoungmi Kim, Rebecca Stein-Wexler, Anthony S. Wexler

    Abstract: The goal of this study is to assess the feasibility of airway geometry as a biomarker for ASD. Chest CT images of children with a documented diagnosis of ASD as well as healthy controls were identified retrospectively. 54 scans were obtained for analysis, including 31 ASD cases and 23 age and sex-matched controls. A feature selection and classification procedure using principal component analysis…

    Submitted 9 February, 2023; v1 submitted 13 January, 2023; originally announced January 2023.

  19. arXiv:2212.05926  [pdf, other]

    cs.CR cs.CY cs.SI

    LAMBRETTA: Learning to Rank for Twitter Soft Moderation

    Authors: Pujan Paudel, Jeremy Blackburn, Emiliano De Cristofaro, Savvas Zannettou, Gianluca Stringhini

    Abstract: To curb the problem of false information, social media platforms like Twitter started adding warning labels to content discussing debunked narratives, with the goal of providing more context to their audiences. Unfortunately, these labels are not applied uniformly and leave large amounts of false content unmoderated. This paper presents LAMBRETTA, a system that automatically identifies tweets that…

    Submitted 12 December, 2022; originally announced December 2022.

    Comments: 44th IEEE Symposium on Security & Privacy (S&P 2023)

  20. arXiv:2211.14388  [pdf, other]

    cs.CY cs.SI

    Non-Polar Opposites: Analyzing the Relationship Between Echo Chambers and Hostile Intergroup Interactions on Reddit

    Authors: Alexandros Efstratiou, Jeremy Blackburn, Tristan Caulfield, Gianluca Stringhini, Savvas Zannettou, Emiliano De Cristofaro

    Abstract: Previous research has documented the existence of both online echo chambers and hostile intergroup interactions. In this paper, we explore the relationship between these two phenomena by studying the activity of 5.97M Reddit users and 421M comments posted over 13 years. We examine whether users who are more engaged in echo chambers are more hostile when they comment on other communities. We then c…

    Submitted 25 November, 2022; originally announced November 2022.

    Journal ref: 17th International AAAI Conference on Web and Social Media (ICWSM 2023). Please cite accordingly

  21. arXiv:2209.03463  [pdf, other]

    cs.CY cs.AI cs.CR cs.SI

    Why So Toxic? Measuring and Triggering Toxic Behavior in Open-Domain Chatbots

    Authors: Wai Man Si, Michael Backes, Jeremy Blackburn, Emiliano De Cristofaro, Gianluca Stringhini, Savvas Zannettou, Yang Zhang

    Abstract: Chatbots are used in many applications, e.g., automated agents, smart home assistants, interactive characters in online games, etc. Therefore, it is crucial to ensure they do not behave in undesired manners, providing offensive or toxic responses to users. This is not a trivial task as state-of-the-art chatbot models are trained on large, public datasets openly collected from the Internet. This pa…

    Submitted 9 September, 2022; v1 submitted 7 September, 2022; originally announced September 2022.

    Journal ref: Published in ACM CCS 2022. Please cite the CCS version

  22. arXiv:2204.08935  [pdf, other]

    cs.SI cs.CY

    On Xing Tian and the Perseverance of Anti-China Sentiment Online

    Authors: Xinyue Shen, Xinlei He, Michael Backes, Jeremy Blackburn, Savvas Zannettou, Yang Zhang

    Abstract: Sinophobia, anti-Chinese sentiment, has existed on the Web for a long time. The outbreak of COVID-19 and the extended quarantine has further amplified it. However, we lack a quantitative understanding of the cause of Sinophobia as well as how it evolves over time. In this paper, we conduct a large-scale longitudinal measurement of Sinophobia, between 2016 and 2021, on two mainstream and fringe Web…

    Submitted 19 April, 2022; originally announced April 2022.

    Comments: To Appear in the 16th International Conference on Web and Social Media (ICWSM), 2022

  23. arXiv:2202.08492  [pdf, other]

    cs.CY cs.CV

    Feels Bad Man: Dissecting Automated Hateful Meme Detection Through the Lens of Facebook's Challenge

    Authors: Catherine Jennifer, Fatemeh Tahmasbi, Jeremy Blackburn, Gianluca Stringhini, Savvas Zannettou, Emiliano De Cristofaro

    Abstract: Internet memes have become a dominant method of communication; at the same time, however, they are also increasingly being used to advocate extremism and foster derogatory beliefs. Nonetheless, we do not have a firm understanding as to which perceptual aspects of memes cause this phenomenon. In this work, we assess the efficacy of current state-of-the-art multimodal machine learning models toward…

    Submitted 17 February, 2022; originally announced February 2022.

  24. arXiv:2112.00443  [pdf, other]

    cs.CR cs.CY cs.SI

    TROLLMAGNIFIER: Detecting State-Sponsored Troll Accounts on Reddit

    Authors: Mohammad Hammas Saeed, Shiza Ali, Jeremy Blackburn, Emiliano De Cristofaro, Savvas Zannettou, Gianluca Stringhini

    Abstract: Growing evidence points to recurring influence campaigns on social media, often sponsored by state actors aiming to manipulate public opinion on sensitive political topics. Typically, campaigns are performed through instrumented accounts, known as troll accounts; despite their prominence, however, little work has been done to detect these accounts in the wild. In this paper, we present TROLLMAGNIF…

    Submitted 1 December, 2021; originally announced December 2021.

  25. arXiv:2111.02455  [pdf, other]

    cs.DL cs.SI

    Understanding the Use of e-Prints on Reddit and 4chan's Politically Incorrect Board

    Authors: Satrio Baskoro Yudhoatmojo, Emiliano De Cristofaro, Jeremy Blackburn

    Abstract: The dissemination and reach of scientific knowledge have increased at a blistering pace. In this context, e-Print servers have played a central role by providing scientists with a rapid and open mechanism for disseminating research without waiting for the (lengthy) peer review process. While helping the scientific community in several ways, e-Print servers also provide scientific communicators and…

    Submitted 8 March, 2023; v1 submitted 3 November, 2021; originally announced November 2021.

    Journal ref: Published in the Proceedings of the 15th ACM Web Science Conference 2023 (ACM WebSci 2023). Please cite the WebSci version

  26. arXiv:2111.02452  [pdf, other]

    cs.CY cs.CV

    Slapping Cats, Bopping Heads, and Oreo Shakes: Understanding Indicators of Virality in TikTok Short Videos

    Authors: Chen Ling, Jeremy Blackburn, Emiliano De Cristofaro, Gianluca Stringhini

    Abstract: Short videos have become one of the leading media used by younger generations to express themselves online and thus a driving force in shaping online culture. In this context, TikTok has emerged as a platform where viral videos are often posted first. In this paper, we study what elements of short videos posted on TikTok contribute to their virality. We apply a mixed-method approach to develop a c…

    Submitted 3 November, 2021; originally announced November 2021.

  27. arXiv:2111.02187  [pdf, other]

    cs.SI cs.CY

    Soros, Child Sacrifices, and 5G: Understanding the Spread of Conspiracy Theories on Web Communities

    Authors: Pujan Paudel, Jeremy Blackburn, Emiliano De Cristofaro, Savvas Zannettou, Gianluca Stringhini

    Abstract: This paper presents a multi-platform computational pipeline geared to identify social media posts discussing (known) conspiracy theories. We use 189 conspiracy claims collected by Snopes, and find 66k posts and 277k comments on Reddit, and 379k tweets discussing them. Then, we study how conspiracies are discussed on different Web communities and which ones are particularly influential in driving t…

    Submitted 3 November, 2021; originally announced November 2021.

  28. arXiv:2108.05876  [pdf, other]

    cs.CY cs.SI

    An Early Look at the Gettr Social Network

    Authors: Pujan Paudel, Jeremy Blackburn, Emiliano De Cristofaro, Savvas Zannettou, Gianluca Stringhini

    Abstract: This paper presents the first data-driven analysis of Gettr, a new social network platform launched by former US President Donald Trump's team. Among other things, we find that users on the platform heavily discuss politics, with a focus on the Trump campaign in the US and Bolsonaro's in Brazil. Activity on the platform has steadily been decreasing since its launch, although a core of verified use…

    Submitted 12 August, 2021; originally announced August 2021.

  29. arXiv:2104.11145  [pdf, other]

    cs.CY

    "I'm a Professor, which isn't usually a dangerous job": Internet-Facilitated Harassment and its Impact on Researchers

    Authors: Periwinkle Doerfler, Andrea Forte, Emiliano De Cristofaro, Gianluca Stringhini, Jeremy Blackburn, Damon McCoy

    Abstract: While the Internet has dramatically increased the exposure that research can receive, it has also facilitated harassment against scholars. To understand the impact that these attacks can have on the work of researchers, we perform a series of systematic interviews with researchers including academics, journalists, and activists, who have experienced targeted, Internet-facilitated harassment. We pr…

    Submitted 22 April, 2021; v1 submitted 22 April, 2021; originally announced April 2021.

  30. arXiv:2103.03631  [pdf, other]

    cs.CY cs.SI

    A Multi-Platform Analysis of Political News Discussion and Sharing on Web Communities

    Authors: Yuping Wang, Savvas Zannettou, Jeremy Blackburn, Barry Bradlyn, Emiliano De Cristofaro, Gianluca Stringhini

    Abstract: The news ecosystem has become increasingly complex, encompassing a wide range of sources with varying levels of trustworthiness, and with public commentary giving different spins to the same stories. In this paper, we present a multi-platform measurement of this ecosystem. We compile a list of 1,073 news websites and extract posts from four Web communities (Twitter, Reddit, 4chan, and Gab) that co…

    Submitted 5 March, 2021; originally announced March 2021.

  31. arXiv:2102.09882  [pdf, other]

    q-bio.NC cs.IT eess.SP eess.SY

    Characterising Alzheimer's Disease with EEG-based Energy Landscape Analysis

    Authors: Dominik Klepl, Fei He, Min Wu, Matteo De Marco, Daniel J. Blackburn, Ptolemaios Sarrigiannis

    Abstract: Alzheimer's disease (AD) is one of the most common neurodegenerative diseases, with around 50 million patients worldwide. Accessible and non-invasive methods of diagnosing and characterising AD are therefore urgently required. Electroencephalography (EEG) fulfils these criteria and is often used when studying AD. Several features derived from EEG were shown to predict AD with high accuracy, e.g. s…

    Submitted 13 July, 2021; v1 submitted 19 February, 2021; originally announced February 2021.

    Comments: 11 pages, 7 figures

  32. arXiv:2101.08750  [pdf, other]

    cs.CY cs.SI

    The Gospel According to Q: Understanding the QAnon Conspiracy from the Perspective of Canonical Information

    Authors: Antonis Papasavva, Max Aliapoulios, Cameron Ballard, Emiliano De Cristofaro, Gianluca Stringhini, Savvas Zannettou, Jeremy Blackburn

    Abstract: The QAnon conspiracy theory claims that a cabal of (literally) blood-thirsty politicians and media personalities are engaged in a war to destroy society. By interpreting cryptic "drops" of information from an anonymous insider calling themself Q, adherents of the conspiracy theory believe that Donald Trump is leading them in an active fight against this cabal. QAnon has been covered extensively by…

    Submitted 29 April, 2022; v1 submitted 21 January, 2021; originally announced January 2021.

    Journal ref: Published in the Proceedings of the 16th International AAAI Conference on Web and Social Media (ICWSM 2022). Please cite accordingly

  33. arXiv:2101.06535  [pdf, other]

    cs.HC cs.CY cs.SI

    Dissecting the Meme Magic: Understanding Indicators of Virality in Image Memes

    Authors: Chen Ling, Ihab AbuHilal, Jeremy Blackburn, Emiliano De Cristofaro, Savvas Zannettou, Gianluca Stringhini

    Abstract: Despite the increasingly important role played by image memes, we do not yet have a solid understanding of the elements that might make a meme go viral on social media. In this paper, we investigate what visual elements distinguish image memes that are highly viral on social media from those that do not get re-shared, across three dimensions: composition, subjects, and target audience. Drawing fro…

    Submitted 16 January, 2021; originally announced January 2021.

    Comments: To appear at the 24th ACM Conference on Computer-Supported Cooperative Work and Social Computing (CSCW 2021)

  34. arXiv:2101.03820  [pdf, other]

    cs.SI cs.CY physics.soc-ph

    An Early Look at the Parler Online Social Network

    Authors: Max Aliapoulios, Emmi Bevensee, Jeremy Blackburn, Barry Bradlyn, Emiliano De Cristofaro, Gianluca Stringhini, Savvas Zannettou

    Abstract: Parler is an "alternative" social network promoting itself as a service that allows users to "speak freely and express yourself openly, without fear of being deplatformed for your views." Because of this promise, the platform became popular among users who were suspended on mainstream social networks for violating their terms of service, as well as those fearing censorship. In particular, the service…

    Submitted 18 February, 2021; v1 submitted 11 January, 2021; originally announced January 2021.

    Journal ref: Proceedings of the International AAAI Conference on Web and Social Media, 15(1), 943--951 (2021)

  35. arXiv:2010.11638  [pdf, other]

    cs.CY cs.SI

    "It is just a flu": Assessing the Effect of Watch History on YouTube's Pseudoscientific Video Recommendations

    Authors: Kostantinos Papadamou, Savvas Zannettou, Jeremy Blackburn, Emiliano De Cristofaro, Gianluca Stringhini, Michael Sirivianos

    Abstract: The role played by YouTube's recommendation algorithm in unwittingly promoting misinformation and conspiracy theories is not entirely understood. Yet, this can have dire real-world consequences, especially when pseudoscientific content is promoted to users at critical times, such as the COVID-19 pandemic. In this paper, we set out to characterize and detect pseudoscientific misinformation on YouTu…

    Submitted 12 October, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: To appear at the 16th International Conference on Web and Social Media (ICWSM 2022). Please cite the ICWSM version

  36. Do Platform Migrations Compromise Content Moderation? Evidence from r/The_Donald and r/Incels

    Authors: Manoel Horta Ribeiro, Shagun Jhaver, Savvas Zannettou, Jeremy Blackburn, Emiliano De Cristofaro, Gianluca Stringhini, Robert West

    Abstract: When toxic online communities on mainstream platforms face moderation measures, such as bans, they may migrate to other platforms with laxer policies or set up their own dedicated websites. Previous work suggests that within mainstream platforms, community-level moderation is effective in mitigating the harm caused by the moderated communities. It is, however, unclear whether these results also ho…

    Submitted 20 August, 2021; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: This paper has been accepted at CSCW 2021, please cite accordingly

  37. arXiv:2009.11792  [pdf, other]

    cs.CY

    Understanding the Use of Fauxtography on Social Media

    Authors: Yuping Wang, Fatemeh Tahmasbi, Jeremy Blackburn, Barry Bradlyn, Emiliano De Cristofaro, David Magerman, Savvas Zannettou, Gianluca Stringhini

    Abstract: Despite the influence that image-based communication has on online discourse, the role played by images in disinformation is still not well understood. In this paper, we present the first large-scale study of fauxtography, analyzing the use of manipulated or misleading images in news discussion on online communities. First, we develop a computational pipeline geared to detect fauxtography, and ide…

    Submitted 25 September, 2020; v1 submitted 24 September, 2020; originally announced September 2020.

  38. arXiv:2009.04885  [pdf, other]

    cs.CY

    "Is it a Qoincidence?": An Exploratory Study of QAnon on Voat

    Authors: Antonis Papasavva, Jeremy Blackburn, Gianluca Stringhini, Savvas Zannettou, Emiliano De Cristofaro

    Abstract: Online fringe communities offer fertile grounds for users seeking and sharing ideas fueling suspicion of mainstream news and conspiracy theories. Among these, the QAnon conspiracy theory emerged in 2017 on 4chan, broadly supporting the idea that powerful politicians, aristocrats, and celebrities are closely engaged in a global pedophile ring. Simultaneously, governments are thought to be controlle…

    Submitted 14 February, 2021; v1 submitted 10 September, 2020; originally announced September 2020.

    Journal ref: Published in the Proceedings of 30th The Web Conference (WWW 2021). Please cite the WWW version

  39. arXiv:2009.03822  [pdf, other]

    cs.CY

    A First Look at Zoombombing

    Authors: Chen Ling, Utkucan Balcı, Jeremy Blackburn, Gianluca Stringhini

    Abstract: Online meeting tools like Zoom and Google Meet have become central to our professional, educational, and personal lives. This has opened up new opportunities for large scale harassment. In particular, a phenomenon known as zoombombing has emerged, in which aggressors join online meetings with the goal of disrupting them and harassing their participants. In this paper, we conduct the first data-dri…

    Submitted 8 September, 2020; originally announced September 2020.

    Comments: First two authors equally contributed

  40. Reading In-Between the Lines: An Analysis of Dissenter

    Authors: Erik Rye, Jeremy Blackburn, Robert Beverly

    Abstract: Efforts by content creators and social networks to enforce legal and policy-based norms, e.g. blocking hate speech and users, have driven the rise of unrestricted communication platforms. One such recent effort is Dissenter, a browser and web application that provides a conversational overlay for any web page. These conversations hide in plain sight - users of Dissenter can see and participate in t…

    Submitted 26 September, 2020; v1 submitted 3 September, 2020; originally announced September 2020.

    Comments: Accepted at IMC 2020

  41. arXiv:2004.04046  [pdf, other]

    cs.SI cs.CY

    "Go eat a bat, Chang!": On the Emergence of Sinophobic Behavior on Web Communities in the Face of COVID-19

    Authors: Fatemeh Tahmasbi, Leonard Schild, Chen Ling, Jeremy Blackburn, Gianluca Stringhini, Yang Zhang, Savvas Zannettou

    Abstract: The outbreak of the COVID-19 pandemic has changed our lives in unprecedented ways. In the face of the projected catastrophic consequences, many countries have enacted social distancing measures in an attempt to limit the spread of the virus. Under these conditions, the Web has become an indispensable medium for information acquisition, communication, and entertainment. At the same time, unfortunat…

    Submitted 3 March, 2021; v1 submitted 8 April, 2020; originally announced April 2020.

    Comments: This is the full version of the paper, with same title, appearing in the Proceedings of the 30th The Web Conference (WWW 2021). Please cite the WWW version

  42. arXiv:2001.08438  [pdf, other]

    cs.SI cs.CY

    The Pushshift Telegram Dataset

    Authors: Jason Baumgartner, Savvas Zannettou, Megan Squire, Jeremy Blackburn

    Abstract: Messaging platforms, especially those with a mobile focus, have become increasingly ubiquitous in society. These mobile messaging platforms can have deceivingly large user bases, and in addition to being a way for people to stay in touch, are often used to organize social movements, as well as a place for extremists and other ne'er-do-wells to congregate. In this paper, we present a dataset from on…

    Submitted 23 January, 2020; originally announced January 2020.

  43. arXiv:2001.08435  [pdf, other]

    cs.SI cs.CY

    The Pushshift Reddit Dataset

    Authors: Jason Baumgartner, Savvas Zannettou, Brian Keegan, Megan Squire, Jeremy Blackburn

    Abstract: Social media data has become crucial to the advancement of scientific understanding. However, even though it has become ubiquitous, just collecting large-scale social media data involves a high degree of engineering skill and computational resources. In fact, research is oftentimes gated by data engineering problems that must be overcome before analysis can proceed. This has resulted recognit…

    Submitted 23 January, 2020; originally announced January 2020.

  44. "How over is it?" Understanding the Incel Community on YouTube

    Authors: Kostantinos Papadamou, Savvas Zannettou, Jeremy Blackburn, Emiliano De Cristofaro, Gianluca Stringhini, Michael Sirivianos

    Abstract: YouTube is by far the largest host of user-generated video content worldwide. Alas, the platform has also come under fire for hosting inappropriate, toxic, and hateful content. One community that has often been linked to sharing and publishing hateful and misogynistic content are the Involuntary Celibates (Incels), a loosely defined movement ostensibly focusing on men's issues. In this paper, we s…

    Submitted 23 August, 2021; v1 submitted 22 January, 2020; originally announced January 2020.

    Comments: To appear at the 24th ACM Conference on Computer-Supported Cooperative Work and Social Computing (CSCW 2021). Please cite the CSCW version

  45. arXiv:2001.07600  [pdf, other]

    cs.CY

    The Evolution of the Manosphere Across the Web

    Authors: Manoel Horta Ribeiro, Jeremy Blackburn, Barry Bradlyn, Emiliano De Cristofaro, Gianluca Stringhini, Summer Long, Stephanie Greenberg, Savvas Zannettou

    Abstract: In this paper, we present a large-scale characterization of the Manosphere, a conglomerate of Web-based misogynist movements roughly focused on "men's issues," which has seen significant growth over the past years. We do so by gathering and analyzing 28.8M posts from 6 forums and 51 subreddits. Overall, we paint a comprehensive picture of the evolution of the Manosphere on the Web, showing the lin…

    Submitted 8 April, 2021; v1 submitted 21 January, 2020; originally announced January 2020.

    Comments: To appear at the 15th International AAAI Conference on Web and Social Media (ICWSM 2021) -- please cite accordingly

  46. arXiv:2001.07487  [pdf, other]

    cs.CY cs.SI

    Raiders of the Lost Kek: 3.5 Years of Augmented 4chan Posts from the Politically Incorrect Board

    Authors: Antonis Papasavva, Savvas Zannettou, Emiliano De Cristofaro, Gianluca Stringhini, Jeremy Blackburn

    Abstract: This paper presents a dataset with over 3.3M threads and 134.5M posts from the Politically Incorrect board (/pol/) of the imageboard forum 4chan, posted over a period of almost 3.5 years (June 2016-November 2019). To the best of our knowledge, this represents the largest publicly available 4chan dataset, providing the community with an archive of posts that have been permanently deleted from 4chan…

    Submitted 1 April, 2020; v1 submitted 21 January, 2020; originally announced January 2020.

    Journal ref: Published at the 14th International AAAI Conference on Web and Social Media (ICWSM 2020). Please cite the ICWSM version

  47. arXiv:1907.08873  [pdf, other]

    cs.SI cs.CY cs.IR

    Detecting Cyberbullying and Cyberaggression in Social Media

    Authors: Despoina Chatzakou, Ilias Leontiadis, Jeremy Blackburn, Emiliano De Cristofaro, Gianluca Stringhini, Athena Vakali, Nicolas Kourtellis

    Abstract: Cyberbullying and cyberaggression are increasingly worrisome phenomena affecting people across all demographics. More than half of young social media users worldwide have been exposed to such prolonged and/or coordinated digital harassment. Victims can experience a wide range of emotions, with negative consequences such as embarrassment, depression, isolation from other community members, which em…

    Submitted 20 July, 2019; originally announced July 2019.

    Comments: To appear in ACM Transactions on the Web (TWEB)

  48. arXiv:1906.06240  [pdf, other]

    cs.DC cs.PF

    Diffusing Your Mobile Apps: Extending In-Network Function Virtualization to Mobile Function Offloading

    Authors: Mario Almeida, Liang Wang, Jeremy Blackburn, Konstantina Papagiannaki, Jon Crowcroft

    Abstract: Motivated by the huge disparity between the limited battery capacity of user devices and the ever-growing energy demands of modern mobile apps, we propose INFv. It is the first offloading system able to cache, migrate and dynamically execute on demand functionality from mobile devices in ISP networks. It aims to bridge this gap by extending the promising NFV paradigm to mobile applications in orde…

    Submitted 14 June, 2019; originally announced June 2019.

  49. EYEORG: A Platform For Crowdsourcing Web Quality Of Experience Measurements

    Authors: Matteo Varvello, Jeremy Blackburn, David Naylor, Kostantina Papagiannaki

    Abstract: Tremendous effort has gone into the ongoing battle to make webpages load faster. This effort has culminated in new protocols (QUIC, SPDY, and HTTP/2) as well as novel content delivery mechanisms. In addition, companies like Google and SpeedCurve investigated how to measure "page load time" (PLT) in a way that captures human perception. In this paper we present Eyeorg, a platform for crowdsourcing…

    Submitted 7 February, 2019; originally announced February 2019.

    Comments: 14 pages, CONEXT2016

  50. arXiv:1901.09735  [pdf, other]

    cs.CY

    "And We Will Fight For Our Race!" A Measurement Study of Genetic Testing Conversations on Reddit and 4chan

    Authors: Alexandros Mittos, Savvas Zannettou, Jeremy Blackburn, Emiliano De Cristofaro

    Abstract: Progress in genomics has enabled the emergence of a booming market for "direct-to-consumer" genetic testing. Nowadays, companies like 23andMe and AncestryDNA provide affordable health, genealogy, and ancestry reports, and have already tested tens of millions of customers. At the same time, alt- and far-right groups have also taken an interest in genetic testing, using them to attack minorities and…

    Submitted 4 October, 2019; v1 submitted 28 January, 2019; originally announced January 2019.

    Comments: This is the full version of the paper, with same title, appearing in the 14th AAAI Conference on Web and Social Media (ICWSM 2020). Please cite the ICWSM version