Echo Chambers embedded in the structure of News media websites
I have nearly always voraciously consumed the news; each day chowing down on newsletters and long-form articles and imbibing The Economist’s Daily Espresso briefing. However, recently, I have been considering what news I have been taking in, my news diet being somewhat one-note and seriously lacking in ideological diversity. (This self-reflection was actually prompted by a recent doctor’s visit, where the physician also recommended widening my food palette). I have, as a result, been trying to balance my media diet, reading everything from the Wall Street Journal and the Economist to the Washington Post and Foreign Affairs. However, as I sought out more nutritious and diverse news, I stopped and wondered why I had only been consuming a select number and ideologically narrow newspapers in the first place. I don’t regularly use social media, so it was not the curation algorithms of Facebook and Twitter. This made me wonder if the entire structure of online news media was primed to sequester people into silos, and if so what are some of the consequences. In this piece, I explore these questions.
To explore the structure of the news media ecosystem, I first needed some data. To begin, I decided to analyze the hyperlink/URL interconnections of nearly 1000 random news websites (950 to be exact). I wanted to see if more liberal sites connected more frequently with liberal sites and vice versa. If so, just being on a more liberal news site could cause users to click more often and visit more liberal sites versus conservative sites. This perhaps was a factor perpetuating my own media echo chamber.
Getting a list of news sites off of https://mediabiasfactcheck.com/, I collected these websites’ web pages from Common Crawl-widely considered the most complete publicly available source of web crawl data. Using all the web pages indexed by Common Crawl for these 950 websites since 2014, I managed to see when and how often these different news websites referenced or hyperlinked to each other.
To measure each website’s approximate partisanship level (i.e. whether the news website is conservative or liberal), I used a dataset from researchers at Northeastern University. This dataset tries to understand the partisanship of websites using the percentage of time that given websites are shared on Twitter by Democrats and Republicans. The dataset presents partisanship data for websites on a scale of -1 (liberal/Democratic-leaning) to +1 (conservative/Republican-leaning).
Pictured here is the distribution of the partisanship of each website in our 950 websites selected (-1 (liberal/Democratic-leaning) to +1 (conservative/Republican-leaning). There was an overall liberal bias, despite my selection being random, with a median partisanship level of -0.11 (Democratic-leaning) and an average of -0.12 within the 950 websites.
Pictured here are the connections amongst different news websites. Each node represents a different website while the directed edges represent whether each website links to another one. The more blue a website, the more liberal; the more red a website, the more conservative. As seen, blue websites are clumped and well connected on the left; while more conservative websites are grouped and clustered together on the right. The assortativity based on the polarization levels is 0.448.
Pictured is the percentage of shared domain connections that news websites have with our conspiracy theory websites as a function of their URLs polarization level. On average, as websites become more polarized they share more connections with conspiracy theory websites.
As seen in the above graph, as websites become more and more polarized, linking to more and more one-sided websites, they actually share more hyperlink connections with conspiracy theory websites. Simply, the deeper in the rabbit hole of polarization a website travels, the more likely it is to hyperlink to many of the same websites that conspiracy theories websites do. The echo chambers of websites corresponded with these news websites having more in common with conspiracy theory websites.
I had found that not only do news websites hyperlink/connect more with websites that have more in common with them ideologically but also as websites get more polarized in this way, they become more similar to conspiracy theory websites!