Is Reddit a Reliable Data Source?
- Feb 15
- 3 min read
Stephanie El Tawil
February 2026

“You love Reddit, lol” said my boss when I sent him a random link a few days ago. “I love data, specifically unsolicited feedback” I retorted before doomscrolling through some of my favorite Reddit channels like LongBeach, RecruitingHell, RemoteJobs, DataisBeautiful, etc. (don’t judge me, lol).
The link I shared was in the LongBeach community - the thread was started by a Redditor looking to move into Downtown Long Beach. She had shared a map of her prospective apartment and was asking the locals how to be prepared. Considering it was a lightly monitored online forum, the comments were surprisingly positive and most residents provided solutions to existing issues rather than harping on their negative experiences. I love perusing these types of unsolicited information hubs because 1. Who doesn’t like tea? Especially when it's not their own and 2. It feels like the most genuine and real data sources we have left. It shows me how people really think and behave, as well as what their expectations are sans prompts. I know, I know, these forums should be taken with a grain of salt - they are filled with a specific personality/population type and therefore shouldn’t be considered validated data. Arguments can be made that the people communicating on these social media platforms have unreliable motives, information sources, and therefore, insights to share. And I do always view these faceless conversations with a ere of caution and scrutiny.
However, I do believe there is value in creeping in online conversations as a data analyst. My background in Communications focused heavily on not only the methodology of data gathering, but also the ethics behind leaving the observed population undisturbed. Standard qualitative and quantitative methods require a researcher to identify themselves and their data gathering interests prior to documentation. The existence of intention and purpose on the researcher's part then puts an expectation on the participant to provide “valuable” or “credible” information in hopes of signifying a change or improvement from normalcy. But is that good data? It’s like putting a police vehicle in an area where speeding is frequent - if people know they’re being watched, they behave. It's being caught unaware and punished accordingly that truly affects a change in outlook, right? If one is ticketed and fined in an area they were not expecting to see a police officer, it is likely that incident will make them more cautious in general, not just within a specific area. The same can be true for taking surveys or answering interview questions - if the participant has an expectation about the study, their feedback is more likely to be contrived and curated to either confirm or deny those expectations.
In my opinion, this is where online forums provide a helpful, albeit, grey space in data gathering - a place where the information may not be 100% verifiable (definitely not quantitative), but it provides value by synthesizing qualitative experiences inexplicitly sourced. In short, it shows you how the general public perceives a topic without any expectations on altering the status quo. From there, specific data queries can be constructed to see if this perception is common, localized, applicable, etc. and how it helps us data analysts piece together real experiences instead of performative recollections that vaguely align with reality.
For this conversation, that LongBeach thread informed me that while public safety and cleanliness are general concerns for our Long Beach residents, the main source of frustration for our DTLB residents specifically is parking. That insight helps inform our decisions regarding traffic disruptions caused by development projects or large-scale events, cleaning and safety sweep schedules conducted by our special projects teams, and even the level of detailed communication we share regarding temporary street closures. It also helps direct advocacy efforts not only for increasing additional parking options, but also remediation of existing parking obstacles e.g., mobile food trucks lingering in unpaid spots or delivery vehicles limiting access.
What do you think? Would you use Reddit as a reliable source of information? And before you immediately reject the concept, remember, they wouldn’t let us use Wikipedia as a reference for decades and it is now one of the most recognizable open data sources we have available.


Comments