Access to Data for Researchers: The Challenges of Civil Society in the Face of Digital Platforms. A Conversation with Carlos Cortés and Laura Palacio – by Juan Martín Marinangeli.

In recent years, the debate over access to data from digital platforms has become a central issue for researchers and civil society organizations. At CELE, we have been closely following this discussion which raises fundamental questions about transparency, privacy and the role of platforms. To delve deeper into this topic, we spoke with Carlos Cortés (Executive Director) and Laura Palacio (Data and Platforms Leader) of Linterna Verde, a Colombian organization that works “to strengthen civil society’s role in the digital public debate.”

As part of a research project, Linterna Verde recently gained access to Meta's data library. In this interview, Cortés and Palacioshare their insights on the (complicated) process of accessing the database and reflect on the role that Latin American civil society plays and could play in advocating for better transparency policies.

The Long Road to Data

The current landscape of access to data from digital platforms presents significant challenges for researchers. As Laura Palacio explains, even with Meta’s new data library – one of the most promising tools in this field – the access process is complex and demanding. “To submit a request, it is mandatory to present a project. However, Meta does not focus so much on the project's topic but rather on the profile of the research leader: they must have solid background in data management and analytical tools,” Palacio points out.

The mention of a “research leader” is no coincidence: as Palacio comments, “When requesting access, Meta also asks for your Facebook profile and only authorizes that specific profile. You cannot authorize other profiles. For instance, if you want to add more people to a research project, you have to submit another request and get approval again for those specific profiles.” “It would be better,” the researcher suggests, “to have general access for the organization and then decide internally who can use it, even if certain profiles were restricted.”

The process, however, does not end with the initial approval. Researchers must complete specific courses and meet various technical requirements. In the case of Linterna Verde, the entire process took between a month and a half and two months, including an initial rejection due to the absence of an Ethics Committee. “Later, we resubmitted the application, adjusted some details, and coincidentally, at that time, Meta removed that restriction, so we no longer needed the committee’s endorsement. After that, the process was much faster; within two or three weeks we were granted access,” Palacio explained.

Overall, the available tools, while valuable, have significant limitations. Meta’s library, for example, does not allow access to comments or reveal who is commenting, nor does it provide direct access to the links of Facebook posts. Additionally, updates are not in real time but weekly. As a result, “we often rely on manual searches to get what we need,” says Palacio. “Conducting searches to reach the content we need is a difficult and tedious process.” She concludes, “I feel that access to this data is not easy at all for people without advanced knowledge in data science.”

Data Interoperability: A Path Forward?

One of the most complex aspects of this landscape is the ability to combine and compare data from different platforms, what is often referred to as "interoperability.” As Laura Palacio points out, this goal is typically hindered not only by technical challenges but also by economic considerations. “Platforms need to manage large servers that require significant space and consume resources such as water and energy.” “Providing access to data represents a significant cost for platforms, which do not see the benefit of offering unrestricted access,” Palacio argues. She concludes that interoperability “will not be easy to achieve, mainly because of the immense value that data holds for major tech companies today.”

Cortés, meanwhile, reflects on how this debate fits into a broader discussion about the digital “public space.” “Our main challenge is that the infrastructures where online conversations take place are private, and that makes us vulnerable to changes imposed by the platforms.” Consequently, Cortés observes, “it will be very difficult to discuss data interoperability if the public space for conversation is controlled by private platforms.”

The Role of Civil Society: Between Demand and Action

The question of the role of civil society in accessing data from digital platforms is, according to Carlos Cortés, a complex issue framed within a broader context of discussions with digital rights organizations. “At Linterna Verde, we have been known for a fairly maximalist stance regarding our demands on tech companies.” Although actions such as open letters to figures like Zuckerberg or demands for stricter standards are valid approaches, Cortés acknowledges that, in practice, they often have limited impact.

“This starting point has led us to reflect on the process of developing access to data,” says Cortés. “Although we have explored possible alliances with American universities, we regret that, as Latin American organizations, we have not yet to establish a 'common voice' capable of making a convincing case.” Cortés believes that “it would be essential to articulate this common voice” to “frame a different conversation,” though he acknowledges the existence of complex tensions that are not easy to resolve and that put civil society in the region in a “weak position.”

“From our organization, we seek not only to access data for our own use but also to foster a conversation among organizations in the region facing the same issue,” Carlos Cortés explains. While he does not define it as an activist organization, he highlights that their work actively supports civil society in advancing relevant causes in the region.

Could regulation be a possible solution to these problems? Carlos holds a skeptical view of the current state of digital regulation in Latin America: “Sometimes I wonder what we are optimizing our efforts for.” While he acknowledges some progress in creating regulatory frameworks, he argues that these remain limited and often fail to address the primary concerns of civil society: “It is not clear what direction each country is heading in, and that leaves civil society in a very uncertain strategic position. We end up driving critical debates on platforms about standards and principles, but the resulting regulations are not helpful. That doesn’t mean, however, that we should avoid regulation altogether.” This situation forces them to focus their efforts on critical debates about principles and standards, even though these do not necessarily lead to effective regulatory changes.

Palacio notes that while some organizations seek greater access to data for research, others advocate for stronger privacy protection. “Civil society is divided into two sides,” she explains. “On one hand, there are organizations pushing for access to data to investigate, propose, and move forward. On the other hand, there is a group saying 'Don't sell my data', demanding greater privacy.”

Cortés acknowledges that “this line of work even confronts us with existential questions about our own work: what are we doing with this data and why are we seeking it?” According to Cortés, access to data presents a dual challenge for civil society: “not only obtaining this data, but also proving our ability to manage it responsibly and with clear objectives.”

Towards a Regional Agenda?

How can we generate more effective demands for access to information? Palacio points to two possible courses of action. She highlights the need to raise awareness, both within civil society and institutions, about the availability and relevance of public data: “As we become aware of what information we need, we can demand that those who hold this data—the 'owners', so to speak—grant us access. After all, this information is ours too.”

Furthermore, the researcher argues that “it is crucial to raise awareness among users and consumers, not only to demand access but also to make responsible use of our own data,” and suggests that “we must recognize that access to information needs to be managed in two ways”: “on the one hand, we need access, but we must also think about privacy: to what extent are we, as consumers and users, aware of what information should be public?”

To these observations, Cortés adds the “need to connect the various access points to this data: research data, platform data, and finally, the elements of privacy and data processing, within our Latin American context.” He concludes: “As a civil society, we need to articulate the problem clearly and work towards building a more unified voice. This would be a significant step forward.”