Data

The Platform Governance Archive (PGA) offers several datasets for research, reporting, and further exploration. You may use the data for your own purposes; we only ask that you reference the project and the respective datasets (see below for license and details).

PGA v1 (2005-2021) ↗

This is our original dataset with a curated set of policies from four major platforms, going back to 2005. The data was collected through a combination of automated and manual approaches, building on the Internet Archive’s Wayback Machine.

More Info and Documentation.

PGA v2 (2022 – …) ↗

This is our new dataset, which we update daily in cooperation with Open Terms Archive. The engine scrapes the policy pages of currently 17 platforms for updates and automatically stores snapshots and new versions, based on our curation.

More Info and Documentation.

Integrated Dataset ↗

Forthcoming…

We plan to merge and integrate the historical dataset and the ongoing data collection in the future.

Documentation

Please consult the individual pages of the datasets for documentation.

Using the Data

We are more than happy if you want to use our dataset in your research, reporting, and explorations. If you do:

  1. Consult the respective data documentation.
  2. Reference this project and the actual dataset.
  3. Send us a note so that we can include you on our research and output page.

PGA data is made available under the Open Data Commons Attribution License. In short, as stated above: use it, but reference us.

Cite the Project

Katzenbach, C., et al. (2023). The Platform Governance Archive. Centre for Media, Communication and Information Research (ZeMKI), University of Bremen. DOI: 10.17605/OSF.IO/XSBPT. URL: https://platformgovernancearchive.org.

Cite Datasets and Data Paper

Dataset PGA v1: Katzenbach, C., Kopps, A., Magalhaes, J. C., Redeker, D., Sühr, T. (2023). Platform Governance Archive (PGA) v1. [data set]. DOI: 10.17605/OSF.IO/XSBPT. URL: https://www.platformgovernancearchive.org/data/dataset-pga-v1-historical-dataset/.

Dataset PGA v2: Katzenbach, C., Dergacheva, D., Fischer, A., Kopps, A., Kolesnikov, S., Redeker, D., Viejo Otero, P. (2023). Platform Governance Archive (PGA) v2. [data set]. DOI: 10.17605/OSF.IO/XSBPT. URL: https://www.platformgovernancearchive.org/data/dataset-pga-v2-ongoing-collection/.

Data paper: Katzenbach, C., Kopps, A., Magalhaes, J. C., Redeker, D., Sühr, T., Wunderlich, L. (2023). The Platform Governance Archive v1 – A longitudinal dataset to study the governance of communication and interactions by platforms and the historical evolution of platform policies. Centre for Media, Communication and Information Research (ZeMKI), University of Bremen. https://doi.org/10.26092/elib/2331.

Cite a Single Document

Name of platform. (Date of version). Name of policy. Platform Governance Archive. Direct URL.
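If you cite many documents, the template above can be filled in programmatically. The following is a minimal sketch in Python, not an official PGA tool: the function name cite_document and its parameter names are our own illustration, and the values in the usage example are hypothetical placeholders, not real archive entries.

```python
def cite_document(platform: str, version_date: str, policy: str, url: str) -> str:
    """Fill in the single-document citation template.

    Template: Name of platform. (Date of version). Name of policy.
    Platform Governance Archive. Direct URL.
    """
    return (
        f"{platform}. ({version_date}). {policy}. "
        f"Platform Governance Archive. {url}."
    )


# Hypothetical placeholder values, for illustration only.
example = cite_document(
    platform="ExamplePlatform",
    version_date="2023-01-15",
    policy="Community Guidelines",
    url="https://example.org/policy",
)
print(example)
```

The date format is left to the caller; the template does not prescribe one.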