Jos Berens, Stefaan Verhulst
An analysis of terms and conditions present in a diversity of data-driven prizes and challenges to better understand governance frameworks of data sharing practices.
Published in The GovLab in 2015
Blog Post
Governance and Operations
Published in The GovLab in 2015
World Economic Forum
An overview report from the World Economic Forum on the existing data deficit and the value and impact of big data for sustainable development.
Published in 2015
Report
Benefits
Incentives
Governance and Operations
Published in 2015
This report captures an overview of the existing data deficit and the value and impact of big data for sustainable development.
The authors of the report focus on four main priorities towards a sustainable data revolution: commercial incentives and trusted agreements with public- and private-sector actors; the development of shared policy frameworks, legal protections and impact assessments; capacity building activities at the institutional, community, local and individual level; and lastly, recognizing individuals as both produces and consumers of data.
Frederika Welle Donker, Bastiaan van Loenen, Arnold K. Bregt
A case study examining the opening of private data by Dutch energy network administrator Liander.
Published in International Journal of Geo-Information in 2016
Case Study
In Practice
Governance and Operations
Published in International Journal of Geo-Information in 2016
This research has developed a monitoring framework to assess the effects of open (private) data using a case study of a Dutch energy network administrator Liander.
Focusing on the potential impacts of open private energy data – beyond ‘smart disclosure’ where citizens are given information only about their own energy usage – the authors identify three attainable strategic goals:
The authors propose a seven-step framework for assessing the impacts of Liander data, in particular, and open private data more generally:
While the authors note that the true impacts of this open private data will likely not come into view in the short term, they argue that, “Liander has successfully demonstrated that private energy companies can release open data, and has successfully championed the other Dutch network administrators to follow suit.”
Nicholas Vogel, Christopher Theisen, Jonathan P. Leidig, Jerry Scripps, Douglas H. Graham, Greg Wolffe
A paper on the use of mobile call records to enable predictive action around Ebola diffusion.
Published in Paper presented at the Procedia Computer Science in 2015
Paper
Benefits
In Practice
Published in Paper presented at the Procedia Computer Science in 2015
The paper presents a research study conducted on the basis of the mobile calls records shared with researchers in the framework of the Data for Development Challenge by the mobile operator Orange.
The study discusses the data analysis approach in relation to developing a situation of Ebola diffusion built around “the interactions of multi-scale models, including viral loads (at the cellular level), disease progression (at the individual person level), disease propagation (at the workplace and family level), societal changes in migration and travel movements (at the population level), and mitigating interventions (at the abstract government policy level).”
The authors argue that the use of their population, mobility, and simulation models provide more accurate simulation details in comparison to high-level analytical predictions and that the D4D mobile datasets provide high-resolution information useful for modeling developing regions and hard to reach locations.
Stefaan Verhulst, Iryna Susha, Alexander Kostura
A report describing emerging practice, opportunities and challenges in data collaboratives as identified at the International Data Responsibility Conference.
Published in 2016
Essay
General
Benefits
Incentives
In Practice
Published in 2016
This piece articulates a set of key lessons learned during a session at the International Data Responsibility Conference focused on identifying emerging practices, opportunities and challenges confronting data collaboratives.
The authors list a number of privately held data sources that could create positive public impacts if made more accessible in a collaborative manner, including:
To the end of identifying and expanding on emerging practice in the space, the authors describe a number of current data collaborative experiments, including:
In order to capitalize on the opportunities provided by data collaboratives, a number of needs were identified:
Elizabeth Stuart, Emma Samman, William Avis, Tom Berliner
The Overseas Development Institute’s annual report focused on solutions toward a sustainable data revolution.
Published in 2015
Report
Benefits
Risks and Challenges
Data Responsibility
In Practice
Governance and Operations
Published in 2015
The authors of this Overseas Development Institute report highlight the need for good quality, relevant, accessible and timely data for governments to extend services into underrepresented communities and implement policies towards a sustainable “data revolution.”
The report explores solutions focused on capacity-building activities of national statistical offices (NSOs), alternative sources of data (including shared corporate data) to address gaps, and building strong data management systems.
Stefaan Verhulst, David Sangokoya
An essay on leveraging the potential of data to solve complex public problems through data collaboratives and four critical accelerators towards responsible data sharing and collaboration.
Published in 2015
Essay
General
Benefits
Incentives
In Practice
Governance and Operations
Published in 2015
The essay refers to data collaboratives as a new form of collaboration involving participants from different sectors exchanging data to help solve public problems. These forms of collaborations can improve people’s lives through data-driven decision-making; information exchange and coordination; and shared standards and frameworks for multi-actor, multi-sector participation.
The essay cites four activities that are critical to accelerating data collaboratives: documenting value and measuring impact; matching public demand and corporate supply of data in a trusted way; training and convening data providers and users; experimenting and scaling existing initiatives.
Stefaan Verhulst, David Sangokoya
This essay describes an emerging taxonomy of activities involving corporate data sharing for public good, an emerging trend in which companies share anonymized and aggregated data with third-party users towards data-driven policymaking and greater public good.
Published in Internet Monitor 2014: Reflections on the Digital World: Platforms, Policy, Privacy, and Public Discourse in 2014
Paper
General
Benefits
In Practice
Governance and Operations
Published in Internet Monitor 2014: Reflections on the Digital World: Platforms, Policy, Privacy, and Public Discourse in 2014
This essay, included in the Harvard Berkman Center’s 2014 Internet Monitor, describes a taxonomy of current corporate data sharing practices for public good: research partnerships; prizes and challenges; trusted intermediaries; application programming interfaces (APIs); intelligence products; and corporate data cooperatives or pooling.
Examples of data collaboratives discussed in the piece include: Yelp Dataset Challenge, the Digital Ecologies Research Partnerhsip, BBVA Innova Challenge, Telecom Italia’s Big Data Challenge, NIH’s Accelerating Medicines Partnership and the White House’s Climate Data Partnerships.
The authors highlight important questions to consider towards a more comprehensive mapping of these activities.
Willem G van Panhuis, Proma Paul, Claudia Emerson, John Grefenstette, Richard Wilder, Abraham J Herbst, David Heymann, Donald S Burke
A literature review of potential barriers to public health data sharing.
Published in BMC Public Health in 2014
Journal Article
Risks and Challenges
Data Responsibility
Published in BMC Public Health in 2014
The authors of this report provide a “systematic literature of potential barriers to public health data sharing.” These twenty potential barriers are classified in six categories: “technical, motivational, economic, political, legal and ethical.” In this taxonomy, “the first three categories are deeply rooted in well-known challenges of health information systems for which structural solutions have yet to be found; the last three have solutions that lie in an international dialogue aimed at generating consensus on policies and instruments for data sharing.”
The authors suggest the need for a “systematic framework of barriers to data sharing in public health” in order to accelerate access and use of data for public good.
Linnet Taylor, Ralph Schroeder
A paper describing how data, such as privately held mobile phone data – could improve development policy.
Published in GeoJournal in 2014
Journal Article
General
Benefits
Risks and Challenges
Published in GeoJournal in 2014
This journal article describes how privately held data – namely “digital traces” of consumer activity – “are becoming seen by policymakers and researchers as a potential solution to the lack of reliable statistical data on lower-income countries.”
They focus especially on three categories of data collaborative use cases:
They note, however, that a number of challenges and drawbacks exist for these types of use cases, including:
Nicholas Robin, Thilo Klein, Johannes Jütting
A working paper describing how privately held data sources could fill current gaps in the efforts of National Statistics Offices.
Published in OECD Development Co-operation Working Papers in 2016
Paper
General
Benefits
Risks and Challenges
Data Responsibility
In Practice
Governance and Operations
Published in OECD Development Co-operation Working Papers in 2016
This working paper acknowledges the growing body of work on how different types of data (e.g, telecom data, social media, sensors and geospatial data, etc.) can address data gaps relevant to National Statistical Offices (NSOs).
Four models of public-private interaction for statistics are describe: in-house production of statistics by a data-provider for a national statistics office (NSO), transfer of data-sets to NSOs from private entities, transfer of data to a third party provider to manage the NSO and private entity data, and the outsourcing of NSO functions.
The paper highlights challenges to public-private partnerships involving data (e.g., technical challenges, data confidentiality, risks, limited incentives for participation), suggests deliberate and highly structured approaches to public-private partnerships involving data require enforceable contracts, emphasizes the trade-off between data specificity and accessibility of such data, and the importance of pricing mechanisms that reflect the capacity and capability of national statistic offices.
Case studies referenced in the paper include:
Markus Perkmann, Henri Schildt
A paper highlighting the advantages of third-party organizations enabling data sharing between industry and academia to uncover new insights to benefit the public good.
Published in Research Policy in 2015
Journal Article
General
Incentives
Risks and Challenges
Governance and Operations
Published in Research Policy in 2015
This paper discusses the concept of a “boundary organization” in relation to industry-academic partnerships driven by data. Boundary organizations perform mediated revealing, allowing firms to disclose their research problems to a broad audience of innovators and simultaneously minimize the risk that this information would be adversely used by competitors.
The authors identify two especially important challenges for private firms to enter open data or participate in data collaboratives with the academic research community that could be addressed through more involvement from boundary organizations:
David Pastor-Escuredo, Alfredo Morales-Guzám, Yolanda Torres-Fernández, Jean-Martin Bauer, Amit Wadhwa, Carlos Castro-Correa, Liudmyla Romanoff, Jong Gun Lee, Alex Rutherford, Vanessa Frias-Martinez, Nuria Oliver, Enrique Frias-Martinez, Miguel Luengo-Oroz
An analysis of aggregated and anonymized call details records (CDR) conducted in collaboration with the UN, Government of Mexico, academia and Telefonica suggests high potential in using shared telecom data to improve early warning and emergency management mechanisms.
Published in Proceedings of the IEEE Global Humanitarian Technology Conference (GHTC) in 2014
Paper
Benefits
Incentives
In Practice
Published in Proceedings of the IEEE Global Humanitarian Technology Conference (GHTC) in 2014
This report describes the impact of using mobile data in order to understand the impact of disasters and improve disaster management. The report was conducted in the Mexican state of Tabasco in 2009 as a multidisciplinary, multi-stakeholder consortium involving the UN World Food Programme (WFP), Telefonica Research, Technical University of Madrid (UPM), Digital Strategy Coordination Office of the President of Mexico, and UN Global Pulse.
Telefonica Research, a division of the major Latin American telecommunications company, provided call detail records covering flood-affected areas for nine months. This data was combined with “remote sensing data (satellite images), rainfall data, census and civil protection data.” The results of the data demonstrated that “analysing mobile activity during floods could be used to potentially locate damaged areas, efficiently assess needs and allocate resources (for example, sending supplies to affected areas).”
In addition to the results, the study highlighted “the value of a public-private partnership on using mobile data to accurately indicate flooding impacts in Tabasco, thus improving early warning and crisis management.”
Gideon Mann
The transcript of a keynote talk on the potential of leveraging corporate data to help solve public problems.
Published in 2016
Blog Post
General
Benefits
Risks and Challenges
Data Responsibility
Governance and Operations
Published in 2016
This Medium post from Gideon Mann, the Head of Data Science at Bloomberg, shares his prepared remarks given at a lecture at the City College of New York. Mann argues for the potential benefits of increasing access to private sector data, both to improve research and academic inquiry and also to help solve practical, real-world problems. He also describes a number of initiatives underway at Bloomberg along these lines.
Mann argues that data generated at private companies “could enable amazing discoveries and research,” but is often inaccessible to those who could put it to those uses. Beyond research, he notes that corporate data could, for instance, benefit:
Mann recognizes the privacy challenges inherent in private sector data sharing, but argues that it is a common misconception that the only two choices are “complete privacy or complete disclosure.” He believes that flexible frameworks for differential privacy could open up new opportunities for responsibly leveraging data collaboratives.
Sharona Hoffman, Andy Podgurski
A journal article primarily focused on the risks involved in health data pooling.
Published in American Journal of Law & Medicine in 2013
Journal Article
Risks and Challenges
Data Responsibility
Published in American Journal of Law & Medicine in 2013
This journal article explores the benefits and, in particular, the risks related to large-scale biomedical databases bringing together health information from a diversity of sources across sectors. Some data collaboratives examined in the piece include:
Hoffman and Podgurski note that biomedical databases populated have many potential uses, with those likely to benefit including: “researchers, regulators, public health officials, commercial entities, lawyers,” as well as “healthcare providers who conduct quality assessment and improvement activities,” regulatory monitoring entities like the FDA, and “litigants in tort cases to develop evidence concerning causation and harm.”
They argue, however, that risks arise based on:
Harlan M. Krumholz, Cary P. Gross, Katrina L. Blount, Jessica D. Ritchie, Beth Hodshon, Richard Lehman, Joseph S. Ross
A review of industry-led efforts and cross-sector collaborations to share data from clinical trials to inform clinical practice.
Published in Circulation: Cardiovascular Quality and Outcomes in 2015
Journal Article
Incentives
In Practice
Published in Circulation: Cardiovascular Quality and Outcomes in 2015
This article provides a comprehensive overview of industry-led efforts and cross-sector collaborations in data sharing by pharmaceutical companies to inform clinical practice.
The article details the types of data being shared and the early activities of GlaxoSmithKline (“in coordination with other companies such as Roche and ViiV”); Medtronic and the Yale University Open Data Access Project; and Janssen Pharmaceuticals (Johnson & Johnson). The article also describes the range of involvement in data sharing among pharmaceutical companies including Pfizer, Novartis, Bayer, AbbVie, Eli Llly, AstraZeneca, and Bristol-Myers Squibb.
Silja M. Eckartz, Wout J. Hofman, Anne Fleur Van Veenstra
A paper proposing a decision model for data sharing arrangements aimed at addressing identified risks and challenges.
Published in International Conference on Electronic Government in 2014
Report
Risks and Challenges
Data Responsibility
Published in International Conference on Electronic Government in 2014
This paper proposes a decision model for data sharing of public and private data based on literature review and three case studies in the logistics sector.
The authors identify five categories of the barriers to data sharing and offer a decision model for identifying potential interventions to overcome each barrier:
Cameron F. Kerry, Jake Kendall, Yves-Alexandre de Montjoye
An issues paper from the Brookings Institution on leveraging the benefits of mobile phone data for humanitarian use while minimizing risks to privacy.
Published in 2016
Report
Benefits
In Practice
Governance and Operations
Published in 2016
Using Ebola as a case study, the authors describe the value of using private telecom data for uncovering “valuable insights into understanding the spread of infectious diseases as well as strategies into micro-target outreach and driving update of health-seeking behavior.”
The authors highlight the absence of a common legal and standards framework for “sharing mobile phone data in privacy-conscientious ways” and recommend “engaging companies, NGOs, researchers, privacy experts, and governments to agree on a set of best practices for new privacy-conscientious metadata sharing models.”
Matthew Brack, Tito Castillo
A Chatham House report describing the need for data sharing and collaboration for global public health emergencies and potential lessons learned from the commercial sector.
Published in Chatham House in 2015
Report
Benefits
Risks and Challenges
Governance and Operations
Published in Chatham House in 2015
This Chatham House report provides an overview on public health surveillance data sharing, highlighting the benefits and challenges of shared health data and the complexity in adapting technical solutions from other sectors for public health.
The report describes data sharing processes from several perspectives, including in-depth case studies of actual data sharing in practice at the individual, organizational and sector levels. Among the key lessons for public health data sharing, the report strongly highlights the need to harness momentum for action and maintain collaborative engagement: “Successful data sharing communities are highly collaborative. Collaboration holds the key to producing and abiding by community standards, and building and maintaining productive networks, and is by definition the essence of data sharing itself. Time should be invested in establishing and sustaining collaboration with all stakeholders concerned with public health surveillance data sharing.”
Examples of data collaboratives include H3Africa (a collaboration between NIH and Wellcome Trust) and NHS England’s care.data programme.
Chris Ansell, Alison Gash
A journal article describing the emerging practice of public-private partnerships, particularly those built around data sharing.
Published in Journal of Public Administration Research and Theory in 2007
Journal Article
Governance and Operations
Published in Journal of Public Administration Research and Theory in 2007
This article describes collaborative arrangements that include public and private organizations working together and proposes a model for understanding an emergent form of public-private interaction informed by 137 diverse cases of collaborative governance.
The article suggests factors significant to successful partnering processes and outcomes include:
The authors provide a ‘’contingency theory model’’ that specifies relationships between different variables that influence outcomes of collaborative governance initiatives. Three “core contingencies’’ for successful collaborative governance initiatives identified by the authors are:
Bellagio Big Data Workshop Participants
A white paper describing the potential of big data, and corporate data in particular, to positively benefit development efforts.
Published in 2014
Report
Benefits
Risks and Challenges
Published in 2014
This white paper, produced by “a group of activists, researchers and data experts” explores the potential of big data to improve development outcomes and spur positive social change in low- and middle-income countries. Using examples, the authors discuss four areas in which the use of big data can impact development efforts:
The authors argue that in order to maximize the potential of big data’s use in development, “there is a case to be made for building a data commons for private/public data, and for setting up new and more appropriate ethical guidelines.”
They also identify a number of challenges, especially when leveraging data made accessible from a number of sources, including private sector entities, such as:
Institute of Medicine
A consensus, peer-revieed IOM report recommending how to promote responsible clinical trial data sharing and minimize risks and challenges of sharing.
Published in 2015
Report
Benefits
Risks and Challenges
Data Responsibility
Published in 2015
Stefaan Verhulst
An essay offering a new understanding of data responsibility comprising a duty to share, a data to protect, and a duty to act.
Published in The Conversation in 2016
Essay
General
Benefits
Data Responsibility
In Practice
Published in The Conversation in 2016
Iryna Susha, Marijn Janssen, Stefaan Verhulst
A research paper providing a new taxonomy for types of data collaboratives.
Published in Proceedings of the 50th Hawaii International Conference on System Sciences in 2017
Paper
General
Governance and Operations
Published in Proceedings of the 50th Hawaii International Conference on System Sciences in 2017
Monica Bulger, Patrick McCormick, Mikaela Pitcan
A report exploring the history of inBloom, and education technology and data platform launched in 2013 and ended a year later.
Published in Data & Society Working Paper Series in 2017
Report
Risks and Challenges
In Practice
Published in Data & Society Working Paper Series in 2017
Stefaan G. Verhulst
This article investigates how companies use their data for social good, and identifies an emerging field of “data responsibility” where proprietary data can be used as a social asset.
Published in Stanford Social Innovation Review in 2017
Essay
Magazine Article
Benefits
Data Responsibility
In Practice
Published in Stanford Social Innovation Review in 2017
In this article for the Stanford Social Innovation Review, Stefaan Verhulst analyses how an increasing number of companies are using their data for social good, evidence for a new concept of “data responsibility” where data and information is used to reach positive public ends. Data Responsibility is defined as steps companies can take to open their proprietary data to external bodies, who can use this information to confront humanitarian emergencies and public problems. Central to this is the creation of “Data Collaboratives”—data-sharing models between corporations and public institutions, NGOs, and academic bodies. The article goes on to detail the “Three Pillars of Data Responsibility” which include: • Share: the duty for data-holders to share their data. • Protect: the need to protect data through adequate anonymization techniques. • Act: the need for policy makers and leaders to act upon the information and data shared. The article ends by calling on a “culture shift” to incorporate concepts of data responsibility into day-to-day company activities, and outlying four practical steps to make this happen.
Freddy De Meersman, Gerdy Seynaeve, Marc Debusschere, Patrick Lusyne, Pieter Dewitte, Youri Baeyens, Albrecht Wirthmann, Christophe Demunter, Fernando Reis, Hannes I. Reuter
This paper presented at the 2016 European Conference on Quality in Official Statistic assesses the ability for mobile phone data in Belgium to be used to collect official statistics, comparing data collected from mobile phones with official census data.
Published in European Conference on Quality in Official Statistics (Q2016) in 2016
Journal Article
Benefits
In Practice
Governance and Operations
Published in European Conference on Quality in Official Statistics (Q2016) in 2016
Jake Porway
This articles provides non profits with three guiding principles on how to effectively incorporate data science into their operations.
Published in Stanford Social Innovation Review in 2017
Magazine Article
Operations
In Practice
Governance and Operations
Published in Stanford Social Innovation Review in 2017
This articles provides non profits with three guiding principles on how to effectively incorporate data science into their operations. (1) Collaborate with data science experts to define your project. (2) Collaborate across your organization to “build with, not for.” (3) Collaborate across your sector to move the needle.
UN Global Pulse, GSMA
“This report outlines the value of harnessing mobile data for social good and provides an analysis of the gaps. Its aim is to survey the landscape today, assess the current barriers to scale, and make recommendations for a way forward.”
Published in 2017
Report
Case Study
Benefits
Risks and Challenges
Data Responsibility
In Practice
Published in 2017
– “The report reviews the challenges the field is currently facing and discusses a range of issues preventing mobile data from being used for social good. – It continues by providing a set of recommendations intended to move beyond short-term and ad hoc projects to more systematic and institutionalized implementations that are scalable, replicable, sustainable and focused on impact. – Finally, the report proposes a roadmap for 2018 calling all stakeholders to work on developing a scalable and impactful demonstration project that will help to establish the value of mobile data for social good. – The report includes examples of innovation projects and ways in which mobile data is already being used to inform development and humanitarian work. It is intended to inspire social impact organizations and mobile network operators (MNOs) to collaborate in the exploration and application of new data sources, methods and technologies.”
Future of Privacy Forum
“In this report, we aim to contribute to the literature by seeking the ‘ground truth’ from the corporate sector about the challenges they encounter when they consider making data available for academic research.”
Published in 2017
Report
Benefits
Incentives
Risks and Challenges
Data Responsibility
In Practice
Published in 2017
– “The report seeks to provide insights about why companies share data with academic researchers, how they make that data available for research, the perceived risks from sharing data, and the strategies that companies employ to address those risks. – Participants that provided information for the report came from 19 companies across diverse sectors, such as high-tech manufacturing, workforce, education, healthcare, telecommunucations, real estate, ecommerce, data-related services, transportation, consumer genetics testing, and online services. – Future of Privacy Forum researchers use the insights gathered from interviews with these companies to identify opportunities for private data sharing. Some recommendations include the potential to enhance a positive public profile of a company sharing data, to increase peer-to-peer knowledge sharing networks, to create a clearinghouse to match academics to companies with the research that they need, and to develop safeguards to mitigate perceived risks.”
Iryna Susha, Marijn Janssen, Stefaan Verhulst
Published in Transforming Government: People, Process and Policy in 2017
Journal Article
Data Responsibility
Published in Transforming Government: People, Process and Policy in 2017
Purpose In “data collaboratives”, private and public organizations coordinate their activities to leverage data to address a societal challenge. This paper aims to focus on analyzing challenges and coordination mechanisms of data collaboratives.
Design/methodology/approach This study uses coordination theory to identify and discuss the coordination problems and coordination mechanisms associated with data collaboratives. The authors also use a taxonomy of data collaborative forms from a previous empirical study to discuss how different forms of data collaboratives may require different coordination mechanisms.
Findings The study analyzed data collaboratives from the perspective of organizational and task levels. At the organizational level, the authors argue that data collaboratives present an example of the bazaar form of coordination. At the task level, the authors identified five coordination problems and discussed potential coordination mechanisms to address them, such as coordination by negotiation, by third party, by standardization, to name a few.
Research limitations/implications This study is one of the first few to systematically analyze the phenomenon of “data collaboratives”.
Practical implications This study can help practitioners better understand the coordination challenges they may face when initiating a data collaborative and to develop successful data collaboratives by using coordination mechanisms to mitigate these challenges.
Originality/value Data collaboratives are a novel form of data-driven initiatives which have seen rapid experimentation lately. This study draws attention to this concept in the academic literature and highlights some of the complexities of organizing data collaboratives in practice.
Gary King, Nate Persily
Published in 2018
Paper
General
Published in 2018
The mission of the academic social sciences is to understand and ameliorate society’s greatest challenges. The data held by private companies holds vast potential to further this mission. Yet, because of its interaction with highly politicized issues, customer privacy, proprietary content, and differing goals of firms and academics, these data are often inaccessible to university researchers. We propose here a new model for industry-academic partnerships that addresses these problems via a novel organizational structure: Respected scholars form a commission which, as a trusted third party, receives access to all relevant firm information and systems, and then recruits independent academics to do research in specific areas following standard peer review protocols organized and funded by nonprofit foundations. We also report on a partnership we helped forge under this model to make data available about the extremely visible and highly politicized issues surrounding the impact of social media on elections and democracy. In our partnership, Facebook will provide privacy-preserving data and access; seven major politically and substantively diverse nonprofit foundations will fund the research; and the Social Science Research Council will oversee the peer review process for funding and data access.
Stefaan Verhulst, Andrew Young
Data collaboratives can help harness the value of data held by the private sector and create a new added value that can address various public issues. This article delineates the potential of public-private partnership for data sharing, and proposes a data responsibility framework that serves as a guideline as well as a safeguard to protect from the risks involved in data sharing.
Published in Harvard Business Review in 2018
Essay
Data Responsibility
Published in Harvard Business Review in 2018
Stefaan Verhulst
Stefaan Verhulst’s latest work centers on how technology can improve people’s lives and the creation of more effective and collaborative forms of governance. Specifically, he is interested in the perils and promise of collaborative technologies and how to harness the unprecedented volume of information to advance the public good.
Published in TedxMidAtlantic in 2017
Paper
Data Responsibility
Published in TedxMidAtlantic in 2017
Skipper Seabold, Andrea Coppola
This study seeks to assess the possibility of using Google Trends data for forecasting price series in Central America. It discusses some of the challenges inherent in working with it in the context of developing countries. It finds that the addition of the Internet search index improves forecasting over benchmark models in about 20 percent of the series and discusses the reasons for the varied success and potential avenues for future research.
Published in World Bank in 2015
Paper
General
Risks and Challenges
Published in World Bank in 2015
European Commission
The European Commission published this accompanying document to the Communication “Towards a common European data space”. This “Staff Working Document aims to provide a toolbox for companies that are data holders, data users, or both at the same time. For this purpose, it contains a “How to” guide on legal, business and technical aspects of data sharing that can be used in practice when considering and preparing data transfers between companies coming from the same or different sectors.”
Published in European Commission in 2018
Paper
General
Data Responsibility
Governance and Operations
Published in European Commission in 2018
The purpose of this Staff Working Document is “to provide a toolbox for companies that are data holders, data users, or both at the same time. For this purpose, it contains a “How to” guide on legal, business and technical aspects of data sharing that can be used in practice when considering and preparing data transfers between companies coming from the same or different sectors.”
This Staff Working Document defines the following principles to ensure fair markets: transparency, shared value creation, respect for each other’s commercial interests, ensure undistorted competition, minimised data lock-in.
Additionally, it also provides a guide of data sharing for businesses and a guide on how to make data sharing successful and fruitful.
European Commission
The European Commission published this document for the following purpose: “With this Communication, the Commission proposes a package of measures as a key step towards a common data space in the EU - a seamless digital area with the scale that will enable the development of new products and services based on data.”
Published in European Commission in 2018
Paper
General
Benefits
Governance and Operations
Published in European Commission in 2018
In building towards a common data space, which is “a seamless digital area with the scale that will enable the development of new products and services based on data,” the European Commission proposes the following steps:
Alberto Alemanno
This article identifies the major challenges of unlocking private-held data to the benefit of society and sketches a research agenda for scholars interested in collaborative and regulatory solutions aimed at unlocking privately-held data for good.
Published in European Journal of Risk Regulation in 2018
Journal Article
General
Benefits
Risks and Challenges
Published in European Journal of Risk Regulation in 2018
Municipality of Copenhagen and Capital Region of Denmark
The City Data Exchange a collaborative project created by the Municipality of Copenhagen, the Capital Region of Denmark, and Hitachi to create a marketplace for public and private organizations to take part in data exchange.
Published in 2018
Report
Paper
General
Data Responsibility
Governance and Operations
Published in 2018
The CDE has created a platform for both public and private organizations to sell and purchase data in an effort to create a data exchange between the two sectors. One of the datasets in highest demand is what they call people movement patterns data, which is how people in a given area move around at different times and places.
Gabriel Popkin
“In the past few years, technology and satellite companies’ offerings to scientists have increased dramatically. Thousands of researchers now use high-resolution data from commercial satellites for their work. […] Researchers use the new capabilities to track and visualize forest and coral-reef loss; monitor farm crops to boost yields; and predict glacier melt and disease outbreaks.”
Published in Nature in 2018
Journal Article
General
Benefits
In Practice
Published in Nature in 2018
In this article, Popkin examines a number of examples of technology companies initiating data collaboratives, including:
Jamie Holton
From the abstract: “This research looks at the emerging phenomenon of data collaboratives, specifically in the ‘crisis response’ sector, with which the private sector assists the public sector’s data-driven efforts to prevent or respond to humanitarian emergencies. This research explores and explains why the private sector participates in crisis response data collaboratives.”
Published in Leiden University in 2018
Paper
Benefits
Incentives
Data Responsibility
Published in Leiden University in 2018
From the abstract: “This research explores and explains why the private sector participates in crisis response data collaboratives.
“Through secondary literature analysis, and primary survey and interview analysis of three case studies, this research provides new insights into data collaborative objectives, the private sector’s activities, the incentives and risks these collaboratives present for the private sector, and how it mitigates such risks.
“The research concludes that the private sector enters crisis response data collaboratives to help the public sector address one or more of its obstacles to creating datadriven solutions to societal problems, and occasionally to achieve additional objectives for the public good.
“Although the private sector is motivated by various incentives, sufficient mitigation of presented risks, especially risks to data subjects’ privacy and security, is a precondition to joining a crisis response data collaborative.”
Spyratos Spyridon, Vespe Michele, Natale Fabrizio, Weber Ingmar, Zagheni Emilio, Rango Marzia
While research on the use of big data sources for migration is in its infancy, and the diffusion of internet technologies in less developed countries is still limited, the use of big data sources can unveil useful insights on quantitative and qualitative characteristics of migration.
Published in Joint Research Centre, European Commission in 2018
Report
In Practice
Published in Joint Research Centre, European Commission in 2018
In this report, the authors examine social media data to study migration patterns:
Richard Beckwith, John Sherry, David Prendergast
From the chapter: “This paper explores the complex relationship between cities and data or, more accurately, the way that the citizens of a city want data about their community to be managed.”
Published in Springer in 2019
Journal Article
General
Published in Springer in 2019
From the abstract: “Much of the recent excitement around data, especially ‘Big Data,’ focuses on the potential commercial or economic value of data. How that data will affect people isn’t much discussed. People know that smart cities will deploy Internet-based monitoring and that flows of the collected data promise to produce new values. Less considered is that smart cities will be sites of new forms of citizen action—enabled by an ‘economy’ of data that will lead to new methods of collectivization, accountability, and control which, themselves, can provide both positive and negative values to the citizenry. Therefore, smart city design needs to consider not just measurement and publication of data but also the implications of city-wide deployment, data openness, and the possibility of unintended consequences if data leave the city.”
Yves-Alexandre de Montjoye, et. al.
From the article: “The breadcrumbs we leave behind when using our mobile phones—who somebody calls, for how long, and from where—contain unprecedented insights about us and our societies. Researchers have compared the recent availability of large-scale behavioral datasets, such as the ones generated by mobile phones, to the invention of the microscope, giving rise to the new field of computational social science.”
Published in Nature in 2018
Journal Article
Risks and Challenges
Published in Nature in 2018
Bram Klievink, Haiko van der Voort, Wijnand Veeneman
From the abstract: “This article looks at the idea of data collaboratives as a form of cross-sector partnership to exchange and integrate data and data use to generate public value.”
Published in Information Polity Journal in 2018
Journal Article
Benefits
Published in Information Polity Journal in 2018
Full abstract: “Driven by the technological capabilities that ICTs offer, data enable new ways to generate value for both society and the parties that own or offer the data. This article looks at the idea of data collaboratives as a form of cross-sector partnership to exchange and integrate data and data use to generate public value. The concept thereby bridges data-driven value creation and collaboration, both current themes in the field. To understand how data collaboratives can add value in a public governance context, we exploratively studied the qualitative longitudinal case of an infomobility platform. We investigated the ability of a data collaborative to produce results while facing significant challenges and tensions between the goals of parties, each having the conflicting objectives of simultaneously retaining control whilst allowing for generativity. Taken together, the literature and case study findings help us to understand the emergence and viability of data collaboratives. Although limited by this study’s explorative nature, we find that conditions such as prior history of collaboration and supportive rules of the game are key to the emergence of collaboration. Positive feedback between trust and the collaboration process can institutionalise the collaborative, which helps it survive if conditions change for the worse.”
Bjorn Lundqvist
From the abstract: “This article will discuss what implications combining data in data pools by firms might have on competition, and when competition law should be applicable. It develops the idea that data pools harbour great opportunities, whilst acknowledging that there are still risks to take into consideration, and to regulate.”
Published in Faculty of Law, Stockholm University Research Paper in 2018
Journal Article
Governance and Operations
Published in Faculty of Law, Stockholm University Research Paper in 2018
Jennifer Shkabatur
This paper makes a case for the practice and application of the global commons of data, by proposing alternatives to address the concern of data sharing and presenting policy framework that will allow the global commons of data to work.
Published in Stanford Technology Law Review in 2018
Journal Article
General
Operations
Published in Stanford Technology Law Review in 2018
Abstract:
Data platform companies (such as Facebook, Google, or Twitter) amass and process immense amounts of data that is generated by their users. These companies primarily use the data to advance their commercial interests, but there is a growing public dismay regarding the adverse and discriminatory impacts of their algorithms on society at large. The regulation of data platform companies and their algorithms has been hotly debated in the literature, but current approaches often neglect the value of data collection, defy the logic of algorithmic decision-making, and exceed the platform companies’ operational capacities.
This Article suggests a different approach — an open, collaborative, and incentives-based stance toward data platforms that takes full advantage of the tremendous societal value of user-generated data. It contends that this data shall be recognized as a “global commons,” and access to it shall be made available to a wide range of independent stakeholders — research institutions, journalists, public authorities, and international organizations. These external actors would be able to utilize the data to address a variety of public challenges, as well as observe from within the operation and impacts of the platforms’ algorithms.
After making the theoretical case for the “global commons of data,” the Article explores the practical implementation of this model. First, it argues that a data commons regime should operate through a spectrum of data sharing and usage modalities that would protect the commercial interests of data platforms and the privacy of data users. Second, it discusses regulatory measures and incentives that can solicit the collaboration of platform companies with the commons model. Lastly, it explores the challenges embedded in this approach.
Meg Young, Luke Rodriguez, Emily Keller, Feiyang Sun, Boyang Sa, Jan Whittington, Bill Howe
From the abstract: “We find that the liberal use of synthetic data, in conjunction with strong legal protections over raw data, strikes a tunable balance between transparency, proprietorship, privacy, and research objectives; and that the legal-technical framework we describe can form the basis for organizational data trusts in a variety of contexts.”
Published in Proceedings of ACM in 2019
Paper
Risks and Challenges
Published in Proceedings of ACM in 2019
Bianca Wylie, Sean McDonald
The authors propose a new concept called “Data Trust” to “steward, maintain and manage how data is used and shared — from who is allowed access to it, and under what terms, to who gets to define the terms, and how.”
Published in Center for International Governance Innovation in 2018
Blog Post
Data Responsibility
Governance and Operations
Published in Center for International Governance Innovation in 2018
Claire Borsenberger, Mathilde Hoang, Denis Joram
This paper seeks to answer the following research questions:
Published in Springer in 2019
Essay
General
Published in Springer in 2019
Abstract: “Thanks to appropriate data algorithms, firms, especially those on-line, are able to extract detailed knowledge about consumers and markets. This raises the question of the essential facility character of data. Moreover, the features of digital markets lead to a concentration of this core input in the hands of few big “superstars” and arouse legitimate economic and societal concerns. In a more and more data-driven society, one could ask if data openness is a solution to deal with power derived from data concentration. We conclude that only a case-by-case approach should be followed. Mandatory open data policy should be conditioned on an ex-ante cost-benefit analysis proving that the benefits of disclosure exceed its costs.”
Natalia Adler, Ciro Cattuto, Kyriaki Kalimeri, Daniela Paolotti, Michele Tizzoni, Stefaan Verhulst, Elad Yom-Tov, Andrew Young
“As the product of a data collaborative, this paper leverages private-sector search engine data toward gaining a fuller, more accurate picture of the suicide issue among young people in India.”
Published in Journal of Medical Internal Research in 2019
Journal Article
In Practice
Published in Journal of Medical Internal Research in 2019
Abstract:
“Background: India is home to 20% of the world’s suicide deaths. Although statistics regarding suicide in India are distressingly high, data and cultural issues likely contribute to a widespread underreporting of the problem. Social stigma and only recent decriminalization of suicide are among the factors hampering official agencies’ collection and reporting of suicide rates.
“Objective: As the product of a data collaborative, this paper leverages private-sector search engine data toward gaining a fuller, more accurate picture of the suicide issue among young people in India. By combining official statistics on suicide with data generated through search queries, this paper seeks to: add an additional layer of information to more accurately represent the magnitude of the problem, determine whether search query data can serve as an effective proxy for factors contributing to suicide that are not represented in traditional datasets, and consider how data collaboratives built on search query data could inform future suicide prevention efforts in India and beyond.
“Methods: We combined official statistics on demographic information with data generated through search queries from Bing to gain insight into suicide rates per state in India as reported by the National Crimes Record Bureau of India. We extracted English language queries on “suicide,” “depression,” “hanging,” “pesticide,” and “poison”. We also collected data on demographic information at the state level in India, including urbanization, growth rate, sex ratio, internet penetration, and population. We modeled the suicide rate per state as a function of the queries on each of the 5 topics considered as linear independent variables. A second model was built by integrating the demographic information as additional linear independent variables.
“Results: Results of the first model fit (R2) when modeling the suicide rates from the fraction of queries in each of the 5 topics, as well as the fraction of all suicide methods, show a correlation of about 0.5. This increases significantly with the removal of 3 outliers and improves slightly when 5 outliers are removed. Results for the second model fit using both query and demographic data show that for all categories, if no outliers are removed, demographic data can model suicide rates better than query data. However, when 3 outliers are removed, query data about pesticides or poisons improves the model over using demographic data.
“Conclusions: In this work, we used search data and demographics to model suicide rates. In this way, search data serve as a proxy for unmeasured (hidden) factors corresponding to suicide rates. Moreover, our procedure for outlier rejection serves to single out states where the suicide rates have substantially different correlations with demographic factors and query rates.”
Kieron O'Hara
“This paper defends the following thesis: A data trust works within the law to provide ethical, architectural and governance support for trustworthy data processing.”
Published in 2019
Paper
Governance and Operations
Published in 2019
Abstract: “In their report on the development of the UK AI industry, Wendy Hall and Jérôme Pesenti recommend the establishment of data trusts, “proven and trusted frameworks and agreements” that will “ensure exchanges [of data] are secure and mutually beneficial” by promoting trust in the use of data for AI. This paper defends the following thesis: A data trust works within the law to provide ethical, architectural and governance support for trustworthy data processing. Data trusts are therefore both constraining and liberating. They constrain: they respect current law, so they cannot render currently illegal actions legal. They are intended to increase trust, and so they will typically act as further constraints on data processors, adding the constraints of trustworthiness to those of law. Yet they also liberate: if data processors are perceived as trustworthy, they will get improved access to data. The paper addresses the areas of: trust and trustworthiness; ethics; architecture; legal status.”
Shannon Lefaivre, Brendan Behan, Anthony Vaccarino, Kenneth Evans, Moyez Dharsee, Tom Gee, Costa Dafnas, Tom Mikkelsen, Elizabeth Theriault
“The aim of this report is to highlight these best practices and develop a key open resource which may be referenced during the development of similar open science initiatives.”
Published in Frontiers in Genetics in 2019
Journal Article
In Practice
Governance and Operations
Published in Frontiers in Genetics in 2019
André Corrêa d'Almeida, Caroline McHeffey, Nilda Mesa, Arnaud Sahuguet, Stefaan G. Verhulst, Andrew Young, Andrew J. Zahuranec
“This inaugural edition of New Lab’s Research Journal (i) describes the process of developing and launching New Lab’s The Circular City program, (ii) introduces circular city data as the first exploration of this program, and (iii) investigates and methodologically tests the value of circular data applied to three urban challenges: economic development, mobility, and resilience.”
Published in New Lab in 2019
Journal Article
General
Benefits
Incentives
Risks and Challenges
In Practice
Published in New Lab in 2019
From the introduction: “Coming out of ten months of applied, participatory, and multidisciplinary research, this journal presents:
A. “One case study developed to document and explain how the program was conceived, designed, and implemented, with the goal of offering lessons for scalability at New Lab and replicability in other cities around the world. The key questions explored in the case study are:
B. “Three research papers developed to investigate three urban challenges:
Ron S. Jarmin
“In this essay, [Jarmin] describe[s] some work underway that hints at what 21st century official economic measurement will look like and offer some preliminary comments on what is needed to get there.”
Published in Journal of Economic Perspectives in 2019
Journal Article
Benefits
Published in Journal of Economic Perspectives in 2019
Geoff Mulgan, Vincent Straub
“Here we attempt to open up part of the debate on data governance; suggesting how to address the twin goals of greater control for citizens, and greater value for the public as a whole. We argue that there are a variety of different solutions that need to be designed, and experimented with.”
Published in Nesta in 2019
Essay
Data Responsibility
Published in Nesta in 2019
“This paper argues that new institutions—an ecosystem of trust—are needed to ensure that uses of data are trusted and trustworthy. It advocates the creation of different kinds of data trust to fill this gap. It argues:
A. Martinez, A. C. Rainie
“The project described here reviewed publicly available data sharing agreements that focus on research with Indigenous nations and communities in the United States. […] The results detail how Indigenous peoples currently use data sharing agreements and potential areas of expansion for language to include in data sharing agreements as Indigenous peoples address the research needs of their communities and the protection of community and cultural data.”
Published in American Geophysical Union in 2018
Journal Article
In Practice
Governance and Operations
Published in American Geophysical Union in 2018
Abstract: “Indigenous communities and scholars have been influencing a shift in participation and inclusion in academic and agency research over the past two decades. As a response, Indigenous peoples are increasingly asking research questions and developing their own studies rooted in their cultural values. They use the study results to rebuild their communities and to protect their lands. This process of Indigenous-driven research has led to partnering with academic institutions, establishing research review boards, and entering into data sharing agreements to protect environmental data, community information, and local and traditional knowledges.Data sharing agreements provide insight into how Indigenous nations are addressing the key areas of data collection, ownership, application, storage, and the potential for data reuse in the future. By understanding this mainstream data governance mechanism, how they have been applied, and how they have been used in the past, we aim to describe how Indigenous nations and communities negotiate data protection and control with researchers.
“The project described here reviewed publicly available data sharing agreements that focus on research with Indigenous nations and communities in the United States. We utilized qualitative analysis methods to identify specific areas of focus in the data sharing agreements, whether or not traditional or cultural values were included in the language of the data sharing agreements, and how the agreements defined data. The results detail how Indigenous peoples currently use data sharing agreements and potential areas of expansion for language to include in data sharing agreements as Indigenous peoples address the research needs of their communities and the protection of community and cultural data.”
Deborah Mascalzoni, et al
“[D]ata deposition requirements and research repositories will have to adapt to the legal and ethical landscape of the GDPR. Noncompliance with the GDPR may incur administrative fines of up to €20 million, and the regulation is enforced by data protection authorities in each EU nation.”
Published in Annals of Internal Medicine in 2019
Essay
Risks and Challenges
Data Responsibility
Governance and Operations
Published in Annals of Internal Medicine in 2019
Stefaan G. Verhulst
Testimony before New York City Council Committee on Technology and the Commission on Public Information and Communication (COPIC).
Published in The GovLab in 2019
Essay
General
Published in The GovLab in 2019
Steve MacFeely
“This paper examines the opportunities and challenges presented by big data for compiling indicators to support Agenda 2030.”
Published in Global Policy Volume in 2019
Journal Article
General
Benefits
Published in Global Policy Volume in 2019
Daniel Kondor, Behrooz Hashemian, Yves-Alexandre de Montjoye, Carlo Ratti
“Extending previous work on reidentifiability of spatial data and trajectory matching, we present the first large-scale analysis of user matchability in real mobility datasets on realistic scales, i.e. among two datasets that consist of several million people’s mobility traces, coming from a mobile network operator and transportation smart card usage.”
Published in IEEE Transactions on Big Data in 2018
Journal Article
General
Benefits
Published in IEEE Transactions on Big Data in 2018
From the abstract:
GloPID-R
According to the Executive Summary, this “roadmap aims to accelerate effective data sharing by highlighting measures GloPID-R research funders can take to improve research data sharing by their grantees and to advocate for increased research and public health data sharing more widely.”
Published in GloPID-R
Report
General
Incentives
Operations
Published in GloPID-R
Recommendations:
OECD
“This report examines the opportunities of enhancing access to and sharing of data (EASD) in the context of the growing importance of artificial intelligence and the Internet of Things. It discusses how EASD can maximise the social and economic value of data re-use and how the related risks and challenges can be addressed. […] It also provides examples of EASD approaches and policy initiatives in OECD countries and partner economies.”
Published in OECD Publishing in 2019
Report
Risks and Challenges
Published in OECD Publishing in 2019
The report argues that data sharing can “increase the value of data to the data holder, and even more so to secondary data users, with additional positive spill-over benefits for country economies and society at large.”
However, several challenges need to be addressed so that the public can harness the benefit of data sharing. Such challenges such as balancing “the benefits of enhancing data “openness” with the risks, while considering legitimate private, national and public interests.”
Examining over 200 policy initiatives across 37 countries, the report finds several trends, such as “few countries have policy initiatives to faciliate data sharing within the private sector.”
Finally, the report proposes that policy makers employ several strategies involving contractual agreements, open data, data portability, and restricted data-sharing agreement to address the major challenges.
Andreas Rasche, Mette Morsing, Erik Wetter
The authors examine the legitimacy of different types of data sharing partnerships (open and closed) for social good.
Published in Business & Society in 2019
Journal Article
Governance and Operations
Benefits
General
Published in Business & Society in 2019
Abstract: “This article examines the legitimacy attached to different types of multi-stakeholder data partnerships occurring in the context of sustainable development. We develop a framework to assess the democratic legitimacy of two types of data partnerships: open data partnerships (where data and insights are mainly freely available) and closed data partnerships (where data and insights are mainly shared within a network of organizations). Our framework specifies criteria for assessing the legitimacy of relevant partnerships with regard to their input legitimacy as well as their output legitimacy. We demonstrate which particular characteristics of open and closed partnerships can be expected to influence an analysis of their input and output legitimacy.”
Robert M. Groves, Adam Neufeld
“The public pays for and provides an incredible amount of data to governments and companies. Yet much of the value of this data is being wasted, remaining in silos rather than being shared to enhance the common good—whether it’s helping governments to stop opioid addiction or helping companies predict and meet the demand for electric or autonomous vehicles.”
Published in Beeck Center, Georgetown University in 2017
Paper
General
Benefits
Incentives
Risks and Challenges
Published in Beeck Center, Georgetown University in 2017
Mars Lan, Seyi Adebajo, Shirshanka Das
LinkedIn created a generalized metadata search and discovery tool, DataHub, to help scale up productivity and innovation that require their data assets.
Published in LinkedIn Blog in 2019
Blog Post
Operations
Published in LinkedIn Blog in 2019
“As the operator of the world’s largest professional network and the Economic Graph, LinkedIn’s Data team is constantly working on scaling its infrastructure to meet the demands of our ever-growing big data ecosystem. As the data grows in volume and richness, it becomes increasingly challenging for data scientists and engineers to discover the data assets available, understand their provenances, and take appropriate actions based on the insights. To help us continue scaling productivity and innovation in data alongside this growth, we created a generalized metadata search and discovery tool, DataHub.
“As a result, we decided to expand the scope of the project to build a fully generalized metadata search and discovery tool, DataHub, with an ambitious vision: connecting LinkedIn employees with data that matters to them.
“We broke the monolithic WhereHows stack into two distinct stacks: a Modular UI frontend and a Generalized Metadata Architecture backend. The new architecture enabled us to rapidly expand our scope of metadata collection beyond just datasets and jobs. At the time of writing, DataHub already stores and indexes tens of millions of metadata records that encompass 19 different entities, including datasets, metrics, jobs, charts, AI features, people, and groups. We also plan to onboard metadata for machine learning models and labels, experiments, dashboards, microservice APIs, and code in the near future.”
Adrian E. Raftery, Janet Currie, Mary T. Bassett, Robert Groves
A guide from the National Academies of Sciences, Engineering, and Medicine examining the strengths and weaknesses of various datasets with potential value for addressing the COVID-19 pandemic.
Published in The National Academies Press in 2020
Guide
Benefits
Risks and Challenges
In Practice
Published in The National Academies Press in 2020
Abstract:
“This rapid expert consultation provides insight into the strengths and weaknesses of the data on the COVID-19 pandemic by applying five criteria to seven types of data available to support decision making. It was produced through the Societal Experts Action Network (SEAN), an activity of the National Academies of Sciences, Engineering, and Medicine that is sponsored by the National Science Foundation. SEAN links researchers in the social, behavioral, and economic sciences with decision makers to respond to policy questions arising from the COVID-19 pandemic.”
Laetitia Gauvin, Michele Tizzoni, Simone Piaggesi, Andrew Young, Natalia Adler, Stefaan Verhulst, Leo Ferres, and Ciro Cattuto
A journal article providing lessons learned from a data collaborative that leveraged anonymized call detail records and other datasets to better understand the mobility experiences of women and girls in Santiago de Chile.
Published in Nature: Humanities and Social Sciences Communications in 2020
Journal Article
In Practice
Published in Nature: Humanities and Social Sciences Communications in 2020
Abstract:
“Mobile phone data have been extensively used to study urban mobility. However, studies based on gender-disaggregated large-scale data are still lacking, limiting our understanding of gendered aspects of urban mobility and our ability to design policies for gender equality. Here we study urban mobility from a gendered perspective, combining commercial and open datasets for the city of Santiago, Chile. We analyze call detail records for a large cohort of anonymized mobile phone users and reveal a gender gap in mobility: women visit fewer unique locations than men, and distribute their time less equally among such locations. Mapping this mobility gap over administrative divisions, we observe that a wider gap is associated with lower income and lack of public and private transportation options. Our results uncover a complex interplay between gendered mobility patterns, socio-economic factors and urban affordances, calling for further research and providing insights for policymakers and urban planners.”
Hayden Dahmm
A report sharing initial lessons learned from the Contracts for Data Collaboration initiative (C4DC) and its study of core components of data sharing agreements and other legal instruments to enable data collaboratives.
Published in UN Sustainable Development Solutions Network’s Thematic Research Network on Data and Statistics (SDSN TReNDS) in 2020
Report
Governance and Operations
Published in UN Sustainable Development Solutions Network’s Thematic Research Network on Data and Statistics (SDSN TReNDS) in 2020
Abstract:
“In the midst of the COVID-19 pandemic, data has never been more salient. COVID has generated new data demands and increased cross-sector data collaboration. Yet, these data collaborations require careful planning and evaluation of risks and opportunities, especially when sharing sensitive data. Data sharing agreements (DSAs) are written agreements that establish the terms for how data are shared between parties and are important for establishing accountability and trust. However, negotiating DSAs is often time consuming, and collaborators lacking legal or financial capacity are disadvantaged. Contracts for Data Collaboration (C4DC) is a joint initiative between SDSN TReNDS, NYU’s GovLab, the World Economic Forum, and the University of Washington, working to strengthen trust and transparency of data collaboratives. The partners have created an online library of DSAs which represents a selection of data applications and contexts. This report introduces C4DC and its DSA library. We demonstrate how the library can support the data community to strengthen future data collaborations by showcasing various DSA applications and key considerations. First, we explain our method of analyzing the agreements and consider how six major issues are addressed by different agreements in the library. Key issues discussed include data use, access, breaches, proprietary issues, publicization of the analysis, and deletion of data upon termination of the agreement. For each of these issues, we describe approaches illustrated with examples from the library. While our analysis suggests some pertinent issues are regularly not addressed in DSAs, we have identified common areas of practice that may be helpful for entities negotiating partnership agreements to consider in the future.”
Victor Couture, Jonathan I. Dingel, Allison E. Green, Jessie Handbury, Kevin R. Williams
An NBER working paper examining the “suitability of smartphone data for quantifying movement and social contact” during the COVID-19 pandemic.
Published in The National Bureau of Economic Research in 2020
Paper
In Practice
Published in The National Bureau of Economic Research in 2020
Abstract:
“Tracking human activity in real time and at fine spatial scale is particularly valuable during episodes such as the COVID-19 pandemic. In this paper, we discuss the suitability of smartphone data for quantifying movement and social contact. We show that these data cover broad sections of the US population and exhibit movement patterns similar to conventional survey data. We develop and make publicly available a location exposure index that summarizes county-to-county movements and a device exposure index that quantifies social contact within venues. We use these indices to document how pandemic-induced reductions in activity vary across people and places.”
Christopher Loynes, Jamal Ouenniche, Johannes De Smedt
A paper, introducing an automated tool targeted at humanitarian actors to help them “detect a disaster using tweets, alongside a portal to identify local and regional NGOs that are best-positioned to provide support to people adversely affected by a disaster.”
Published in Annals of Operations Research in 2020
Journal Article
Operations
In Practice
Published in Annals of Operations Research in 2020
Abstract:
“This paper provides the humanitarian community with an automated tool that can detect a disaster using tweets posted on Twitter, alongside a portal to identify local and regional Non-Governmental Organisations (NGOs) that are best-positioned to provide support to people adversely affected by a disaster. The proposed disaster detection tool uses a linear Support Vector Classifier (SVC) to detect man-made and natural disasters, and a density-based spatial clustering of applications with noise (DBSCAN) algorithm to accurately estimate a disaster’s geographic location. This paper provides two original contributions. The first is combining the automated disaster detection tool with the prototype portal for NGO identification. This unique combination could help reduce the time taken to raise awareness of the disaster detected, improve the coordination of aid, increase the amount of aid delivered as a percentage of initial donations and improve aid effectiveness. The second contribution is a general framework that categorises the different approaches that can be adopted for disaster detection. Furthermore, this paper uses responses obtained from an on-the-ground survey with NGOs in the disaster-hit region of Uttar Pradesh, India, to provide actionable insights into how the portal can be developed further.”
Kaustav Bhattacharjee, Min Chen, Aritra Dasgupta
A paper highlighting the role visualization can play in balancing privacy needs and effective data-driven communication
Published in Computer Graphics Forum in 2020
Journal Article
Governance and Operations
Published in Computer Graphics Forum in 2020
Abstract:
“Preservation of data privacy and protection of sensitive information from potential adversaries constitute a key socio‐technical challenge in the modern era of ubiquitous digital transformation. Addressing this challenge needs analysis of multiple factors: algorithmic choices for balancing privacy and loss of utility, potential attack scenarios that can be undertaken by adversaries, implications for data owners, data subjects, and data sharing policies, and access control mechanisms that need to be built into interactive data interfaces. Visualization has a key role to play as part of the solution space, both as a medium of privacy‐aware information communication and also as a tool for understanding the link between privacy parameters and data sharing policies. The field of privacy‐preserving data visualization has witnessed progress along many of these dimensions. In this state‐of‐the‐art report, our goal is to provide a systematic analysis of the approaches, methods, and techniques used for handling data privacy in visualization. We also reflect on the road‐map ahead by analyzing the gaps and research opportunities for solving some of the pressing socio‐technical challenges involving data privacy with the help of visualization.”
James Brian Byrd, Anna C. Greene, Deepashree Venkatesh Prasad, Xiaoqian Jiang, Casey S. Green
The piece describing “current best practices for various types of genomic data, as well as opportunities to promote ethical data sharing that accelerates science by aligning incentives.”
Published in Nature Reviews Genetics in 2020
Journal Article
In Practice
Data Responsibility
Published in Nature Reviews Genetics in 2020
Abstract:
“Data sharing anchors reproducible science, but expectations and best practices are often nebulous. Communities of funders, researchers and publishers continue to grapple with what should be required or encouraged. To illuminate the rationales for sharing data, the technical challenges and the social and cultural challenges, we consider the stakeholders in the scientific enterprise. In biomedical research, participants are key among those stakeholders. Ethical sharing requires considering both the value of research efforts and the privacy costs for participants. We discuss current best practices for various types of genomic data, as well as opportunities to promote ethical data sharing that accelerates science by aligning incentives.”
Stefano Iacus, Carlos Santamaria Serna, Francesco Sermi, Spyrdion Spyratos, Dario Tarchi, Michele Vespe
A report from the EU Science Hub introducing “the concept of data-driven Mobility Functional Areas (MFAs) as geographic zones with a high degree of intra-mobility exchanges.” These telecom data-calculated MFAs, the report authors argue, “can be useful to inform targeted re-escalation policy responses in cases of future COVID-19 outbreaks.”
Published in Publications Office of the European Union in 2020
Report
In Practice
Governance and Operations
Published in Publications Office of the European Union in 2020
Abstract:
“This work introduces the concept of data-driven Mobility Functional Areas (MFAs) as geographic zones with high degree of intra-mobility exchanges. Such information, calculated at European regional scale thanks to mobile data, can be useful to inform targeted reescalation policy responses in cases of future COVID-19 outbreaks (avoiding large-area or even national lockdowns). In such events, the geographic distribution of MFAs would define territorial areas to which lockdown interventions could be limited, with the result of minimising socio-economic consequences of such policies. The analysis of the time evolution of MFAs can also be thought of as a measure of how human mobility changes not only in intensity but also in patterns, providing innovative insights into the impact of mobility containment measures. This work presents a first analysis for 15 European countries (14 EU Member States and Norway).”
Massimo Russo, Tiang Feng
An article that aims to help companies consider how to “think about external data sharing when potential use cases are distant, unknown, or not yet existent” and to “balance the abstract value of future use cases with the tangible risk of data misuse.”
Published in BCG Henderson Institute in 2020
Blog Post
In Practice
Benefits
Published in BCG Henderson Institute in 2020
Rainer Diaz-Bone, Kenneth Horvath, Valeska Cappel
A paper arguing for “moving towards a sociology of social research in order to characterize the new qualities of big data and its deficiencies.”
Published in Historical Social Research in 2020
Journal Article
Risks and Challenges
Published in Historical Social Research in 2020
Abstract:
“The phenomenon of big data does not only deeply affect current societies but also poses crucial challenges to social research. This article argues for moving towards a sociology of social research in order to characterize the new qualities of big data and its deficiencies. We draw on the neopragmatist approach of economics of convention (EC) as a conceptual basis for such a sociological perspective. This framework suggests investigating processes of quantification in their interplay with orders of justifications and logics of evaluation. Methodological issues such as the question of the “quality of big data” must accordingly be discussed in their deep entanglement with epistemic values, institutional forms, and historical contexts and as necessarily implying political issues such as who controls and has access to data infrastructures. On this conceptual basis, the article uses the example of health to discuss the challenges of big data analysis for social research. Phenomena such as the rise of new and massive privately owned data infrastructures, the economic valuation of huge amounts of connected data, or the movement of “quantified self” are presented as indications of a profound transformation compared to established forms of doing social research. Methodological and epistemological, but also institutional and political, strategies are presented to face the risk of being “outperformed” and “replaced” by big data analysis as they are already done in big US American and Chinese Internet enterprises. In conclusion, we argue that the sketched developments have important implications both for research practices and methods teaching in the era of big data”
Susan Ariel Aaronson
A brief published by the Centre for International Governance Innovation examines the origins of differences in attitudes toward use of data troves and the perceived risks therein in the United States, Canada, and Germany. Through case studies, it also notes the existence of a “governance gap” in personal data use and the threats this inadequate governance presents.
Published in Centre for International Governance Innovation in 2020
Report
Risks and Challenges
Published in Centre for International Governance Innovation in 2020
“This paper attempts to examine why stores of personal data (data troves) held by private firms became a national security problem in the United States and compares the US response to that of Canada and Germany. Citizens in all three countries rely on many of the same data-driven services and give personal information to many of the same companies. German and Canadian policy makers and scholars have also warned of potential national security spillovers of large data troves.”
Heidi Ledford
An article outlining how social scientists are using social media data to conduct studies ranging from “the psychological underpinnings of human morality, to the influence of misinformation, to the factors that make some artists more successful than others.”
Published in Nature in 2020
Journal Article
Benefits
In Practice
Published in Nature in 2020
Fabio Ricciato, Albrecht Wirthmann, Martina Hahn
A paper exploring how “statistical offices are called nowadays to rethink the way they operate in order to reassert their role in modern democratic society,” given the “availability of new digital data sources, new technologies, and new behaviors.”
Published in Data & Policy in 2020
Journal Article
Benefits
Published in Data & Policy in 2020
Abstract:
” In this discussion paper, we outline the motivations and the main principles of the Trusted Smart Statistics (TSS) concept that is under development in the European Statistical System. TSS represents the evolution of official statistics in response to the challenges posed by the new datafied society. Taking stock from the availability of new digital data sources, new technologies, and new behaviors, statistical offices are called nowadays to rethink the way they operate in order to reassert their role in modern democratic society. The issue at stake is considerably broader and deeper than merely adapting existing processes to embrace so-called Big Data. In several aspects, such evolution entails a fundamental paradigm shift with respect to the legacy model of official statistics production based on traditional data sources, for example, in the relation between data and computation, between data collection and analysis, between methodological development and statistical production, and of course in the roles of the various stakeholders and their mutual relationships. Such complex evolution must be guided by a comprehensive system-level view based on clearly spelled design principles. In this paper, we aim at providing a general account of the TSS concept reflecting the current state of the discussion within the European Statistical System.”
Network Advertising Initiative
A guide for effectively and responsibly acting upon “novel data uses” for data collected for advertising purposes.
Published in 2020
Guide
Governance and Operations
Published in 2020
Wenzhi Ding, Ross Levine, Chen Lin, Wensi Xie
An NBER working paper that relied on mobile phone data to “examine how counties responded to both local COVID-19 cases and statewide shelter-in-place orders,” finding that “social distancing increases more in response to cases and official orders in counties where individuals historically (1) engaged less in community activities and (2) demonstrated greater willingness to incur individual costs to contribute to social objectives.”
Published in National Bureau of Economic Research in 2020
Paper
In Practice
Published in National Bureau of Economic Research in 2020
Abstract:
“Since social distancing is the primary strategy for slowing the spread of many diseases, understanding why U.S. counties respond differently to COVID-19 is critical for designing effective public policies. Using daily data from about 45 million mobile phones to measure social distancing we examine how counties responded to both local COVID-19 cases and statewide shelter-in-place orders. We find that social distancing increases more in response to cases and official orders in counties where individuals historically (1) engaged less in community activities and (2) demonstrated greater willingness to incur individual costs to contribute to social objectives. Our work highlights the importance of these two features of social capital—community engagement and individual commitment to societal institutions—in formulating public health policies.”
OpenCorporates
A report sharing insight on shifts in access to private-sector data in the EU, including: “The average score across the EU in terms of access to company data is just 40 out of 100. This is better than the average score 8 years ago, which was just 23 out of 100, but still very low nevertheless.”
Published in 2020
Report
General
In Practice
Published in 2020
Sabine Gerdon, Eddan Katz, Emilie LeGrand, Gordon Morrison, Julian Torres Santeli
A report on the WEF’s AI Procurement in a Box project, which “aims to help governments “rethink the procurement of artificial intelligence (AI) with a focus on innovation, efficiency and ethics,” and to drive the development of “ethical standards in AI development and deployment.
Published in World Economic Forum in 2020
Report
In Practice
Governance and Operations
Published in World Economic Forum in 2020
Denise Linn Riedl
A report featuring a “collection of experiences, cases, and best practices” intended to support “any local worker—inside or outside of government—who is helping to plan or implement technological change in their community.
Published in Benton Institute for Broadband & Society in 2020
Report
In Practice
Published in Benton Institute for Broadband & Society in 2020
Rabia I. Kodapanakkal, Mark J. Brandt, Christoph Kogler, Ilja van Beest
A journal article that finds when it comes to “big data technologies,” people’s rate of adoption tends to be driven by self-interest and data protection.
Published in Computers in Human Behavior in 2020
Journal Article
Governance and Operations
Data Responsibility
Published in Computers in Human Behavior in 2020
Abstract:
“Big data technologies have both benefits and costs which can influence their adoption and moral acceptability. Prior studies look at people’s evaluations in isolation without pitting costs and benefits against each other. We address this limitation with a conjoint experiment (N = 979), using six domains (criminal investigations, crime prevention, citizen scores, healthcare, banking, and employment), where we simultaneously test the relative influence of four factors: the status quo, outcome favorability, data sharing, and data protection on decisions to adopt and perceptions of moral acceptability of the technologies. We present two key findings. (1) People adopt technologies more often when data is protected and when outcomes are favorable. They place equal or more importance on data protection in all domains except healthcare where outcome favorability has the strongest influence. (2) Data protection is the strongest driver of moral acceptability in all domains except healthcare, where the strongest driver is outcome favorability. Additionally, sharing data lowers preference for all technologies, but has a relatively smaller influence. People do not show a status quo bias in the adoption of technologies. When evaluating moral acceptability, people show a status quo bias but this is driven by the citizen scores domain. Differences across domains arise from differences in magnitude of the effects but the effects are in the same direction. Taken together, these results highlight that people are not always primarily driven by self-interest and do place importance on potential privacy violations. The results also challenge the assumption that people generally prefer the status quo.”
Louise Lief
An article advocating for philanthropies to embrace “community-based participatory research and other equity approaches to data…to change the game, revitalize research and communities, and realize greater impact.”
Published in Stanford Social Innovation Review in 2020
Journal Article
Benefits
Published in Stanford Social Innovation Review in 2020
Barend Mons
An article arguing for additional investment in ensuring research data is available for reuse and data collaboration and that the “key is to build capacity, enable groups to collaborate nationally and internationally and share good practices so that good data stewardship becomes the rule, not the exception.”
Published in Nature in 2020
Journal Article
Governance and Operations
Benefits
Published in Nature in 2020
Stefaan G. Verhulst, Andrew J. Zahuranec, Andrew Young, and Michelle Winowatan
A GovLab paper meant to inform the on-going exploration of how to enable systematic, sustainable, and responsible re-use of data through cross-sector data collaboration in the public interest (often called Data for Good). Data stewards build trust between organizations, agilely creating relationships between leaders from different sectors and backgrounds.
Published in 2020
Report
Benefits
Governance and Operations
Published in 2020
SPecifically, the position paper seeks to outline the roles and responsibilites of the emergent data steward profession. It is intended to support data-holding businesses and public institutions to create and promote data stewards in the public and private sectors; and to establish a network of these data stewards—as recently recommended by the High Level Expert Group to the European Commission on Business-to-Government Data Sharing.
The Economist
A special report in The Economist that discusses the distribution of value derived from data across various stakeholders.
Published in The Economist in 2020
Magazine Article
Benefits
Risks and Challenges
Incentives
Published in The Economist in 2020
Esther Huyer, Laura van Knippenberg
The report “researches the value created by open data in Europe. […] The report additionally considers how this market size is distributed along different sectors and how many people are employed due to open data. […] Finally, the report also considers examples and insights from open data re-use in organisations.
Published in European Data Portal in 2020
Report
Benefits
Published in European Data Portal in 2020
Abstract:
“The European Data Portal publishes its study “The Economic Impact of Open Data: Opportunities for value creation in Europe”. It researches the value created by open data in Europe. It is the second study by the European Data Portal, following the 2015 report. The open data market size is estimated at €184 billion and forecast to reach between €199.51 and €334.21 billion in 2025. The report additionally considers how this market size is distributed along different sectors and how many people are employed due to open data. The efficiency gains from open data, such as potential lives saved, time saved, environmental benefits, and improvement of language services, as well as associated potential costs savings are explored and quantified where possible. Finally, the report also considers examples and insights from open data re-use in organisations.”
Andrew Young, Stefaan G. Verhulst
“Data collaboratives are an emerging form of public-private partnership in which actors from across sectors exchange and analyze data, or provide data science insights and expertise to create new public value and generate fresh insights (Verhulst & Sangokoya, 2015). Data collaboratives, sometimes referred to as “corporate data philanthropy” (Taddeo, 2017), can be considered a new form of corporate social responsibility in the data age (Verhulst, 2017).”
Published in The Palgrave Encyclopedia of Interest Groups, Lobbying and Public Affairs in 2020
Journal Article
General
Published in The Palgrave Encyclopedia of Interest Groups, Lobbying and Public Affairs in 2020
“The rise of the open data movement means that a growing amount of data is today being broken out of information silos and released or shared with third parties.”
“Data collaboratives, when designed responsibly (Alemanno, 2018), can help to address such shortcomings. They draw together otherwise siloed data – such as, for example, telecom data, satellite imagery, social media data, financial data – and a dispersed range of expertise. In the process, they help match supply and demand, and ensure that the appropriate institutions and individuals are using and analyzing data in ways that maximize the possibility of new, innovative social solutions (de Montjoye, Gambs, Blondel, et al., 2018).”
Jin Wang, Min Chen, Guonian Lü, Songshan Yue, Yongning Wen, Zhenxu Lan, ShuoZhang
The authors propose a data sharing strategy that can improve the usability of hydrological data.
Published in Journal of Hydrology in 2020
Journal Article
In Practice
Published in Journal of Hydrology in 2020
Abstract:
“Data sharing plays a fundamental role in providing data resources for geographic modeling and simulation. Although there are many successful cases of data sharing through the web, current practices for sharing data mostly focus on data publication using metadata at the file level, which requires identifying, restructuring and synthesizing raw data files for further usage. In hydrology, because the same hydrological information is often stored in data files with different formats, modelers should identify the required information from multisource data sets and then customize data requirements for their applications. However, these data customization tasks are difficult to repeat, which leads to repetitive labor. This paper presents a data sharing method that provides a solution for data manipulation based on a structured data description model rather than raw data files. With the structured data description model, multisource hydrological data can be accessed and processed in a unified way and published as data services using a designed data server. This study also proposes a data configuration manager to customize data requirements through an interactive programming tool, which can help in using the data services. In addition, a component-based data viewer is developed for the visualization of multisource data in a sharable visualization scheme. A case study that involves sharing and applying hydrological data is designed to examine the applicability and feasibility of the proposed data sharing method.”
Tomoya Igarashi , Masanori Koizumi, Michael Widdersheim
This paper discusses the use of public library circulation data to understand citizens’ learning pattern.
Published in Libri in 2020
Journal Article
In Practice
Published in Libri in 2020
Abstract:
“The Japanese government has initiated lifelong learning policies to promote lifelong learning to a super-aging society. It is said that lifelong learning contributes to a richer and more fulfilling life. It is within this context that public libraries have been identified as ideal facilities for promoting lifelong learning. To support lifelong learning successfully, libraries must accurately grasp citizens’ needs, all while working within limited budgets. To understand citizens’ learning needs, this study uses public library circulation data. This study is significant because such data use is often unavailable in Japan. This data was used to clarify citizens’ learning interests. Circulation data was compared from two libraries in Japan: Koto District Library in Tokyo and Tahara City Library in Aichi Prefecture. The data was used to identify general learning needs while also accounting for regional differences. The methodology and results of this research are significant for the development of lifelong learning policy and programming.”
Ciro Cattuto, Alessandro Spina
This paper discusses the use of big data in epidemiology by analyzing use cases during the COVID19 pandemic.
Published in Cambridge University Press in 2020
Journal Article
In Practice
Published in Cambridge University Press in 2020
Abstract:
“Amid the outbreak of the SARS-CoV-2 pandemic, there has been a call to use innovative digital tools for the purpose of protecting public health. There are a number of proposals to embed digital solutions into the regulatory strategies adopted by public authorities to control the spread of the coronavirus more effectively. They range from algorithms to detect population movements by using telecommunications data to the use of artificial intelligence and high-performance computing power to detect patterns in the spread of the virus. However, the use of a mobile phone application for contact tracing is certainly the most popular.”
S. Kalkman, M. Mostert, N. Udo-Beauvisage, J. J. van Delden, G. J. van Thiel
From the abstract: “To foster responsible data sharing in health research, ethical governance complementary to the EU General Data Protection Regulation is necessary. A governance framework for Big Data-driven research platforms will at least need to consider the conditions as specified a priori for individual datasets. We aim to identify and analyze these conditions for the Innovative Medicines Initiative’s (IMI) BigData@Heart platform.”
Published in BMC Medical Informatics and Decision Making in 2019
Journal Article
Data Responsibility
Published in BMC Medical Informatics and Decision Making in 2019
From the abstract:
Methods “We performed a unique descriptive case study into the conditions for data sharing as specified for datasets participating in BigData@Heart. Principle investigators of 56 participating databases were contacted via e-mail with the request to send any kind of documentation that possibly specified the conditions for data sharing. Documents were qualitatively reviewed for conditions pertaining to data sharing and data access.”
Results “Qualitative content analysis of 55 relevant documents revealed overlap on the conditions: (1) only to share health data for scientific research, (2) in anonymized/coded form, (3) after approval from a designated review committee, and while (4) observing all appropriate measures for data security and in compliance with the applicable laws and regulations.”
Conclusions “Despite considerable overlap, prespecified conditions give rise to challenges for data sharing. At the same time, these challenges inform our thinking about the design of an ethical governance framework for data sharing platforms. We urge current data sharing initiatives to concentrate on: (1) the scope of the research questions that may be addressed, (2) how to deal with varying levels of de-identification, (3) determining when and how review committees should come into play, (4) align what policies and regulations mean by “data sharing” and (5) how to deal with datasets that have no system in place for data sharing.”
Michelle Winowatan, Andrew Young, Stefaan Verhulst
This case study analyzes the implementation of a data collaborative project, where MIT Media Lab researchers used location data provided by Cuebiq to understand social inequality.
Published in The GovLab in 2020
Case Study
In Practice
Published in The GovLab in 2020
Summary:
“The Atlas of Inequality is a research initiative led by scientists at the MIT Media Lab and Universidad Carlos III de Madrid. It is a project within the larger Human Dynamics research initiative at the MIT Media Lab, which investigates how computational social science can improve society, government, and companies. Using multiple big data sources, MIT Media Lab researchers seek to understand how people move in urban spaces and how that movement influences or is influenced by income. Among the datasets used in this initiative was location data provided by Cuebiq, through its Data for Good initiative. Cuebiq offers location-intelligence services to approved research and nonprofit organizations seeking to address public problems. To date, the Atlas has published maps of inequality in eleven cities in the United States. Through the Atlas, the researchers hope to raise public awareness about segregation of social mobility in United States cities resulting from economic inequality and support evidence-based policymaking to address the issue.
“Data Collaborative Model: Based on the typology of data collaborative practice areas developed by The GovLab, the use of Cuebiq’s location data by MIT Media Lab researchers for the Atlas of Inequality initiative is an example of the research and analysis partnership model of data collaboration, specifically a data transfer approach. In this approach, companies provide data to partners for analysis, sometimes under the banner of “data philanthropy.” Access to data remains highly restrictive, with only specific partners able to analyze the assets provided. Approved uses are also determined in a somewhat cooperative manner, often with some agreement outlining how and why parties requesting access to data will put it to use.
“Data Stewardship Approach: Through its Data for Good initiative, Cuebiq provided access to location data that supported the Atlas’s development. The Data for Good initiative is an example of how active data stewardship can help data-holding companies find public utility in their data beyond their day-to-day business. Such utility includes the creation of scientific and public value by securely providing academic researchers and nonprofit organizations access to Cuebiq’s data assets. These collaborations involve a licensing agreement between Cuebiq and its partners, where all the parties pledge to adhere to certain privacy and data-handling standards to prevent potential risks that may arise, such as data breach and misuse. Further, Cuebiq also created the Director of Research Partnership and Data for Good role to carry out various data stewards functions, including internal coordination, partnership building, and risk assessment. This approach can serve as a good model for how companies can practice responsible data-sharing. Indeed, in early 2020, the MIT Media Lab team began using Cuebiq data to study the effectiveness of social distancing policies in New York City in light of the COVID-19 pandemic.”
Megumi Kubota, Albert G. Zeufack
“This paper investigates the potential benefits for a country from investing in data transparency. The paper shows that increased data transparency can bring substantive returns in lower costs of external borrowing. This result is obtained by estimating the impact of public data transparency on sovereign spreads conditional on the country’s level of institutional quality and public and external debt.”
Published in The World Bank in 2020
Report
Incentives
Published in The World Bank in 2020
From the abstract:
“This result is obtained by estimating the impact of public data transparency on sovereign spreads conditional on the country’s level of institutional quality and public and external debt. While improving data transparency alone reduces the external borrowing costs for a country, the return is much higher when combined with stronger institutional quality and lower public and external debt. Similarly, the returns on investing in data transparency are higher when a country’s integration to the global economy deepens, as captured by trade and financial openness. Estimation of an instrumental variable regression shows that Sub-Saharan African countries could have saved up to 14.5 basis points in sovereign bond spreads and decreased their external debt burden by US$405.4 million (0.02 percent of gross domestic product) in 2018, if their average level of data transparency was that of a country in the top quartile of the upper-middle-income country category. At the country level, Angola could have reduced its external debt burden by around US$73.6 million.”
Mario Schultz, Peter Seele
This paper discusses the political role of corporations in the digital age.
Published in Business Ethics: A European Review in 2020
Journal Article
Data Responsibility
Governance and Operations
Published in Business Ethics: A European Review in 2020
Abstract:
“Building on an illustrative case of a systemic environmental threat and its multi‐stakeholder response, this paper draws attention to the changing political impacts of corporations in the digital age. Political Corporate Social Responsibility (PCSR) theory suggests an expanded sense of politics and corporations, including impacts that may range from voluntary initiatives to overcome governance gaps, to avoiding state regulation via corporate political activity. Considering digitalization as a stimulus, we explore potential responsibilities of corporations toward public goods in contexts with functioning governments. We show that digitalization—in the form of transparency, surveillance, and data‐sharing—offers corporations’ scope for deliberative public participation. The starry sky beetle infestation endangering public and private goods is thereby used to illustrate the possibility of expanding the political role of corporations in the digital sphere. We offer a contribution by conceptualizing data‐deliberation as a Habermasian variation of PCSR, defined as the (a) voluntary disclosure of corporate data and its transparent, open sharing with the public sector (b) along with the cooperation with governmental institutions on data analytics methods for examining large‐scale datasets (c) thereby complying with existing national and international regulations on data protection, in particular with respect to privacy and personal data.”
Michelle Winowatan, Andrew Young, Stefaan Verhulst
This case study contains analysis of the implementation of a data collaborative project and data stewardship practice in the Accelerating Medicines Partnership project.
Published in The GovLab in 2019
Case Study
In Practice
Published in The GovLab in 2019
Summary:
“Accelerating Medicines Partnership (AMP) is a cross-sector data-sharing partnership in the United States between the National Institutes of Health (NIH), the Food and Drug Administration (FDA), multiple biopharmaceutical and life science companies, as well as non-profit organizations that seeks to improve the efficiency of developing new diagnostics and treatments for several types of disease. To achieve this goal, the partnership created a pre-competitive collaborative ecosystem where the biomedical community can pool data and resources that are relevant to the prioritized disease areas. A key component of the partnership is to make biomarkers data available to the medical research community through online portals.”
Data Collaboratives Model: “Based on our typology of data collaborative models, AMP is an example of the data pooling model of data collaboration, specifically a public data pool. Public data pools co-mingle data assets from multiple data holders — in this case pharmaceutical companies — and make those shared assets available on the web. Pools often limit contributions to approved partners (as public data pools are not crowdsourcing efforts), but access to the shared assets is open, enabling independent re-uses.”
Data Stewardship Approach: “Data stewardship is built into the partnership through the establishment of an executive committee, which governs the entire partnership, and a steering committee for each disease area, which governs each of the sub-projects within AMP. These committees consist of representatives from the institutional partners involved in AMP and perform data stewards function including enabling inter-institutional engagement as well as intra-institutional coordination, data audit and assessment of value and risk, communication of findings, and nurture the collaboration to sustainability.”
Jessica Fjeld, Nele Achten, Hannah Hilligoss, Adam Nagy, Madhulika Srikumar
The authors “analyzed the contents of thirty-six prominent AI principles documents, and in the process, discovered thematic trends that suggest the earliest emergence of sectoral norms.”
Published in Berkman Klein Center for Internet & Society in 2020
Paper
General
Data Responsibility
Published in Berkman Klein Center for Internet & Society in 2020
One of the main findings of the analysis concerns eight key themes, which are:
Geoff Boeing, Max Besbris, Ariela Schachter, John Kuk
The authors “synthesize and extend analyses of millions of US Craigslist rental listings and find they supply significantly different volumes, quality, and types of information in different communities.”
Published in Informa UK Limited in 2020
Paper
Operations
General
Benefits
In Practice
Published in Informa UK Limited in 2020
“This article drew together research investigating Craigslist’s online rental housing market and, using new analyses, theorized the impacts of technology platform- mediated housing search. It described the uneven quantity and quality of online information correlated with neighborhood demographics, creating unequal housing information supplies based on where you search. This calls into question the ability of technology platforms to serve as utopian, democratizing, equalizing forces when they rely on human content creation and preexisting sociospatial relations.”
The authors find that “online platforms with user generated content do not automatically smooth information exchange, reduce information asymmetries, or attenuate entrenched sociospatial inequalities. Craigslist data do not capture all neighborhoods—nor the experiences of all renters—equally well. Importantly, Craigslist data alone do not allow policymakers to fully understand the experiences of renters searching for housing in lower income or minority neighborhoods.”
Michelle Winowatan, Andrew Young, Stefaan Verhulst
This case study presents analysis of data stewards practices from the implementation of data collaborative project Global Fishing Watch.
Published in The GovLab in 2020
Case Study
In Practice
Published in The GovLab in 2020
Summary:
“Summary: Global Fishing Watch, originally set up through a collaboration between Oceana, SkyTruth and Google, is an independent nonprofit organization dedicated to advancing responsible stewardship of our oceans through increased transparency in fishing activity and scientific research. Using big data processing and machine learning, Global Fishing Watch visualizes, tracks, and shares data about global fishing activity in near-real time and for free via their public map. To date, the platform tracks approximately 65,000 commercial fishing vessels globally. These insights have been used in a number of academic publications, ocean advocacy efforts, and law enforcement activities.
“Data Collaborative Model: Based on the typology of data collaborative practice areas, Global Fishing Watch is an example of the data pooling model of data collaboration, specifically a public data pool. Public data pools co-mingle data assets from multiple data holders — including governments and companies — and make those shared assets available on the web. This approach enabled the data stewards and stakeholders involved in Global Fishing Watch to bring together multiple data streams from both public- and private-sector entities in a single location. This single point of access provides the public and relevant authorities with user-friendly access to actionable, previously fragmented data that can drive efforts to address compliance in fisheries and illegal fishing around the world.
“Data Stewardship Approach: Global Fishing Watch also provides a clear illustration of the importance of data stewards. For instance, representatives from Google Earth Outreach, one of the data holders, played an important stewardship role in seeking to connect and coordinate with SkyTruth and Oceana, two important nonprofit environmental actors who were working separately prior to this initiative. The brokering of this partnership helped to bring relevant data assets from the public and private sectors to bear in support of institutional efforts to address the stubborn challenge of illegal fishing.”
Alessandro Blasimme, Effy Vayena
In this paper, the authors “make the case for adaptive and principle-based governance of big data research [and] then outline six principles of adaptive governance (AFIRRM) for big data research and discuss key factors for their implementation into effective governance structures and processes.”
Published in Cambridge University Press in 2019
Paper
Governance and Operations
Published in Cambridge University Press in 2019
The authors argue that adaptive governance is “a suitable regulatory style for big data health research by proposing six governance principles to foster the development of appropriate structures and processes to handle critical aspects of big data health research.”
The six principles are adaptivity, flexibility, inclusiveness, responsiveness, reflexivity, and monitoring (AFIRRM).
To encourage adoption of these principles the authors “have advanced AFIRRM as a viable model for the creation of regulatory tools in the space of big data health research. [Their] model is based on a careful analysis of regulatory scholarship vis-à-vis the key attributes of this type of research. [They] are currently undertaking considerable efforts to introduce AFIRRM to regulators, operators and organizations in the space of research or health policy.”
Peter Dabrock
This article discusses the balance between maintaining data sovereignty and bolstering AI and machine learning enabled innovation.
Published in Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society in 2020
Essay
Data Responsibility
Benefits
Published in Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society in 2020
Abstract:
“Ethical considerations and governance approaches of AI are at a crossroads. Either one tries to convey the impression that one can bring back a status quo ante of our given “onlife”-era [1,2], or one accepts to get responsibly involved in a digital world in which informational self-determination can no longer be safeguarded and fostered through the old fashioned data protection principles of informed consent, purpose limitation and data economy [3,4,6]. The main focus of the talk is on how under the given conditions of AI and machine learning, data sovereignty (interpreted as controllability [not control (!)] of the data subject over the use of her data throughout the entire data processing cycle [5]) can be strengthened without hindering innovation dynamics of digital economy and social cohesion of fully digitized societies. In order to put this approach into practice the talk combines a presentation of the concept of data sovereignty put forward by the German Ethics Council [3] with recent research trends in effectively applying the AI ethics principles of explainability and enforceability [4-9].”
Julia Stoyanovich, Jay J. Van Bavel, Tessa V. West
This essay discusses a framework “needed to connect interpretability and trust in algorithm-assisted decisions, for a range of stakeholders.”
Published in Nature Machine Intelligence in 2020
Essay
Data Responsibility
Published in Nature Machine Intelligence in 2020
The authors conclude that:
“The integration of expertise from behavioural science and computer science is essential to making algorithmic systems interpretable by a wide range of stakeholders, allowing people to exercise agency and ultimately building trust. Individuals and groups who distrust algorithms may be less likely to harness the potential benefits of new technology, and, in this sense, interpretability intimately relates to equity. Education is an integral part of making explanations effective. Recent studies found that individuals who are more familiar with AI fear it less, and are more optimistic about its potential societal impacts24. We share this cautious optimism, but predicate it on helping different stakeholders move beyond the extremes of unbounded techno-optimism and techno-criticism, and into a nuanced and productive conversation about the role of technology in society.”
Ruoxi Jia
“People give massive amounts of their personal data to companies every day and these data are used to generate tremendous business values. Some economists and politicians argue that people should be paid for their contributions—but the million-dollar question is: by how much?
“This article discusses methods proposed in our recent AISTATS and VLDB papers that attempt to answer this question in the machine learning context.”
Published in The Berkeley Artificial Intelligence Research in 2019
Essay
Guide
Operations
Published in The Berkeley Artificial Intelligence Research in 2019
The author examines existing data valuation framework and finds some gap in this practice, which are:
“Task-specificness: The value of data depends on the task it helps to fulfill. For instance, if Alice’s medical record indicates that she has disease A, then her data will be more useful to predict disease A as opposed to other diseases.
“Fairness: The quality of data from different sources varies dramatically. In the worst-case scenario, adversarial data sources may even degrade model performance via data poisoning attacks. Hence, the data value should reflect the efficacy of data by assigning high values to data which can notably improve the model’s performance.
“Efficiency: Practical machine learning tasks may involve thousands or billions of data contributors; thus, data valuation techniques should be capable of scaling up.
Based on this observation, the author discusses “a principled notion of data value and computationally efficient algorithms for data valuation.”
Mara Maretti, Vanessa Russo, & Emiliano del Gobbo
This paper investigates “the communication structure and the governance of open data in the Twitter conversational environment”, highlighting that while there is an Italian open data infrastructure, there are weaknesses in governance and practical reuse of data.
Published in Quality & Quantity in 2020
Journal Article
Governance and Operations
General
Published in Quality & Quantity in 2020
Abstract:
The expression ‘open data’ relates to a system of informative and freely accessible databases that public administrations make generally available online in order to develop an informative network between institutions, enterprises and citizens. On this topic, using the semantic network analysis method, the research aims to investigate the communication structure and the governance of open data in the Twitter conversational environment. In particular, the research questions are: (1) Who are the main actors in the Italian open data infrastructure? (2) What are the main conversation topics online? (3) What are the pros and cons of the development and use (reuse) of open data in Italy? To answer these questions, we went through three research phases: (1) analysing the communication network, we found who are the main influencers; (2) once we found who were the main actors, we analysed the online content in the Twittersphere to detect the semantic areas; (3) then, through an online focus group with the main open data influencers, we explored the characteristics of Italian open data governance. Through the research, it has been shown that: (1) there is an Italian open data governance strategy; (2) the Italian civic hacker community plays an important role as an influencer; but (3) there are weaknesses in governance and in practical reuse.
Wolfgang Kerber
This paper discusses the importance of analysis and regulatory design in re-imagining data governance systems, and proposes potential instruments that can be used within these systems.
Published in University of Marburg - School of Business & Economics in 2020
Journal Article
General
In Practice
Governance and Operations
Published in University of Marburg - School of Business & Economics in 2020
Abstract:
Starting with the assumption that under certain conditions also mandatory solutions for access to privately held data can be necessary, this paper analyses the legal and regulatory instruments for the implementation of such data access solutions. After an analysis of advantages and problems of horizontal versus sectoral access solutions, the main thesis of this paper is that focusing only on data access solutions is often not enough for achieving the desired positive effects on competition and innovation. An analysis of the two examples access to bank account data (PSD2: Second Payment Service Directive) and access to data of the connected car shows that successful data access solutions might require an entire package of additional complementary regulatory solutions (e.g. regarding interoperability, standardisation, and safety and security), and therefore the analysis and regulatory design of entire data governance systems (based upon an economic market failure analysis). In the last part important instruments that can be used within data governance systems are discussed, like, e.g. data trustee solutions.
Jorg Hoffmann and Begona Gonzalaz Otero
This paper addresses the lack of understanding in law and policy discourse pertaining to data interoperability, and subsequently, “explains the technical complexity of interoperability and its enablers, namely data standards and application programming interfaces.”
Published in Max Planck Institute for Innovation & Competition in 2020
Paper
General
Risks and Challenges
Published in Max Planck Institute for Innovation & Competition in 2020
Abstract:
In the current data access and sharing debate, data interoperability is widely proclaimed as being key for efficiently reaping the economic welfare enhancing effects of further data re-use. Although, we agree, we found that the current law and policy framework pertaining data interoperability was missing a groundworks analysis. Without a clear understanding of the notions of interoperability, the role of data standards and application programming interfaces (APIs) to achieve this ambition, and the IP and trade secrets protection potentially hindering it, any regulatory analysis within the data access discussion will be incomplete. Any attempt at untangling the role of data interoperability in the access and sharing regimes requires a thorough understanding of the underlying technology and a common understanding of the different notions of data interoperability. The paper firstly explains the technical complexity of interoperability and its enablers, namely data standards and application programming interfaces. It elaborates on the reasons data interoperability counts with different levels and puts emphasis on the fact that data interoperability is indirectly tangled to the data access right. Since data interoperability may be part of the legal obligations correlating to the access right, the scope of interoperability is and has already been subject to courts’ interpretation. While this may give some manoeuvre for balanced decision-making, it may not guarantee the ambition of efficient re-usability of data. This is why data governance market regulation under a public law approach is becoming more favourable. Yet, and this is elaborated in a second step, the paper builds on the assumption that interoperability should not become another policy on its own. This is followed by a competition economics assessment, taking into account that data interoperability is always a matter of degree and a lack of data interoperability does not necessarily lead to a market foreclosure of competitors and to causing harm to consumer welfare. Additionally, parts of application programming interfaces (APIs) may be protected under IP rights and trade secrets, which might conflict with data access rights. Instead of further solving the conflicting regimes within the respective legal regimes of the exclusive rights the paper concludes by suggesting that (sector-specific) data governance solutions should deal with this issue and align the different interests implied. This may provide for better, practical and well-balanced solutions instead of impractical and dysfunctional exceptions and limitations within the IP and trade secrets regimes.
Leonardo M. Millefiori, Paolo Braca, Dimitris Zissis, Giannis Spiliopoulos, Stefano Marano, Peter K. Willett, Sandro Carniel
This paper analyzes the effects that the COVID-19 pandemic and the containment measures had on the shipping industry using maritime traffic data.
Published in 2020
Paper
General
In Practice
Published in 2020
Abstract:
To prevent the outbreak of the Coronavirus disease (COVID-19), numerous countries around the world went into lockdown and imposed unprecedented containment measures. These restrictions progressively produced changes to social behavior and global mobility patterns, evidently disrupting social and economic activities. Here, using maritime traffic data, collected via a global network of Automatic Identification System (AIS) receivers, we analyze the effects that the COVID-19 pandemic and the containment measures had on the shipping industry, which accounts alone for more than 80% of the world trade. We introduce the notion of a “maritime mobility index,” a synthetic composite index, to quantitatively assess ship mobility in a given unit of time. The mobility index calculation used in this study, has a worldwide extent and is based on the computation of cumulative navigated miles (CNM) of all ships reporting their position and navigational status via AIS. We compare 2020 mobility levels to those of previous years assuming that an unchanged growth rate would have been achieved, if not for COVID-19. Following the outbreak, we find an unprecedented drop in maritime mobility, across all categories of commercial shipping. The reduced activity is observable from March to June, when the most severe restrictions were in force, producing a variation of mobility quantified between -5.62% and -13.77% for container ships, between +2.28% and -3.32% for dry bulk, between -0.22% and -9.27% for wet bulk, and between -19.57% and -42.77% for passenger shipping. The presented study is unprecedented for the uniqueness and completeness of the employed AIS dataset, which comprises a trillion AIS messages broadcast worldwide by 50000 ships, a figure that closely parallels the documented size of the world merchant fleet.
Barbara Wixom, Ina Sebastian, and Robert Gregory
In this paper, MIT CISR Research discuss their findings on interorganizational data sharing; “this briefing introduces three sets of practices—curated content, designated channels, and repeatable controls—that help companies accelerate data sharing 2.0.”
Published in MIT CISR research in 2020
Paper
In Practice
General
Published in MIT CISR research in 2020
Abstract
MIT CISR research has found that interorganizational data sharing is a top concern of companies; leaders often find data sharing costly, slow, and risky. Interorganizational data sharing, however, is requisite for new value creation in the digital economy. Digital opportunities require data sharing 2.0: cross-company sharing of complementary data assets and capabilities, which fills data gaps and allows companies, often collaboratively, to develop innovative solutions. This briefing introduces three sets of practices—curated content, designated channels, and repeatable controls—that help companies accelerate data sharing 2.0.
Geoff Boeing
OpenStreetMap is a crowd-sourced, worldwide mapping project and geospatial data repository; Boeing discusses how “ubiquitous urban data and computation can open up new urban form analyses from both quantitative and qualitative perspectives.”
Published in Urban Experience and Design: Contemporary Perspectives on Improving the Public Realm in 2020
Journal Article
General
Benefits
Published in Urban Experience and Design: Contemporary Perspectives on Improving the Public Realm in 2020
Abstract
This chapter introduces OpenStreetMap—a crowd-sourced, worldwide mapping project and geospatial data repository—to illustrate its usefulness in quickly and easily analyzing and visualizing planning and design outcomes in the built environment. It demonstrates the OSMnx toolkit for automatically downloading, modeling, analyzing, and visualizing spatial big data from OpenStreetMap. We explore patterns and configurations in street networks and buildings around the world computationally through visualization methods—including figure-ground diagrams and polar histograms—that help compress urban complexity into comprehensible artifacts that reflect the human experience of the built environment. Ubiquitous urban data and computation can open up new urban form analyses from both quantitative and qualitative perspectives.
Claudia Wells
This article discusses the importance of timely, disaggregated, community-level data on gender based violence. “As illustrated by experiences in Nepal, data collected within communities can play a vital role to fill the gaps and ensure that data-informed policies reflect the lived experiences of the most marginalized women and girls.”
Published in SDG Knowledge Hub in 2020
Magazine Article
In Practice
General
Published in SDG Knowledge Hub in 2020
A shocking increase in violence against women and girls has been reported in many countries during the COVID-19 pandemic, amounting to what UN Women calls a “shadow pandemic.”
The jarring facts are:
The response to these new challenges were discussed at a meeting in July with a community-led response delivered through local actors highlighted as key. This means that timely, disaggregated, community-level data on the nature and prevalence of gender-based violence has never been more important. Data collected within communities can play a vital role to fill the gaps and ensure that data-informed policies reflect the lived experiences of the most marginalized women and girls.
Andreas Backhus
Though the supply of data has increased since the beginning of the pandemic, “a number of pitfalls have arisen with regard to the interpretation of the data and the conclusions that can be drawn from them.” The paper seeks to highlight these pitfalls to inform the future course of policy action.
Published in Intereconomics
Paper
General
Risks and Challenges
Published in Intereconomics
Policymakers, experts and the general public heavily rely on the data that are being reported in the context of the coronavirus pandemic. Daily data releases on confirmed COVID-19 cases and deaths provide information on the course of the pandemic. The same data are also essential for the estimation of indicators such as the reproduction rate and for the evaluation of policy interventions that seek to slow down the pandemic.
Together with the proliferation of data, however, a number of pitfalls have arisen with regard to the interpretation of the data and the conclusions that can be drawn from them. The aim of this paper is to highlight the most common among these pitfalls given that they have the potential to intentionally or unintentionally mislead the public debate and thereby the course of future policy actions.
The list of pitfalls presented is non-exhaustive. In fact, as the supply of data has increased since the beginning of the pandemic, new pitfalls have emerged in parallel, while others have decreased in relevance; a tendency that seems likely to continue into the future. Beyond explaining some of the current pitfalls, this paper will serve as a more general caveat regarding the interpretation of data in the context of the SARS-CoV-2 pandemic.
Rebecca Root
The report outlines recommendations for “improved collaboration between government agencies and other institutions involved with groundwater data collection, alongside the development of more accessible databases and increased awareness of existing data.”
Published in Devex in 2020
Magazine Article
In Practice
General
Published in Devex in 2020
Excerpt -
“A lack of data on groundwater is impeding water management and could jeopardize climate resilience efforts in some places, according to recent research by WaterAid and the HSBC Water Programme.
Groundwater is found underground in gaps between soil, sand, and rock. Over 2.5 million people are thought to depend on groundwater — which has a higher tolerance to droughts than other water sources — for drinking.
The report looked at groundwater security and sustainability in Bangladesh, Ghana, India, Nepal, and Nigeria, where collectively more than 160 million people lack access to clean water close to home. It found that groundwater data tends to be limited — including on issues such as overextraction, pollution, and contamination — leaving little evidence for decision-makers to consider for its management.”
Massimo Russo, David Young, Tian Feng, and Marine Gerard
This article discusses the importance of sharing data to address societal challenges – “Financial inclusion, crisis response, resource conservation, public health, and climate change are all examples of data mega-use-cases encompassed by the SDGs. But data can only contribute to the solution of these problems if it is readily available and shared.”
Published in BCG in 2021
Blog Post
Benefits
General
Published in BCG in 2021
Excerpt –
“Achieving most, if not all, of the UN’s 17 sustainable development goals (SDGs) will require the use of data from multiple public and private sources. In this sense, the SDGs are the embodiment of what we call “data mega-use-cases”: complex problems and opportunities affecting many different individuals, companies, organizations, and governments. Financial inclusion, crisis response, resource conservation, public health, and climate change are all examples of data mega-use-cases encompassed by the SDGs. But data can only contribute to the solution of these problems if it is readily available and shared.”
Veronica Barassi
“An examination of the datafication of family life—in particular, the construction of our children into data subjects.”
Published in MIT Press in 2020
Paper
General
Published in MIT Press in 2020
From the summary:
“Our families are being turned into data, as the digital traces we leave are shared, sold, and commodified. Children are datafied even before birth, with pregnancy apps and social media postings, and then tracked through childhood with learning apps, smart home devices, and medical records. In Child Data Citizen, Veronica Barassi examines the construction of children into data subjects, describing how their personal information is collected, archived, sold, and aggregated into unique profiles that can follow them across a lifetime. Children today are the very first generation of citizens to be datafied from before birth, and Barassi points to critical implications for our democratic futures.
Barassi draws on a three-year research project with parents in London and Los Angeles, which included the collection of fifty in-depth interviews, a digital ethnography of “sharenting” activities on social media by eight families over the course of eight months, and a two-year exploration of the datafication of her own family. She complements her ethnographic findings with a platform analysis of four social media platforms, ten health tracking apps, four home hubs, and four educational platforms, investigating the privacy policies, business models, and patent applications that enable the mining of children’s data. Barassi considers the implications of building a society where data traces are made to speak for and about citizens across a lifetime. What should we do when we realize that the narratives that algorithms construct about individuals are inaccurate and biased?”
Shomik Jain, Davide Proserpio, Giovanni Quattrone, Daniele Quercia
In this paper, the authors “find that Airbnb data (especially its unstructured part) appears to nowcast neighborhood gentrification, measured as changes in housing affordability and demographics.” Their results “suggest that user-generated data from online platforms can be used to create socioeconomic indices to complement traditional measures that are less granular, not in real-time, and more costly to obtain.”
Published in 2021
Journal Article
In Practice
General
Published in 2021
Abstract –
“There is a rumbling debate over the impact of gentrification: presumed gentrifiers have been the target of protests and attacks in some cities, while they have been welcome as generators of new jobs and taxes in others. Census data fails to measure neighborhood change in real-time since it is usually updated every ten years. This work shows that Airbnb data can be used to quantify and track neighborhood changes. Specifically, we consider both structured data (e.g. number of listings, number of reviews, listing information) and unstructured data (e.g. user-generated reviews processed with natural language processing and machine learning algorithms) for three major cities, New York City (US), Los Angeles (US), and Greater London (UK). We find that Airbnb data (especially its unstructured part) appears to nowcast neighborhood gentrification, measured as changes in housing affordability and demographics. Overall, our results suggest that user-generated data from online platforms can be used to create socioeconomic indices to complement traditional measures that are less granular, not in real-time, and more costly to obtain.”
Nicolás Gonzálvez-Galleg and Laura Nieto-Torrejón
“This paper examines if open government data, a promising governance strategy, may help to boost Millennials’ and Generation Z trust in public institutions and satisfaction with public outcomes.”
Published in PLOS One in 2021
Journal Article
In Practice
Benefits
Published in PLOS One in 2021
Abstract –
“Scholars and policy makers are giving increasing attention to how young people are involved in politics and their confidence in the current democratic system. In a context of a global trust crisis in the European Union, this paper examines if open government data, a promising governance strategy, may help to boost Millennials’ and Generation Z trust in public institutions and satisfaction with public outcomes. First, results from our preliminary analysis challenge some popular beliefs by revealing that younger generations tend to trust in their institutions notably more than the rest of the European citizens. In addition, our findings show that open government data is a trust-enabler for Millennials and Generation Z, not only through a direct link between both, but also thanks to the mediator role of citizens’ satisfaction. Accordingly, public officers are encouraged to spread the implementation of open data strategies as a way to improve younger generations’ attachment to democratic institutions.”
Stefan Wojcik, Avleen Bijral, Richard Johnston, Juan Miguel Lavista, Gary King, Ryan Kennedy, Alessandro Vespignani, and David Lazer
“We demonstrate how behavioral research, linking digital and real-world behavior, along with human computation, can be utilized to improve the performance of studies using digital data streams. This study looks at the use of search data to track prevalence of Influenza-Like Illness (ILI). We build a behavioral model of flu search based on survey data linked to users’ online browsing data. We then utilize human computation for classifying search strings.”
Published in Nature Communications in 2021
Journal Article
General
Published in Nature Communications in 2021
Abstract –
“While digital trace data from sources like search engines hold enormous potential for tracking and understanding human behavior, these streams of data lack information about the actual experiences of those individuals generating the data. Moreover, most current methods ignore or under-utilize human processing capabilities that allow humans to solve problems not yet solvable by computers (human computation). We demonstrate how behavioral research, linking digital and real-world behavior, along with human computation, can be utilized to improve the performance of studies using digital data streams. This study looks at the use of search data to track prevalence of Influenza-Like Illness (ILI). We build a behavioral model of flu search based on survey data linked to users’ online browsing data. We then utilize human computation for classifying search strings. Leveraging these resources, we construct a tracking model of ILI prevalence that outperforms strong historical benchmarks using only a limited stream of search data and lends itself to tracking ILI in smaller geographic units. While this paper only addresses searches related to ILI, the method we describe has potential for tracking a broad set of phenomena in near real-time.”
François Candelon, Massimo Russo, Rodolphe Charme di Carlo, Hind El Bedraoui, Tian Feng
In this article, analysts at The Boston Consulting Group’s Henderson Institute discuss four common barriers to data sharing and six rules that can govern and facilitate data sharing.
Published in The BCG Henderson Institute in 2020
Essay
Governance and Operations
Published in The BCG Henderson Institute in 2020
The paper cites trust and privacy, transaction costs, competitive concerns, and lost financial opportunities as the four main barriers to data sharing. The authors further explain each barrier as follows:
To address these concerns, the analysts provide six rules that can simplify data sharing, which are: Rule 1. Understand what people really do. Rule 2. Reinforce the integrators. Rule 3. Increase the total quantity of power. Rule 4. Increase reciprocity. Rule 5. Expand the shadow of the future. Rule 6. Reward those who cooperate.
Melissa Stock, Tom Orrell
Building upon issues discussed in the C4DC report, “Laying the Foundation for Effective Partnerships: An Examination of Data Sharing Agreements,” this brief examines the potential of sunset clauses or sunset provisions to be a legally binding, enforceable, and accountable way of ensuring COVID-19 related data sharing agreements are wound down responsibly at the end of the pandemic.
Published in Contracts for Data Collaboration in 2020
Report
In Practice
Published in Contracts for Data Collaboration in 2020
The use of data and technology in tackling the COVID-19 pandemic has brought about a range of new questions around how data should or should not be used; intellectual property rights; limitations on data re-use; how long data should be used for; and ultimately what should happen to collected data once the pandemic is over.
“The brief is divided into four substantive parts and is rounded-off with concluding thoughts. Part I introduces sunset clauses as legislative tools, highlighting a number of examples of how they have been used in both COVID-19 related and other contexts. It also shows their potential value as a tool to eventually wind down the use of sensitive data to trace and track individuals. Part II discusses sunset provisions in the context of data sharing agreements and attempts to explain the complex interrelationship between data ownership, intellectual property, and sunset provisions. Building on this, Part III identifies some key issues policymakers should consider when assessing the utility and viability of sunset provisions within their data sharing agreements and arrangements. Finally, Part IV highlights the value of a memorandum of understanding (MoU) as a viable vehicle for sunset provisions in contexts where data sharing agreements are either non-existent or not regularly used.”
The three main findings of the paper are: • Sunset clauses could be effectively used in existing emergency power legislation to limit the risk of governments using sensitive data beyond the end of the pandemic. • There is an opportunity to introduce sunset provisions into legally binding agreements around data sharing as part of the COVID-19 response to help safeguard rights and limit the future use of personal data. • In contexts where data sharing agreements are not routinely used to set terms for data sharing, a memorandum of understanding between parties engaged in data sharing during the COVID-19 pandemic could be used to stipulate how sharing can be wound down.
Tom Orrell, Hayden Dahmm
This project from TReNDS, the GovLab at New York University, University of Washington, and the World Economic Forum aims to shed light on the opportunities and challenges inherent to data collaboratives, and facilitate understanding of the written agreements that underpin them.
Published in Contracts for Data Collaboration in 2019
Guide
In Practice
Operations
Published in Contracts for Data Collaboration in 2019
Excerpts:
“Contracts for Data Collaboration (C4DC) is a partnership between the Sustainable Development Solutions Network’s Thematic Research Network on Data and Statistics (SDSN TReNDS), the University of Washington’s (UW) Center for Information Assurance and Cybersecurity, the World Economic Forum (WEF), and the Governance Lab at New York University (“the GovLab”).
“The project aims to shed light on the opportunities and challenges inherent to data collaboratives. GovLab has worked to document the experiences of data collaboratives, specifically practices that involve the exchange of data or other data-related actions taking place between public and private entities (GovLab 2015). These exchanges and other data actions take place in various ways, from the informal sharing of insights and data-handling practices that don’t involve the sharing of raw data, through to more formal and legally-binding data sharing agreements (DSAs).
“The project is directed towards a number of audiences. These span government data and information management professionals, statisticians, and policymakers, through to development and humanitarian organizations that rely on data to carry out their programs of work. Businesses, data producers, and data stewards considering sharing their data for public good and researchers and academics exploring the use and potential of data sharing agreements are also intended audiences. In this way, the C4DC partnership spans the data supply-and-demand landscape.
“The project aims to provide these groups and others with a range of tools to facilitate understanding of the opportunities and challenges related to formalized data sharing that is underpinned by written agreements. Project outputs will include an online repository of DSAs, together with an analytical framework that will help to demystify these types of agreements for non-legal professionals, raising their understanding of the issues involved in data sharing and related actions. In the medium-term, the project hopes to also produce and publish a series of case studies explaining when and how DSAs are, and should be, most effectively applied for maximum impact and with minimum risk in the public interest.”
Hayden Dahmm
This case study documents a data collaboration between the Ghana Statistical Service (GSS), Vodafone Ghana, and Flowminder. The collaboration enabled the GSS to access insights from mobile phone data to plan public health and sustainable development policies, and it is a useful example of how a government and a private company were able to work with an intermediary partner to gain insights from sensitive data.
Published in Contracts for Data Collaboration in 2020
Report
In Practice
Operations
Published in Contracts for Data Collaboration in 2020
Abstract:
“A data collaboration in 2018 between the Ghana Statistical Service (GSS), Vodafone Ghana, and Flowminder, enabled the GSS to access insights from mobile phone data to plan public health and sustainable development policies. This case study demonstrates how a government and a private company were able to work with an intermediary partner to gain insights from sensitive data. As part of the collaboration, Vodafone Ghana provided access to pseudonymized telecommunications data free of charge, and Flowminder aggregated and analyzed the data on behalf of GSS. Initially, the parties had planned to form a non-binding agreement, but national regulators requested a formal agreement that addressed various data concerns. From the initial draft of the agreement provided by Vodafone Ghana to the final approval, negotiations took some 13 months. The negotiations were made especially challenging because GSS did not have its own legal counsel, and the process overlapped with the introduction of the General Data Protection Regulation (GDPR). Among other issues, the agreement addresses how the data will be aggregated, the parameters for the exchange of the data between the parties, data use limitations, data deletion, and the publication of analysis results. After signing the agreement in December 2018, the parties have since enjoyed a successful collaboration, and the mobile data being used by GSS has proven especially valuable during the current Covid-19 pandemic to document the impact of restriction measures in Ghana.”
Contracts for Data Collaboration
The COVID-19 pandemic has resulted in new demands for data, in particular, there is growing interest in the use of mobile network operator (MNO) data for tracking population movement. In response to this, SDSN TReNDS on behalf of C4DC, has gathered and analyzed example data sharing agreements (DSAs) that have been used to share MNO data for health applications to help guide other data actors considering similar arrangements.
Published in Contracts for Data Collaboration in 2020
Guide
In Practice
Published in Contracts for Data Collaboration in 2020
The COVID-19 pandemic has resulted in new demands for data, and many countries are engaging in emergency data sharing arrangements to understand different dimensions of the health crisis. In particular, there is growing interest in the use of mobile network operator (MNO) data for tracking population movement. As described by Digital Impact Alliance, MNO data can be used to understand mobility patterns, predict new COVID-19 hotspots, model social distancing targets, and monitor health services. The inherent sensitivity of MNO data, however, raises privacy concerns, among other questions. GSMA - an international body representing the interests of more than 750 MNOs - has published COVID-19 privacy guidelines that address how its members can maintain trust while meeting requests from governments. The group Privacy International is also recording the ways that countries around the world are employing MNO data during the pandemic, and it has reported on situations where some of this data use might be illegal, such as the way Pakistan apparently accessed patients’ registered mobile numbers without consent. Even under emergency conditions, MNO data sharing will have to be managed thoughtfully to realize the potential benefits without creating avoidable harm.
Hayden Dahmm
This case study documents a data management plan (DMP) established by Development Gateway and the Government of Moldova to collect data on its development and aid programs. It is a useful example of an unsigned data sharing agreement, which can be a valuable alternative for certain data collaborations. The plan was produced through joint-collaborations and negotiations between the Government of Moldova, Development Gateway, and other data contributors.
Published in Contracts for Data Collaboration in 2019
Case Study
In Practice
Published in Contracts for Data Collaboration in 2019
Abstract:
“In 2013, the Government of Moldova and Development Gateway worked together to create an aid management platform to collect data from different development actors across the country. To accompany the platform, a data management plan (DMP) was established to provide guidelines for data submissions. The plan was first drafted by the government and Development Gateway, and it later incorporated feedback from various development actors and data providers. Most important, the plan was written as a flexible living document and supported by an accompanying law that increased the incentives for submitting data electronically. The DMP has proven effective for project implementation in Moldova, with development actors continuing to regularly submit data.”
Hayden Dahmm
This case study documents a successful data sharing agreement between a private company and an international organization dealing with global environmental data. Google and UN Environment signed a memoranda of understanding (MOU) in 2018 that formalized their collaboration around measures of surface water. Informed by interviews with the parties and the actual agreement text, this case study describes how the MOU was negotiated and highlights key elements from the MOU.
Published in Contracts for Data Collaboration in 2019
Case Study
In Practice
Published in Contracts for Data Collaboration in 2019
Abstract:
“In 2018, UN Environment and Google teamed up to create a global indicator of surface water that has now been incorporated into official Sustainable Development Goal reporting. The collaboration built on satellite derived measures of surface water produced by Google and the European Commission Joint Research Center, filling important data gaps for UN Environment. Although the collaboration involved publicly available data, the partners decided to form a memorandum of understanding, in part to show countries that the data was created through a partnership, and also to secure a long-term commitment to data sharing between Google and UN Environment. The agreement was kept general to provide flexibility, but it served to define the intent of the collaboration, commit to make data products publicly available, and indicate areas of primary responsibility, as well as to clarify the duration of the collaboration and to explain that there was no preferential treatment involved. The project was also enabled by a mutual spirit of cooperation and a collaboratively-developed work plan.”
Tom Orrell, Hayden Dahmm
This insight report draws extensively on interviews with partners and key stakeholders. It aims to break down the complexities and opacity of DSAs as much as possible and to raise awareness of their value and potential, using terms from project partners and development and humanitarian professionals.
Published in Contracts for Data Collaboration in 2019
Paper
Operations
Published in Contracts for Data Collaboration in 2019
Excerpt:
“Contracts for Data Collaboration (C4DC) (UNSDSN 2019) aims to shed light on the opportunities and challenges inherent to data sharing practices. The project will work to provide policy makers, development and humanitarian organisations, and private companies with a range of tools for them to better understand formalized data sharing practices that are underpinned by written agreements. Project outputs will include an online repository of data sharing agreements (DSAs) by the end of 2019, together with an analytical framework explaining the relevance of each part of a DSA. In the mediumterm, the project hopes to also produce and publish a series of case studies documenting when and how DSAs are, and can be, most effectively applied for maximum impact and with minimum risk within development and humanitarian contexts. This insight report draws extensively on interviews with partners and key stakeholders. It aims to break down the complexities and opacity of DSAs as much as possible and to raise awareness of their value and potential, using terms from project partners and development and humanitarian professionals.”
Steve MacFeely, Bojan Nastav
“This paper argues that the Global Indicator Framework required to support the 2030 Agenda Sustainable Development Goals will not be successfully populated, using only existing approaches and mechanisms.”
Published in Statistical Journal of the IAOS in 2019
Journal Article
In Practice
Governance and Operations
Published in Statistical Journal of the IAOS in 2019
From Abstract:
This paper argues that the Global Indicator Framework required to support the 2030 Agenda Sustainable Development Goals will not be successfully populated, using only existing approaches and mechanisms. Official statistical systems must adapt and consider new approaches if only partial success is to be averted. This paper presents a proposal to accredit unofficial statistics as official for the purposes of compiling sustainable development goal indicators. While there may be some reluctance, and there are certainly risks with this proposal, the arguments put forward highlight the potential for collaboration. Keywords: 2030 Agenda, accreditation, risk management, fundamental principles of official statistics.
Nambamallika Dehinga and Anita Raj
Nambamallika Dehinga and Anita Raj at UCSD present findings from an “analysis of a corpus of tweets by 59 Indian feminist activists, tweeted between March and August 2020. The analysis examines how the feminist community in India has used Twitter as a tool for activism during the COVID-19 pandemic.”
Published in Data2x in 2021
Paper
Risks and Challenges
Published in Data2x in 2021
Abstract:
Big data can help fill these data gaps when traditional data is lacking or unavailable by providing unique insights into a wide range of issues affecting women and girls.
Researchers from the Center on Gender Equity and Health at the University of California at San Diego (UC San Diego) have undertaken a series of briefs that analyze big data — such as, Twitter data and Google Trends — to highlight key gendered issues during the pandemic within select country contexts and to offer a “how-to” section for replication of these analyses in any country context. For each brief in the series, researchers provide reproducible codes along with a description of the methodology. Briefs will be published once a month. This work was supported by a grant to UC San Diego from the Bill and Melinda Gates Foundation.
Harrison Wilde , Lucia L. Chen , Austin Nguyen , Zoe Kimpel , Joshua Sidgwick , Adolfo De Unanue , Davide Veronese , Bilal Mateen , Rayid Ghani and Sebastian Vollmer
This paper discusses “work carried out in partnership with Homeless Link (HL), a UK-based charity, in developing a data-driven approach to better connect people sleeping rough on the streets with outreach service providers.”
Published in Data & Policy in 2021
Paper
General
Risks and Challenges
Published in Data & Policy in 2021
Abstract:
Rough sleeping is a chronic experience faced by some of the most disadvantaged people in modern society. This paper describes work carried out in partnership with Homeless Link (HL), a UK-based charity, in developing a data-driven approach to better connect people sleeping rough on the streets with outreach service providers. HL’s platform has grown exponentially in recent years, leading to thousands of alerts per day during extreme weather events; this overwhelms the volunteer-based system they currently rely upon for the processing of alerts. In order to solve this problem, we propose a human-centered machine learning system to augment the volunteers’ efforts by prioritizing alerts based on the likelihood of making a successful connection with a rough sleeper. This addresses capacity and resource limitations whilst allowing HL to quickly, effectively, and equitably process all of the alerts that they receive. Initial evaluation using historical data shows that our approach increases the rate at which rough sleepers are found following a referral by at least 15% based on labeled data, implying a greater overall increase when the alerts with unknown outcomes are considered, and suggesting the benefit in a trial taking place over a longer period to assess the models in practice. The discussion and modeling process is done with careful considerations of ethics, transparency, and explainability due to the sensitive nature of the data involved and the vulnerability of the people that are affected.
Inge Graef, Jens PrüferI
This paper explores the need to “mandate sharing of user information in data-driven markets. Existing legal mechanisms to impose data sharing under EU competition law and data portability under the GDPR are not sufficient to tackle this problem.”
Published in Regulating Socio-Technical ChangeLTMS, home of Tilt and TilecDepartment of EconomicsResearch Group: Economics in 2021
Essay
Governance and Operations
General
Published in Regulating Socio-Technical ChangeLTMS, home of Tilt and TilecDepartment of EconomicsResearch Group: Economics in 2021
Excerpt:
“To prevent market tipping, which inhibits innovation, there is an urgent need to mandate sharing of user information in data-driven markets. Existing legal mechanisms to impose data sharing under EU competition law and data portability under the GDPR are not sufficient to tackle this problem. Mandated data sharing requires the design of a governance structure that combines elements of economically efficient centralization with legally necessary decentralization.”
Geoff Boeing
“This paper builds on the theoretical framework of visual cultures in urban planning and morphology to introduce and situate computational data science processes for exploring urban fabric patterns and spatial order.”
Published in 2021
Paper
In Practice
Published in 2021
Abstract:
“Urban planning and morphology have relied on analytical cartography and visual communication tools for centuries to illustrate spatial patterns, conceptualize proposed designs, compare alternatives, and engage the public. Classic urban form visualizations – from Giambattista Nolli’s ichnographic maps of Rome to Allan Jacobs’s figure-ground diagrams of city streets – have compressed physical urban complexity into easily comprehensible information artifacts. Today we can enhance these traditional workflows through the Smart Cities paradigm of understanding cities via user-generated content and harvested data in an information management context. New spatial technology platforms and big data offer new lenses to understand, evaluate, monitor, and manage urban form and evolution. This paper builds on the theoretical framework of visual cultures in urban planning and morphology to introduce and situate computational data science processes for exploring urban fabric patterns and spatial order. It demonstrates these workflows with OSMnx and data from OpenStreetMap, a collaborative spatial information system and mapping platform, to examine street network patterns, orientations, and configurations in different study sites around the world, considering what these reveal about the urban fabric. The age of ubiquitous urban data and computational toolkits opens up a new era of worldwide urban form analysis from integrated quantitative and qualitative perspectives.”
Mirco Nanni, Gennady Andrienko, Alessandro Vespignani
This paper advocates for a decentralized approach to contract tracing, “where both contact and location data are collected exclusively in individual citizens’ “personal data stores”, to be shared separately and selectively (e.g., with a backend system, but possibly also with other citizens), voluntarily, only when the citizen has tested positive for COVID-19, and with a privacy preserving level of granularity. “
Published in Ethics Inf Technol in 2021
Paper
Benefits
Published in Ethics Inf Technol in 2021
Abstract:
“The rapid dynamics of COVID-19 calls for quick and effective tracking of virus transmission chains and early detection of outbreaks, especially in the “phase 2” of the pandemic, when lockdown and other restriction measures are progressively withdrawn, in order to avoid or minimize contagion resurgence. For this purpose, contact-tracing apps are being proposed for large scale adoption by many countries. A centralized approach, where data sensed by the app are all sent to a nation-wide server, raises concerns about citizens’ privacy and needlessly strong digital surveillance, thus alerting us to the need to minimize personal data collection and avoiding location tracking. We advocate the conceptual advantage of a decentralized approach, where both contact and location data are collected exclusively in individual citizens’ “personal data stores”, to be shared separately and selectively (e.g., with a backend system, but possibly also with other citizens), voluntarily, only when the citizen has tested positive for COVID-19, and with a privacy preserving level of granularity. This approach better protects the personal sphere of citizens and affords multiple benefits: it allows for detailed information gathering for infected people in a privacy-preserving fashion; and, in turn this enables both contact tracing, and, the early detection of outbreak hotspots on more finely-granulated geographic scale. The decentralized approach is also scalable to large populations, in that only the data of positive patients need be handled at a central level. Our recommendation is two-fold. First to extend existing decentralized architectures with a light touch, in order to manage the collection of location data locally on the device, and allow the user to share spatio-temporal aggregates—if and when they want and for specific aims—with health authorities, for instance. Second, we favour a longer-term pursuit of realizing a Personal Data Store vision, giving users the opportunity to contribute to collective good in the measure they want, enhancing self-awareness, and cultivating collective efforts for rebuilding society.”
IASC Operational Policy and Advocacy Group
This report states “[d]ata responsibility in humanitarian action is the safe, ethical and effective management of personal and non-personal data for operational response. It is a critical issue for the humanitarian system to address and the stakes are high [..] This system-wide Operational Guidance, which is a first, will ensure concrete steps for data responsibility in all phases of humanitarian action.”
Published in 2021
Paper
In Practice
Governance and Operations
Operations
Published in 2021
Excerpt:
“Data responsibility in humanitarian action is the safe, ethical and effective management of personal and non-personal data for operational response. It is a critical issue for the humanitarian system to address and the stakes are high. Ensuring we ‘do no harm’ while maximizing the benefits of data requires collective action that extends across all levels of the humanitarian system. Humanitarians must be careful when handling data to avoid placing already vulnerable individuals and communities at further risk. This is especially important in contexts where the urgency of humanitarian needs drives pressure for fast, sometimes untested, data solutions, and the politicization of data can have more extreme consequences for people.”
Ken Steif
This open-access book aims to make data science accessible to social scientists and city planners. Ken Steif writes “I hope to convince readers that one with strong domain expertise plus intermediate data skills can have a greater impact in government than the sharpest computer scientist who has never studied economics, sociology, public health, political science, criminology etc.”
Published in CRC Press in 2021
Guide
In Practice
Operations
Published in CRC Press in 2021
Carolin Martina Rauter, Sabine Wöhlke, Silke Schicktanz
This study discusses the growing use of big data in personalized medicine and the need to consider “moral concerns by stakeholders such as patient organizations (POs).”
Published in Journal of Medical Systems in 2021
Essay
Benefits
Published in Journal of Medical Systems in 2021
Excerpt:
“Personalized medicine (PM) operates with biological data to optimize therapy or prevention and to achieve cost reduction. Associated data may consist of large variations of informational subtypes e.g. genetic characteristics and their epigenetic modifications, biomarkers or even individual lifestyle factors. Present innovations in the field of information technology have already enabled the procession of increasingly large amounts of such data (‘volume’) from various sources (‘variety’) and varying quality in terms of data accuracy (‘veracity’) to facilitate the generation and analyzation of messy data sets within a short and highly efficient time period (‘velocity’) to provide insights into previously unknown connections and correlations between different items (‘value’).”
Aline Blankertz
This essay underscores the importance of moving beyond conceptual discussions toward practical experimentation with data trusts. Blankertz notes that “the concept has been endorsed by a broad range of stakeholders, including privacy advocates, companies and expert commissions. In Germany, for example, the data ethics commission and the commission competition law 4.0 have recommended further exploring data trusts, and the government is incorporating the concept into its data strategy.”
Published in Stiftung Neue Verantwortung in 2021
Essay
Blog Post
In Practice
Published in Stiftung Neue Verantwortung in 2021
Andrew Davis, Ola Engkvist, Rebecca Fairclough
In this paper, authors describe “collaborative efforts between public and private entities such as academic institutions, governments, and pharmaceutical companies” for drug discovery and development, providing successful examples and case studies.
Published in 2021
Journal Article
General
Published in 2021
Excerpt:
“Collaborative efforts between public and private entities such as academic institutions, governments, and pharmaceutical companies form an integral part of scientific research, and notable instances of such initiatives have been created within the life science community. Several examples of alliances exist with the broad goal of collaborating toward scientific advancement and improved public welfare. Such collaborations can be essential in catalyzing breaking areas of science within high-risk or global public health strategies that may have otherwise not progressed”
Chantal Bernier
This article is written as part of a Statistics Canada and the Centre for International Governance Innovation collaboration to discuss data needs for a changing world: “To protect the data from increasing privacy risks, governance structures emerge to allow the use and sharing of data as necessary for innovation while addressing privacy risks. Two frameworks proposed to fulfill this purpose are data trusts and regulatory sandboxes.”
Published in CIGI Online in 2021
Report
General
Published in CIGI Online in 2021
Excerpt:
“The Government of Canada introduced the concept of “data trust” into the Canadian privacy law modernization discussion through Canada’s Digital Charter in Action: A Plan by Canadians, for Canadians, to “enable responsible innovation.” At a high level, a data trust may be defined, according to the Open Data Institute, as a legal structure that is appropriate to the data sharing it is meant to govern and that provides independent stewardship of data.
Bill C-11, known as the Digital Charter Implementation Act, 2020, and tabled on November 17, 2020, lays the groundwork for the possibility of creating data trusts for private organizations to disclose de-identified data to specific public institutions for “socially beneficial purposes.” In her recent article “Replacing Canada’s 20-Year-Old Data Protection Law,” Teresa Scassa provides a superb overview and analysis of the bill.
Another instrument for privacy protective innovation is referred to as the “regulatory sandbox.” The United Kingdom’s Information Commissioner’s Office (ICO) provides a regulatory sandbox service that encourages organizations to submit innovative initiatives without fear of enforcement action. From there, the ICO sandbox team provides advice related to privacy risks and how to embed privacy protection.
Both governance measures may hold the future of privacy and innovation, provided that we accept this equation: De-identified data may no longer be considered irrevocably anonymous and therefore should not be released unconditionally, but the risk of re-identification is so remote that the data may be released under a governance structure that mitigates the residual privacy risk.”
Angela Daly, S Kate Devitt, Monique Mann
In this paper, authors offer “recommendations and remedies towards implementing ‘better’ approaches towards AI. Our strategies enable a different (but complementary) kind of evaluation of AI as part of the broader socio-technical systems in which AI is built and deployed.”
Published in Cornell University in 2021
Journal Article
In Practice
Risks and Challenges
Published in Cornell University in 2021
Abstract:
” In this chapter we argue that discourses on AI must transcend the language of ‘ethics’ and engage with power and political economy in order to constitute ‘Good Data’. In particular, we must move beyond the depoliticised language of ‘ethics’ currently deployed (Wagner 2018) in determining whether AI is ‘good’ given the limitations of ethics as a frame through which AI issues can be viewed. In order to circumvent these limits, we use instead the language and conceptualisation of ‘Good Data’, as a more expansive term to elucidate the values, rights and interests at stake when it comes to AI’s development and deployment, as well as that of other digital technologies. Good Data considerations move beyond recurring themes of data protection/privacy and the FAT (fairness, transparency and accountability) movement to include explicit political economy critiques of power. Instead of yet more ethics principles (that tend to say the same or similar things anyway), we offer four ‘pillars’ on which Good Data AI can be built: community, rights, usability and politics. Overall we view AI’s ‘goodness’ as an explicly political (economy) question of power and one which is always related to the degree which AI is created and used to increase the wellbeing of society and especially to increase the power of the most marginalized and disenfranchised. We offer recommendations and remedies towards implementing ‘better’ approaches towards AI. Our strategies enable a different (but complementary) kind of evaluation of AI as part of the broader socio-technical systems in which AI is built and deployed.”
Kristen Himelein
This paper discusses challenges in mobile phone survey methodologies. They note “mobile phone survey respondents in the poorest countries are more likely to be male, urban, wealthier, and more highly educated” and discuss potential solutions to improve representativeness and boost sample sizes.
Published in World Bank in 2021
Blog Post
Operations
In Practice
General
Published in World Bank in 2021
Excerpt:
“Mobile phone surveys have been rapidly deployed by the World Bank to measure the impact of COVID-19 in nearly 100 countries across the world. Previous posts on this blog have discussed the sampling and implementation challenges associated with these efforts, and coverage errors are an inherent problem to the approach. The survey methodology literature has shown mobile phone survey respondents in the poorest countries are more likely to be male, urban, wealthier, and more highly educated. This bias can stem from phone ownership, as mobile phone surveys are at best representative of mobile phone owners, a group which, particularly in poor countries, may differ from the overall population; or from differential response rates among these owners, with some groups more or less likely to respond to a call from an unknown number. In this post, we share our experiences in trying to improve representativeness and boost sample sizes for the poor in Papua New Guinea (PNG).”
Data2x
The Landscape of Big Data and Gender, a report by Data2x, highlights ongoing work of five grantee partners in using “big data” to fill gendered knowledge gaps through improved data stewardship – “filling these gaps makes inequality and discrimination visible, enabling public agencies, businesses, and civil society organizations to enact reforms that move societies closer to the ideals of equality and justice.”
Published in Data2x in 2021
Report
In Practice
Published in Data2x in 2021
Sinead Mac Manus and Alice Clay
Sinead Mac Manus and Alice Clay, in the Nesta Report Dialogues about Data: Building trust and unlocking the value of citizens’ health and care data, lay out “two interlinked challenges to building a data-driven health and care system. This is interspersed with best practice examples of the potential of data to improve health and care, as well as cautionary tales of what can happen when this is done badly.”
Published in Nesta in 2021
Report
General
Data Responsibility
In Practice
Published in Nesta in 2021
Rediet Abebe, Kehinde Aruleba, Abeba Birhane, Sara Kingsley, George Obaido, Sekou L. Remy, Swathi Sadagopan
Narratives and Counternarratives on Data Sharing in Africa by Rediet Abebe et al., in FAccT ‘21 argues that “challenges of accessing and sharing African data are too often driven by non-African stakeholders. These perspectives frequently employ a deficit narrative, often focusing on lack of education, training, and technological resources in the continent as the leading causes of friction in the data ecosystem.”
Published in Association for Computing Machinery, NY, US in 2021
Paper
Risks and Challenges
Published in Association for Computing Machinery, NY, US in 2021
Rawad Choubassi, Lamia Abdelfattah
How Big Data is Transforming the Way We Plan Our Cities, a paper by Rawad Choubassi and Lamia Abdelfattah written for Fondazione Eni Endrico Matteri, contends that the “gains of Big Data and real-time information has not only improved analytical strength, but has also created ripple effects in the systemic approaches of city planning, integrating ex-post studies within the design cycle and redefining the planning process as a microscopic, iterative and self-correcting process.”
Published in Fondazione Eni Enrico Mattei & Systematica in 2020
Paper
Benefits
In Practice
Published in Fondazione Eni Enrico Mattei & Systematica in 2020
Clare Birchall
Clare Birchall’s book Radical Secrecy: The Ends of Transparency in Datafied America, argues that “progressive social goals would be better served by a radical form of secrecy while state and corporate forces hold an asymmetrical advantage over the less powerful in data control,” and proposes a digital “right to opacity.”
Published in University of Minnesota Press in 2021
Essay
General
Published in University of Minnesota Press in 2021
Gabriel E. Kreindler, Yuhei Miyauchi
Gabriel Kreindler and Yuhei Miyauchi at MIT shared Measuring Commuting and Economic Activity insideCities with Cell Phone Records, a paper outlining the authors’ use of cell phone data from users in Dhaka and Colombo to show “that commuting flows constructed from cell phone transaction data predict the spatial distribution of wages and income in cities.”
Published in MIT Economics in 2019
Paper
Benefits
In Practice
Published in MIT Economics in 2019
Jamie Grace
This paper, by Jaimie Grace at Sheffield Hallam University, makes the case for more “investment to explore the use of data-driven technology to predict, prevent and pursue criminal harms against women.”
Published in Social Science Research Network in 2021
Paper
General
Published in Social Science Research Network in 2021
Sébastien Martin, Prune Gautier, Slim Turki, Alexander Kotsev
This report, by Sébastien Martin, et al. “identifies and analyzes a set of successful data ecosystems and to address recommendations in support of the evolution of contemporary spatial data infrastructures and the implementation of data-driven innovation in line with the recently published European data strategy.”
Published in Publications Office of the European Union, Luxembourg in 2021
Report
In Practice
Data Responsibility
General
Benefits
Published in Publications Office of the European Union, Luxembourg in 2021
Concilio, G., Pucci, P., Raes, L., Mareels, G.
This open-access book, edited by Grazia Concilio, Paola Pucci, Lieven Raes and Geert Mareels, “investigates the operative and organizational implications related to the use of the growing amount of available data on policy making processes, highlighting the experimental dimension of policy making that, thanks to data, proves to be more and more exploitable towards more effective and sustainable decisions.”
Published in Springer International Publishing in 2021
Case Study
Essay
General
Data Responsibility
Incentives
Published in Springer International Publishing in 2021
Edited by German Federal Ministry of Justice and Consumer Protection Max Planck Institute for Innovation and Competition
“Data are considered to be key for the functioning of the data economy as well as for pursuing multiple public interest concerns. Against this backdrop this book strives to device new data access rules for future legislation.”
Published in Max-Planck-Institut für Innovation und Wettbewerb in 2021
Journal Article
Governance and Operations
General
Data Responsibility
Published in Max-Planck-Institut für Innovation und Wettbewerb in 2021
Abstract:
“Data are considered to be key for the functioning of the data economy as well as for pursuing multiple public interest concerns. Against this backdrop this book strives to device new data access rules for future legislation. To do so, the contributions first explain the justification for such rules from an economic and more general policy perspective. Then, building on the constitutional foundations and existing access regimes, they explore the potential of various fields of the law (competition and contract law, data protection and consumer law, sector-specific regulation) as a basis for the future legal framework. The book also addresses the need to coordinate data access rules with intellectual property rights and to integrate these rules as one of multiple measures in larger data governance systems. Finally, the book discusses the enforcement of the Government’s interest in using privately held data as well as potential data access rights of the users of connected devices.”
Stefaan G. Verhulst, Andrew Young, Andrew J. Zahuranec, Susan Ariel Aaronson, Ania Calderon, and Matt Gee
Published in Open Data Policy Lab in 2020
Report
Data Responsibility
General
Published in Open Data Policy Lab in 2020
Excerpt:
“In this piece, we argue these (and many other limitations necessitate additional improvements to accelerate the re-use of data to unleash the public good potential of the digital era. We reflect on the potential of what we call the “Third Wave of Open Data.” We 1discuss both what we see as the emerging elements of this wave and the actions several players have taken that may contribute to its realization. In doing so, we propose actions for how policy makers and data practitioners might address the limitations of previous waves and outline a set of actions that could accelerate data re-use and collaboration.”
Andrew Young, Stefaan G. Verhulst, Nadiya Safonova, and Andrew J. Zahuranec
Published in The GovLab in 2020
Report
General
Data Responsibility
Incentives
In Practice
Published in The GovLab in 2020
Excerpt:
“The core output of the Data Assembly is a Responsible Data Re-Use Framework, which seeks to inform decision-makers on how best to re-use data to solve public problems, such as COVID-19. This framework, presented below seeks to inform if, when and how the re-use personal data can be aligned with people’s expectations and societal values. While the initial focus of the Data Assembly falls on data re-use for COVID-19 measures in New York City, the framework is intended to be applicable to policymaking and data re-use project design in other contexts.”
Andrew Young, Andrew J. Zahuranec, Stefaan G. Verhulst, and Kateryna Gazaryan
Published in 2021
Report
General
Data Responsibility
In Practice
Published in 2021
Excerpt:
“This toolkit is the result of several months of research and conversations with data practitioners from around the world during the Summer of Open Data. These discussions revealed significant barriers that prevented organizations from scaling open data and data collaboratives. Many of these barriers were organizational and contributed to a broader data ecosystem that replicated challenges writ large.This toolkit aims to help organizations deal with these challenges so they can foster more data re-use within their organizations and encourage broader responsible access across the domains they work. It achieves this goal by offering a framework to think about data re-use, one that starts from central questions about how data is created before expanding outward. It also offers eight actions data stewards to foster re-use and specific ways they can change their day-to-day operations to make these actions possible.”
The GovLab and Cuebiq
Published in 2021
Report
General
Benefits
Incentives
Risks and Challenges
Published in 2021
Excerpt:
“The initial response to the Covid-19 pandemic was one of mass collaboration. New networks, initiatives, partnerships and working groups made up of governments, businesses, academia and more were born. Many of these collaborative approaches had data and data sharing at their core.
Through our research we have taken a closer look at how mobility data is being collected, used and shared in order to help understand the spread of the pandemic and plan the response to it. To further this research, in October 2020, the ODI announced a call for proposals to review public–private mobility data sharing in response to the Covid-19 pandemic.
The primary objective of this work was to explore how mobility data has been shared between organisations in the private and public sectors, during the Covid-19 pandemic. What are the success stories? Where has this failed? What are the barriers? What tangible effects or impacts have there been from sharing, or not sharing, between the public and private sectors? What decision making has been made possible from these sectors successfully sharing their mobility data?”
Massimo Russo and Tian Feng
Published in BCG in 2021
Blog Post
General
Published in BCG in 2021
Excerpt:
“Data sharing, by definition, involves multiple parties that tend to coalesce around ecosystems. As these ecosystems grow, they share more types of data, and more detailed data, among the members of an expanding community. They also develop solutions that address an expanding range of use cases, some of which were totally unforeseen when the data was originally generated or shared. Each of these factors introduces its own set of risk-value tradeoffs. The extent of the tradeoffs depends on the specific data-sharing capabilities of the underlying platform.”
Claudio Scardovi
“This book presents the need to re-imagine the future of cities and move beyond “smart city” strategies: “Global cities are facing an almost unprecedented challenge of change. As they re-emerge from the Covid 19 pandemic and get ready to face climate change and other [threats] they need to look for new ways to support wealth and wellbeing creation – leveraging Big Data and AI and suing them into their physical reality and to become greener, more inclusive and resilient, hence sustainable.”
Published in Springer International Publishing in 2021
Book
Benefits
In Practice
Published in Springer International Publishing in 2021
Abstract:
Global cities are facing an almost unprecedented challenge of change. As they re-emerge from the Covid 19 pandemic and get ready to face climate change and other, potentially existential threats, they need to look for new ways to support wealth and wellbeing creation – leveraging Big Data and AI and suing them into their physical reality and to become greener, more inclusive and resilient, hence sustainable.
This book describes how new digital technologies could be used to design digital and physical twins of cities that are able to feed into each other to optimize their working and ability to create new wealth and wellbeing. The book also describes how to increase cities’ social and economic resilience during crisis time and addressing their almost fatal weaknesses – as it became all too obvious during the recent COVID 19 crisis. Also, the book presents a framework for a critical discussion of the concept of “smart-city”, suggesting its development into a “cyber” and “meta” one – meaning, not only digital systems can allow physical ones (e.g. cities, citizens, households and companies) to become “smarter”, but also the vice versa is true, as off line data and real life behaviours can support the optimization and development of virtual brains as a sum of big data and artificial intelligence apps all sitting “over the cloud”. An analysis of the fundamental dynamics of this emerging “info-telligence” economy, and of the potential role of big digital players like Amazon, Google and Facebook is then paving the way to discuss a few strategic forays on how traditional sectors such as financial services, real estate, TMT or health could also evolve, leveraging Big Data and AI in a cyber-physical integrated setting. Finally, a number of thought provoking use cases that could be designed around individuals, and to improve the success and the resilience of households and companies living and working in urban areas are discussed, as an example of one of the most exciting future markets to come: the one of global, sustainable cities.
Secretariat of the Internet & Jurisdiction Policy Network
This report discusses the importance of standardizing common terminology in the data ecosystem, and “offers key recommendations on how to move forward to foster a collaborative discussion on how to organize our common datasphere.”
Published in We Need to Talk About Data: Framing the Debate Around the Free Flow of Data and Data Sovereignty. in 2021
Report
Data Responsibility
General
Published in We Need to Talk About Data: Framing the Debate Around the Free Flow of Data and Data Sovereignty. in 2021
Filippo Candela, Paolo Mulassano
This paper “presents and discusses the method adopted by Compagnia di San Paolo, one of the largest European philanthropic institutions, to monitor the advancement, despite the COVID-19 situation, in providing specific input to the decision-making process for dedicated projects.”
Published in Using Open Data to Monitor the Status of a Metropolitan Area: The Case of the Metropolitan Area of Turin in 2021
Paper
In Practice
Operations
General
Published in Using Open Data to Monitor the Status of a Metropolitan Area: The Case of the Metropolitan Area of Turin in 2021
Abstract:
The paper presents and discusses the method adopted by Compagnia di San Paolo, one of the largest European philanthropic institutions, to monitor the advancement, despite the COVID-19 situation, in providing specific input to the decision-making process for dedicated projects. An innovative approach based on the use of daily open data was adopted to monitor the metropolitan area with a multidimensional perspective. Several open data indicators related to the economy, society, culture, environment, and climate were identified and incorporated into the decision support system dashboard. Indicators are presented and discussed to highlight how open data could be integrated into the foundation’s strategic approach and potentially replicated on a large scale by local institutions. Moreover, starting from the lessons learned from this experience, the paper analyzes the opportunities and critical issues surrounding the use of open data, not only to improve the quality of life during the COVID-19 epidemic but also for the effective regulation of society, the participation of citizens, and their well-being.
German Federal Ministry of Justice and Consumer Protection, Max Planck Institute for Innovation and Competition
This book explores new data access rules for future legislation: “the contributions first explain the justification for such rules from an economic and more general policy perspective. Then, building on the constitutional foundations and existing access regimes, they explore the potential of various fields of the law (competition and contract law, data protection and consumer law, sector-specific regulation) as a basis for the future legal framework.”
Published in 2021
Book
Data Responsibility
General
Governance and Operations
Published in 2021
Abstract:
Data are considered to be key for the functioning of the data economy as well as for pursuing multiple public interest concerns. Against this backdrop this book strives to device new data access rules for future legislation. To do so, the contributions first explain the justification for such rules from an economic and more general policy perspective. Then, building on the constitutional foundations and existing access regimes, they explore the potential of various fields of the law (competition and contract law, data protection and consumer law, sector-specific regulation) as a basis for the future legal framework. The book also addresses the need to coordinate data access rules with intellectual property rights and to integrate these rules as one of multiple measures in larger data governance systems. Finally, the book discusses the enforcement of the Government’s interest in using privately held data as well as potential data access rights of the users of connected devices. The authors Prof. Dr. Josef Drexl, LL.M. (UC Berkeley); Prof. Dr. Thomas Fetzer, LL.M. (Vanderbilt); Prof. Dr. Michael Grünberger, , LL.M. (NYU); Jörg Hoffmann; Prof. Dr. Ruth Janal, LL.M. (New South Wales); Prof. Dr. Wolfgang Kerber; Christine Lambrecht; Prof. Dr. Matthias Leistner, LL.M. (Cambridge); Bertin Martens, Ph.D.; Prof. Dr. Axel Metzger, LL.M. (Harvard); Christian Reimsbach-Kounatze; Dr. Heiko Richter, LL.M. (Columbia); Prof. Dr. Heike Schweitzer, LL.M. (Yale); Prof. Dr. Louisa Specht-Riemenschneider; Prof. Dr. Indra Spiecker gen. Döhmann, LL.M. (Georgetown Univ.) und Robert Welker.
International Organization for Labor Migration
The International Organization for Migration (IOM) published Leave No Migrant Behind: The 2030 Agenda and Data Disaggregation, a guide to help promote migrant-inclusive data practices for those working to achieve Sustainable Development Goals. “To date, disaggregation of global development data by migratory status remains low. Migrants are largely invisible in official SDG data. As the global community approaches 2030, very little is known about the impact of the 2030 Agenda on migrants.”
Published in 2021
Guide
Data Responsibility
In Practice
Published in 2021
Angelina Fisher and Thomas Streinz
This paper argues “that data inequality is a function of unequal control over the infrastructures that generate, shape, process, store, transfer, and use data. Existing law often regulates data as an object to be transferred, protected, and shared and is not always attuned to the salience of infrastructural control over data.”
Published in World Development Report 2021 background paper in 2021
Paper
General
Published in World Development Report 2021 background paper in 2021
Abstract:
Data conveys significant social, economic, and political power. Unequal control over data — a pervasive form of digital inequality — is a problem for economic development, human agency, and collective self-determination that needs to be addressed. This paper takes some steps in this direction by analyzing the extent to which law facilitates unequal control over data and by suggesting ways in which legal interventions might lead to more equal control over data. By unequal control over data, we not only mean having or not having data, but also having or not having power over deciding what becomes and what does not become data. We call this the power to datafy. We argue that data inequality is in turn a function of unequal control over the infrastructures that generate, shape, process, store, transfer, and use data. Existing law often regulates data as an object to be transferred, protected, and shared and is not always attuned to the salience of infrastructural control over data. While there are no easy solutions to the variegated causes and consequences of data inequality, we suggest that retaining flexibility to experiment with different approaches, reclaiming infrastructural control, systematically demanding enhanced transparency, pooling of data and bargaining power, and differentiated and conditional access to data mechanisms may help in confronting data inequality more effectively going forward.
Sun-ha Hong
This essay explores the idea of control creep, which entails: “data-driven technologies [being] pitched for a particular context and purpose, but quickly expand[ing] into new forms of control. Although we often think about data use in terms of trade-offs or bargains, such frameworks can be deeply misleading.”
Published in Centre for International Governance Innovation in 2021
Essay
Governance and Operations
General
Published in Centre for International Governance Innovation in 2021
Excerpt:
The data always travels, creating new possibilities for judging and predicting human lives. We might call it control creep: data-driven technologies tend to be pitched for a particular context and purpose, but quickly expand into new forms of control. Although we often think about data use in terms of trade-offs or bargains, such frameworks can be deeply misleading. What does it mean to “trade” personal data for the convenience of, say, an Amazon Echo, when the other side of that trade is constantly arranging new ways to sell and use that data in ways we cannot anticipate? As technology scholars Jake Goldenfein, Ben Green and Salomé Viljoen argue, the familiar trade-off of “privacy vs. X” rarely results in full respect for both values but instead tends to normalize a further stripping of privacy.
Archita Misra
This paper “presents a few insights with key elements of the literature on data literacy, and then covers common practices of implementing data literacy programmes. It concludes by sharing some takeaways that emerged out of this stock-taking exercise and proposes a few questions to spur further dialogue and discussion”
Published in PARIS21 in 2021
Paper
General
In Practice
Published in PARIS21 in 2021
Abstract:
The COVID-19 crisis presents a monumental opportunity to engender a widespread data culture in our societies. Since early 2020, the emergence of popular data sites like Worldometer2 have promoted interest and attention in data-driven tracking of the pandemic. “R values”, “flattening the curve” and “exponential increase” have seeped into everyday lexicon. Social media and news outlets have filled the public consciousness with trends, rankings and graphs throughout multiple waves of COVID-19.
Yet, the crisis also reveals a critical lack of data literacy amongst citizens in many parts of the world. The lack of a data literate culture predates the pandemic. The supply of statistics and information has significantly outpaced the ability of lay citizens to make informed choices about their lives in the digital data age.
Today’s fragmented datafied information landscape is also susceptible to the pitfalls of misinformation, post-truth politics and societal polarisation – all of which demand a critical thinking lens towards data. There is an urgent need to develop data literacy at the level of citizens, organisations and society – such that all actors are empowered to navigate the complexity of modern data ecosystems.
The paper identifies three key take-aways. It is crucial to
Beatriz Botero Arcila
This paper argues that there is a way in which local governments and other non-private sector stakeholders can access data from technology companies, “without harming the legitimate privacy interests of both individuals and companies.”
Published in William & Mary Bill of Rights Journal, Forthcoming in 2021
Paper
General
Benefits
Published in William & Mary Bill of Rights Journal, Forthcoming in 2021
Abstract:
Cities in the US have started to enact data-sharing rules and programs to access some of the data that technology companies operating under their jurisdiction – like short-term rental or ride hailing companies - collect. This information allows cities to adapt too to the challenges and benefits of the digital information economy. It allows them to understand what their impact is on congestion, the housing market, the local job market and even the use of public spaces. It also empowers them to act accordingly by, for example, setting vehicle caps or mandating a tailored minimum pay for gig-workers. These companies, however, sometimes argue that sharing this information attempts against their users’ privacy rights and their privacy rights, because this information is theirs; it’s part of their business records. The question is thus what those rights are, and whether it should and could be possible for local governments to access that information to advance equity and sustainability, without harming the legitimate privacy interests of both individuals and companies. This Article argues that within current Fourth Amendment doctrine and privacy law there is space for data-sharing programs. Privacy law, however, is being mobilized to alter the distribution of power and welfare between local governments, companies, and citizens within current digital information capitalism to extend those rights beyond their fair share and preempt permissible data-sharing requests. The Article warns that if the companies succeed in their challenges, privacy law will have helped shield corporate power from regulatory oversight, while still leaving individuals largely unprotected and submitting local governments further to corporate interests.
Yusuke Inoue, Masako Okamoto, Takanori Fujita, Seiichiro Yamamoto, Takafumi Ochiai
This white paper proposes a practical approach to implementing APPA – “a new data governance model that aims to strike a balance between individual rights and the interests of data holders and the public interest” – to support organizations and governments achieve public goals through data use.
Published in World Economic Forum in 2021
Report
In Practice
Published in World Economic Forum in 2021
“Abstract:
“In January 2020, our first publication presented Authorized Public Purpose Access (APPA), a new data governance model that aims to strike a balance between individual rights and the interests of data holders and the public interest. It is proposed that the use of personal data for public-health purposes, including fighting pandemics, be subject to appropriate and balanced governance mechanisms such as those set out the APPA approach. The same approach could be extended to the use of data for non-medical public-interest purposes, such as achieving the United Nations Sustainable Development Goals (SDGs). This publication proposes a systematic approach to implementing APPA and to pursuing public-interest goals through data use. The approach values practicality, broad social agreement on appropriate goals and methods, and the valid interests of all stakeholders.”
Paris 21 and the Mo Ibrahim Foundation
This working paper explores the role that accurate national statistics play in designing policies that sufficiently address the needs of citizens and hold governments accountable. It provides “recommendations to national and statistical offices and governments to enhance the production and use of data for evidence-based policymaking.”
Published in 2021
Report
Data Responsibility
Governance and Operations
Published in 2021
Grant Fleming, Peter C. Bruce
Responsible Data Science explores prevalent ethical issues in data science and “delivers a comprehensive, practical treatment of how to implement data science solutions in an even-handed and ethical manner that minimizes the risk of undue harm to vulnerable members of society.”
Published in Wiley in 2021
Book
In Practice
Risks and Challenges
Published in Wiley in 2021
Description:
The increasing popularity of data science has resulted in numerous well-publicized cases of bias, injustice, and discrimination. The widespread deployment of “Black box” algorithms that are difficult or impossible to understand and explain, even for their developers, is a primary source of these unanticipated harms, making modern techniques and methods for manipulating large data sets seem sinister, even dangerous. When put in the hands of authoritarian governments, these algorithms have enabled suppression of political dissent and persecution of minorities. To prevent these harms, data scientists everywhere must come to understand how the algorithms that they build and deploy may harm certain groups or be unfair.
Responsible Data Science delivers a comprehensive, practical treatment of how to implement data science solutions in an even-handed and ethical manner that minimizes the risk of undue harm to vulnerable members of society. Both data science practitioners and managers of analytics teams will learn how to:
Perfect for data science practitioners, Responsible Data Science will also earn a spot on the bookshelves of technically inclined managers, software developers, and statisticians.
Dominik Rozkrut, Olga Świerkot-Strużewska, and Gemma Van Halderen
This paper describes the UN Fundamental Principles for Official Statistics in relation to 8 new data sources, arguing that “these data sources should be used if National Statistical Systems are to adhere to the first Fundamental Principle of compiling and making available official statistics that honor citizen’s entitlement to public”
Published in IOS Press in 2021
Paper
Risks and Challenges
Data Responsibility
Published in IOS Press in 2021
Abstract:
“Never has there been a more exciting time to be an official statistician. The data revolution is responding to the demands of the CoVID-19 pandemic and a complex sustainable development agenda to improve how data is produced and used, to close data gaps to prevent discrimination, to build capacity and data literacy, to modernize data collection systems and to liberate data to promote transparency and accountability. But can all data be liberated in the production and communication of official statistics? This paper explores the UN Fundamental Principles of Official Statistics in the context of eight new and big data sources. The paper concludes each data source can be used for the production of official statistics in adherence with the Fundamental Principles and argues these data sources should be used if National Statistical Systems are to adhere to the first Fundamental Principle of compiling and making available official statistics that honor citizen’s entitlement to public information.”
Christopher Kuner, Lee A. Bygrave, Christopher Docksey, Laura Drechsler, Luca Tosoni
This paper “provides an update for selected articles of the GDPR Commentary published in 2020 by Oxford University Press […] It also includes two appendices that cover the same period as the rest of this update: the first deals with judgments of the European courts and some selected judgments of particular importance from national courts, and the second with EDPB papers.”
Published in SSRN in 2021
Paper
General
Governance and Operations
Published in SSRN in 2021
Abstract: This document provides an update for selected articles of the GDPR Commentary published in 2020 by Oxford University Press. It covers developments between the last date of coverage of the Commentary (1 August 2019) and 1 January 2021 (with a few exceptions when later developments are taken into account). Edited by Christopher Kuner, Lee A. Bygrave, Chris Docksey, Laura Drechsler, and Luca Tosoni, it covers 49 articles of the GDPR, and is being made freely accessible with the kind permission of Oxford University Press. It also includes two appendices that cover the same period as the rest of this update: the first deals with judgments of the European courts and some selected judgments of particular importance from national courts, and the second with EDPB papers. Full reference to it should be made if it is quoted from or paraphrased, as follows: C. Kuner, L.A. Bygrave and C. Docksey (eds.), The EU General Data Protection Regulation (GDPR): A Commentary – 2021 Update (OUP 2021).
Kath Albury, Amir Aryani, Jane Farmer, James Kelly, Anthony McCosker, Sandun S. Silva, Julie Tucket, Jihoon Woo
This report presents findings from the Swimburne Research bank’s partnership with Lord Mayor’s Charitable Foundation, Entertainment Assist, Good Cycles and Yooralla Disability Services. “The project had two aims: [1] Build organisational data capacity through knowledge sharing about data literacy, expertise and collaboration [2] Deliver data insights through a methodology of collaborative data analytics”
Published in Swiburne Research Bank in 2021
Report
In Practice
Published in Swiburne Research Bank in 2021
Sarah Giest, Jose M. Miotto, and Wessel Kraaji
This paper describes challenges to the data-driven analyses of existing poverty programs, feeding “into the discourse on how to operationalize and design data matching work in the multidimensional space of poverty and nonmonetary government initiatives.”
Published in Data & Policy in 2021
Paper
In Practice
Published in Data & Policy in 2021
Abstract:
“The recent surge of data-driven methods in social policy have created new opportunities to assess existing poverty programs. The expectation is that the combination of advanced methods and more data can calculate the effectiveness of public interventions more accurately and tailor local initiatives accordingly. Specifically, nonmonetary indicators are increasingly being measured at micro levels in order to target social exclusion in combination with poverty. However, the multidimensional character of poverty, local context, and data matching pose challenges to data-driven analyses. By linking Dutch household-level data with policy-initiative-specific data at local level, we present an explorative study on the uptake of a local poverty pass. The goal is to unravel pass usage in terms of household income and location as well as the age of users. We find that income and age play a role in whether the pass is used, and usage differs per neighborhood. With this, the paper feeds into the discourse on how to operationalize and design data matching work in the multidimensional space of poverty and nonmonetary government initiatives.”
Land Portal
The report is “a resource aimed to be used by governments from developing countries to collect and release land-related data to improve data quality, availability, accessibility and use for improved citizen engagement, decision making and innovation.”
Published in 2021
Report
In Practice
General
Data Responsibility
Published in 2021
Jasmina Byrne, Emma Day and Linda Raftree
This report calls for a new governance model to ensure “that children’s rights are given due weight in data governance legal frameworks and processes as they evolve around the world.”
Published in UNICEF in 2021
Report
Governance and Operations
General
Data Responsibility
Published in UNICEF in 2021
From text:
“Every child is different, with unique identities and their capacities and circumstances evolve over their lifecycle. Children are more vulnerable than adults and are less able to understand the long-term implications of consenting to their data collection. For these reasons, children’s data deserve to be treated differently.
While responsible data use can underpin many benefits for children, ensuring that children are protected, empowered and granted control of their data is still a challenge.
To maximise the benefits of data use for children and to protect them from harm requires a new model of data governance that is fitting for the 21st century.
UNICEF has worked with 17 global experts to develop a Manifesto that articulates a vision for a better approach to children’s data.
This Manifesto includes key action points and a call for a governance model purposefully designed to deliver on the needs and rights of children. It is the first step in ensuring that children’s rights are given due weight in data governance legal frameworks and processes as they evolve around the world.”
Jennifer Viberg Johansson, Nisha Shah, Eik Haraldsdottir, Heidi Beate Bentzen, Sarah Coy, Jane Kaye, Deborah Mascalzoni, Jorien Veldwijk
This paper describes “the process of identifying attributes for a study aiming to elicit preferences of citizens in Sweden, Iceland and the UK for governance mechanisms for digitally sharing different kinds of health data in different contexts.”
Published in Technology in Society in 2021
Paper
General
In Practice
Published in Technology in Society in 2021
Abstract
Background Discrete Choice Experiment (DCE) is a well-established technique to elicit individual preferences, but it has rarely been used to elicit governance preferences for health data sharing.
Objectives The aim of this article was to describe the process of identifying attributes for a DCE study aiming to elicit preferences of citizens in Sweden, Iceland and the UK for governance mechanisms for digitally sharing different kinds of health data in different contexts.
Methods A three-step approach was utilised to inform the attribute and level selection: 1) Attribute identification, 2) Attribute development and 3) Attribute refinement. First, we developed an initial set of potential attributes from a literature review and a workshop with experts. To further develop attributes, focus group discussions with citizens (n = 13), ranking exercises among focus group participants (n = 48) and expert interviews (n = 18) were performed. Thereafter, attributes were refined using group discussion (n = 3) with experts as well as cognitive interviews with citizens (n = 11).
Results The results led to the selection of seven attributes for further development: 1) level of identification, 2) the purpose of data use, 3) type of information, 4) consent, 5) new data user, 6) collector and 7) the oversight of data sharing. Differences were found between countries regarding the order of top three attributes. The process outlined participants’ conceptualisation of the chosen attributes, and what we learned for our attribute development phase.
Conclusions This study demonstrates a process for selection of attributes for a (multi-country) DCE involving three stages: Attribute identification, Attribute development and Attribute refinement. This study can contribute to improve the ethical aspects and good practice of this phase in DCE studies. Specifically, it can contribute to the development of governance mechanisms in the digital world, where people’s health data are shared for multiple purposes.
Marcela Escobari, Ian Seyal, and Carlos Daboin Contreras
“This report offers a new approach to better understand the contours of mobility: Who is falling behind, where, and by how much. Using data on hundreds of thousands of real workers’ occupational transitions, we use network analysis to create a multidimensional map of the labor market, revealing a landscape riddled with mobility gaps and barriers.”
Published in Brookings in 2021
Report
General
In Practice
Published in Brookings in 2021
Jer Thorp
In this book, Jer Thorp asks the question: “How do we stop passively inhabiting data, and instead become active citizens of it?” The book “not only redefines what data is, but reimagines who gets to speak its language and how to use its power to create a more just and democratic future.”
Published in Macmillan Publishers in 2021
Book
General
Benefits
Published in Macmillan Publishers in 2021
Irina Bernal
The UN Economic and Social Commission for Asia and the Pacific released the brief, Big Data for economic statistics. “This brief highlights examples of the use of big data sources for generating economic statistics from the national statistical systems, mostly the National Statistical Offices (NSO), in Asia and the Pacifics”
Published in United Nation ESCAP Stats Brief in 2021
Paper
Published in United Nation ESCAP Stats Brief in 2021
Sebastian Lehuede
The coloniality of collaboration: sources of epistemic obedience in data-intensive astronomy in Chile by Sebastian Lehuede in Information, Communication and Society, “warns that an increased emphasis on collaboration runs the risk of reproducing planetary hierarchies in times of data-intensive research.”
Published in Information, Communication and Society in 2021
Journal Article
Published in Information, Communication and Society in 2021
Emily AIken
This paper by Emily Aiken et al. for the National Bureau of Economic Research shows “that non-traditional “big” data from satellites and mobile phone networks can improve the targeting of anti-poverty programs […] These results highlight the potential for new data sources to contribute to humanitarian response efforts, particularly in crisis settings when traditional data are missing or out of date.”
Published in National Bureau of Economic Research in 2021
Paper
Published in National Bureau of Economic Research in 2021
Partners for Review, Danish Institute for Human Rights, International Civil Society Centre
Partners for Review, Danish Institute for Human Rights, International Civil Society Centre published the joint learning report, Inclusive SDG Data Partnerships. “This learning report gathers knowledge and recommendations from the Inclusive SDG Data Partnerships initiative [..] to advance data partnerships for the SDGs and to strengthen multi-actor data ecosystems at the national level.”
Published in Danish Institute for Human Rights in 2021
Report
Published in Danish Institute for Human Rights in 2021
Claudia Biancotti, Oscar Borgogno, and Giovanni Veronese
Principled Data Access: Building Public-private Data Partnerships for Better Official Statistics by Claudia Biancotti, Oscar Borgogno, and Giovanni Veronese in the Bank of Italy Occasional Paper Series argues for the need for greater transparency and access to official statistics, and proposes “a set of principles under which the public and the private sector can form partnerships to leverage the potential of new-generation data in the public interest.”
Published in Bank of Italy Occasional Paper Series in 2021
Paper
Published in Bank of Italy Occasional Paper Series in 2021
Principled Data Access: Building Public-private Data Partnerships for Better Official Statistics by Claudia Biancotti, Oscar Borgogno, and Giovanni Veronese in the Bank of Italy Occasional Paper Series argues for the need for greater transparency and access to official statistics, and proposes “a set of principles under which the public and the private sector can form partnerships to leverage the potential of new-generation data in the public interest.”
Griet Verhenneman
The Patient, Data Protection and Changing Healthcare Models by Griet Verhenneman “assesses the adequacy of [three principles of the European data protection law – informed consent, anonymisation and purpose limitation] and considers them in the context of technological and societal evolutions.”
Published in Intersentia in 2021
Book
Published in Intersentia in 2021
Gelb Pepyshev and Masaru Yarime
In Data & Policy, this paper by Gelb Pepyshev and Masaru Yarime “analyzes the history of the development of the concept of digital twins and how it is now being adopted on a city-scale […] and its potential applications in the policymaking context.” See also the “Digital Twins” entry in The Living Library’s 21st Century Vocabulary series.
Published in Cambridge University Press in 2021
Paper
Published in Cambridge University Press in 2021
Elizabeth Hansen Shapiro, Michael Sugarman, Fernando Bermejo, Ethan Zuckerman
This report “explores the challenges and potentials of a wide range of approaches to studying social media platforms, ranging from cooperative to adversarial strategies” and “represents the results of 32 interviews, and includes an overview of the different ways researchers are trying to understand social media data, the obstacles to accessing that data, and a set of recommendations for policymakers and philanthropic funders to increase access to that data.”
Published in NetGain Partnership in 2021
Report
Published in NetGain Partnership in 2021
Gavin Starks, Miles Cheetham, Jannah Patchay
This report presents “the scope and complexity of data sharing across markets, supply and value-chains” and “[highlights] challenges that create friction, inertia and inhibit action in delivering goals, including hard-programmed habits and underlying presuppositions and premises.” It also explores “solutions that can reduce friction in data-sharing and propose levers of change that can help unlock innovation (e.g. policy and regulatory, perception shifts, behavioral changes)”
Published in Icebreaker One in 2021
Report
Published in Icebreaker One in 2021
Wilma Dragonetti
Published in EuroCities in 2021
Report
Guide
Published in EuroCities in 2021
“The framework should foster collaboration and experimentation by the local ecosystems’ stakeholders to co-develop advanced data-driven applications and services as well as enable scaling up of practices in local data sharing between cities across Europe. It should promote and support data quality culture in the public sector and especially in city governments to increase the efficiency of public services.”
Bennett Cyphers
This piece discusses how “data brokers and federal military, intelligence, and law enforcement agencies have formed a vast, secretive partnership to surveil the movements of millions of people.” It provides an overview of how different organizations acquire data and then sell it to or share it with other parties.
Published in Electronic Frontier Foundation in 2022
Blog Post
Published in Electronic Frontier Foundation in 2022
Caitrin Pilkington
This piece describes an effort by the Łı́ı́dlı̨ı̨ Kų́ę́ First Nation of the Northwest Territories of Canada, in partnership with the Scotty Creek research facility, to create a new application process for researchers to use to request data on the community. The process asks researchers to declare all raw data will be co-owned by the First Nation among other details.
Published in Cabin Radio in 2022
Blog Post
Published in Cabin Radio in 2022
Louise Mc Grath-Lone, Matthew A Jay, Ruth Blackburn, Emma Gordon, Ania Zylbersztejn, Linda Wijlaars, Ruth Gilbert
This article provides an overview and thematic analysis of why administrative data is such a rich research resource. The piece defines “the characteristics of research-ready administrative data based on a systematic review and synthesis of existing literature.”
Published in The International Journal of Population Data Science (IJPDS) in 2022
Journal Article
Published in The International Journal of Population Data Science (IJPDS) in 2022
Corina Pascu and Jean-Claude Burgelman
This paper identifies the potential benefits of data sharing and open science when data is open and findable, accessible, interoperable, and reusable. The authors highlight the opportunities and risks posed by open research data (ORD). “However, minimal safeguards need to be put in place for open research data to stay open. The policy challenge is how to avoid misuse by public and private actors and dependability of all kind of providers.”
Published in Data & Policy in 2022
Journal Article
Published in Data & Policy in 2022
Larissa Fast
This paper “examines data sharing between humanitarian organisations and donors, with a focus on governing frameworks and how these are implemented and avenues for more responsible data sharing.” It seeks to identify new responsible practices.
Published in The Norwegian Centre for Humanitarian Studies in 2022
Paper
Published in The Norwegian Centre for Humanitarian Studies in 2022
Martin Lnenicka, Anastasija Nikiforova, Mariusz Luterek, Otmane Azeroual, Dandison Ukpabi, Visvaldis Valtenbergs, Renata Machova
The piece examines 34 smart city data portals across 24 cities and assesses their maturity, identifying ways that future portals might be more “sustainable, transparent, citizen-centered, and socially resilient.”
Published in Sustainable Cities and Society in 2022
Journal Article
Published in Sustainable Cities and Society in 2022
N/A
This article argues that “the open data revolution won’t happen unless the research system values the sharing of data as much as authorship on papers.” It urges researchers to credit all those who contribute their knowledge to research output.
Published in Nature in 2022
Magazine Article
Published in Nature in 2022
Renato A.F. Lima and colleagues
This piece notes that “data on tropical forests are in high demand. But ground forest measurements are hard to sustain and the people who make them are extremely disadvantaged compared to those who use them.” The authors propose a new approach focused on “the needs of data originators, and ensures users and funders contribute properly.”
Published in Nature in 2022
Journal Article
Published in Nature in 2022
Matthias Evers, Lucy Pérez, Lucas Robke, and Katarzyna Smietana
This piece suggests that “health data platforms that participants trust could bring an end to today’s reductionist approach to drug development, revolutionizing our understanding of the disease.
Published in McKinsey & Company in 2022
Blog Post
Published in McKinsey & Company in 2022
Matteo Nebbiai
This piece describes an understudied application of the European Union’s General Data Protection Regulation on data portability. It describes how the Right to Data Portability framework’s wording “creates some “grey areas” that allow data controllers a broad interpretation of the right” and “shows why the regulatory initiatives affecting the interpretation of these “grey areas” can be framed as “regulatory standard-setting (RSS) schemes.”
Published in Internet Policy Review in 2022
Journal Article
Published in Internet Policy Review in 2022
N/A
The framework is “a tool leaders can use to put data to work to solve the challenges most important to them.” It offers a way for organizations to assess their infrastructure, understand the questions they want to answer with data, assemble talent in their organization, and build community trust. The framework synthesizes lessons learned from the GovLab’s Open Data Policy Lab initiative and other efforts in the field.
Published in Microsoft in 2022
Report
Published in Microsoft in 2022
João Marinotti
This article is an article describing the diversity of concepts included in “data” and how these concepts require different governance regimes in commerce, life, and law. It also argues that data trusts, as a form of governance, need to ensure they can be recognized as a trust (of property) under the law of the jurisdiction in which they operate.
Published in New York University Law Review Online in 2022
Journal Article
Published in New York University Law Review Online in 2022
Max Kozlov
The news item describes a new requirement by the US National Institutes of Health that funded organizations include a data-management plan in their grant applications. The author describes the move as one that could “set a global standard for biomedical research.”
Published in Nature in 2022
Magazine Article
In Practice
Governance and Operations
Published in Nature in 2022
The news item describes a new requirement by the US National Institutes of Health that funded organizations include a data-management plan in their grant applications. The author describes the move as one that could “set a global standard for biomedical research.”
Charlotte van Ooijen, Nathan da Silva Carvalho, Alice Iordache, David Osimo
The piece “provides an overview of the state-of-the-art of existing approaches and indicators in the European open data landscape to assess public institutions’ needs as data re-users.”
Published in data.europa.eu in 2022
Paper
Published in data.europa.eu in 2022
N/A
This is a report on the state of data portals in national statistical offices. The report “proposes a holistic method to evaluate data portals and proposes recommendations to improve their use and function.”
Published in PARIS21 in 2021
Report
Published in PARIS21 in 2021
Alexander Kroll
This article examines how collaborations between the public sector and private and nonprofit organizations can operate. The piece draws on a case study in North Carolina to respond to the opioid epidemic to understand ideas of “shared measures” and “collective data use.”
Published in Cambridge University Press in 2022
Journal Article
Published in Cambridge University Press in 2022
Brandie Nonnecke And Camille Carlton
The piece describes “two notable, legislative efforts aimed at opening up platform data: the Digital Services Act (DSA), recently approved by the European Parliament, and the Platform Accountability and Transparency Act (PATA), recently proposed by several US senators.”
Published in Science in 2022
Journal Article
Published in Science in 2022
Maxat Kassen
This piece examines the driving forces that initiate and advance open data governance. It relies on theory and practice from Finland and Sweden to illustrate these concepts.
Published in Palgrave Macmillan in 2022
Journal Article
General
In Practice
Governance and Operations
Published in Palgrave Macmillan in 2022