Historical Overview
Indigenous data sovereignty emerges from centuries of colonial data extraction practices that treated Indigenous Peoples as subjects of study rather than agents of knowledge production. The historical pattern is stark: colonial authorities collected data about Indigenous communities to facilitate land seizure, population control, and cultural assimilation. These practices created what scholars now term “data colonialism”, the contemporary equivalent of resource extraction that transforms human experience into digital assets without consent or benefit to those providing the data.5
The intellectual genealogy of Indigenous data sovereignty traces to broader movements for Indigenous self-determination that gained momentum following the 1960s civil rights era. The 1975 establishment of the First Nations Research and Data Centre in Canada represented an early attempt to shift control over Indigenous information back to Indigenous communities. This precedent influenced the development of the Ownership, Control, Access, Possession (OCAP) principles in 1998, which established the first systematic framework for Indigenous data governance.6 7 8 9
Contemporary Indigenous data sovereignty movements gained theoretical foundation through the 2007 United Nations Declaration on the Rights of Indigenous Peoples, particularly Article 31, which affirms Indigenous Peoples’ rights to “maintain, control, protect and develop their cultural heritage, traditional knowledge and traditional cultural expressions” including their intellectual property over such heritage.This legal framework provided legitimacy for Indigenous claims to data sovereignty that had been developing organically within communities for decades.5 10
The emergence of digital technologies and big data analytics intensified these concerns. Indigenouscommunities watched their cultural knowledge, genetic information, and territorial data extracted and commodified through digital platforms without meaningful participation in governance structures. This “algorithmic colonisation” became particularly acute as artificial intelligence systems trained onIndigenous data began reproducing colonial biases and stereotypes in automated decision-making systems.11 12 13 14
Australia’s Framework for Governance of Indigenous Data: Institutionalising Sovereignty
Australia’s 2025 implementation of the Framework for Governance of Indigenous Data15 represents a systematic attempt to embed Indigenous data sovereignty within bureaucratic structures. The Framework operates on four foundational principles that directly challenge traditional government approaches toIndigenous affairs. Rather than treating Indigenous data as another government asset, the Framework requires all Australian Public Service agencies to partner with Aboriginal and Torres Strait Islander communities throughout the data lifecycle.
This approach reflects what anthropologist Sarah Hunt terms “decolonising bureaucracy”, the deliberate restructuring of institutional processes to accommodate Indigenous governance systems. The Framework’semphasis on co-design moves beyond consultation to require genuine power-sharing in data governance decisions. This represents a significant departure from previous models that positioned Indigenous communities as stakeholders rather than decision-makers.16
The Framework’s strength lies in its recognition that Indigenous data sovereignty cannot be achieved through policy declarations alone. Building data-related capabilities within Indigenous communities requires long-term investment in technical infrastructure, governance training, and institutional development. This capacity-building approach acknowledges what political scientist Glen Coulthard identifies as the “political economy of recognition”, the need for material changes alongside symbolic acknowledgements of Indigenous rights.17 18

However, implementation challenges reveal the persistence of colonial structures within government agencies. Many public servants lack basic understanding of Indigenous governance systems and struggle to operationalise partnership principles in practice. The Framework’s seven-year timeline acknowledges this reality by fostering continuous dialogue with Indigenous communities, enhancing cultural competency, building technical and governance capacity, and refining policies through ongoing learning, creating an adaptive environment that responds to emerging challenges and insights.19
The Waitangi Sheet of the Treaty of Waitangi, signed between the British Crown and various Māori chiefs in 1840 (William Hobson, James Freeman, and James Busby (English version); Henry Williams and Edward Williams (Māori translation) Wikimedia Commons 2011)
New Zealand’s Administrative Data Transition: Surveillance Capitalism Meets Indigenous Rights
New Zealand’s decision to replace the traditional census with administrative datasets exposes fundamental tensions between state efficiency and Indigenous sovereignty. The shift represents what media scholar Shoshana Zuboff terms“surveillance capitalism”, the systematic extraction of human experience as raw material for predictive products.2 For Māori communities, administrative data collection violates core principles embedded in Te Tiriti o Waitangi (The Treaty of Waitangi). The Treaty’s Māori text guarantees tino rangatiratanga, absolute sovereignty over Māori affairs. Administrative data systems undermine this sovereignty by removing Māori control over how their information is collected, categorised, and used.2 20 21 22 23
The technical architecture of administrative data systems embeds colonial assumptions about identity and belonging. Government databases define Māori primarily through interaction with state services, such as health problems, benefit receipt or criminal justice involvement. This creates what sociologist Tahu Kukutaicalls “deficit data” representations that emphasise pathology while ignoring Māori strengths, cultural practices, and community resilience.2 24 25
The contrast with culturally grounded surveys like Te Kupenga illustrates the difference between extractive and respectful data collection. Te Kupenga was designed by Māori researchers to capture Māori cultural experiences, language use, and spiritual connections to place. Administrative datasets cannot replicate thiscultural richness because they are structured around bureaucratic categories rather than Māori worldviews.2
Te Mana Raraunga’s resistance to the census replacement demonstrates sophisticated understanding of data as taonga (treasure). The Waitangi Tribunal’s recognition that Māori data constitutes taonga protected under Te Tiriti establishes a legal foundation for Māori data sovereignty claims. This legal precedent positions datagovernance as a Treaty issue requiring Crown consultation and Māori consent.24 26 27
International Standard-Setting: from Principles to Practice
The July 2025 UN Expert Mechanism session marked a turning point in international Indigenous data governance advocacy. The Inuit Circumpolar Council’s intervention demonstrated how Indigenous organisations use international forums to challenge dominant data governance frameworks. The InternationalCriminal Court (ICC) Chair Sara Olsvig’s statement that “Inuit data governance means the right of Inuit to autonomously decide what, how and why Inuit data are collected, accessed and used” establishes data sovereignty as fundamental to self-determination rather than just a technical preference.28
The Collective Benefit, Authority to Control, Responsibility, Ethics (CARE) principles represent sophisticated attempts to operationalise Indigenous data sovereignty within existing research infrastructures. Unlike technical standards focused on data interoperability, the CARE principles are centered around Indigenous rights and community benefit. The principles also challenge the “open data” movement’s assumption that maximum accessibility equals maximum social benefit.6 29 30
However, implementing CARE principles requires confronting structural inequalities within research institutions. Universities and government agencies typically lack Indigenous leadership in data governance roles. This absence of Indigenous authority means CARE principles often become checkbox exercises rather than genuine power-sharing arrangements.31
Algorithmic Bias and Digital Colonialism
The expansion of artificial intelligence (AI) systems poses particular threats to Indigenous communities already marginalised within existing data systems. Algorithmic bias in AI reflects what computer scientist Abeba Birhane terms “algorithmic colonisation”, the reproduction of colonial power relations through automated systems.14 29
Facial recognition technologies exemplify these risks. New Zealand’s Privacy Commissioner found that 53% of Māori respondents expressed privacy concerns compared to 47% of the general population. This disparity reflects both historical experiences with surveillance and contemporary risks of misidentification by biometric systems designed primarily for non- Indigenous faces.32 33
The Chiefs of Ontario’s analysis of AI risks for First Nations identifies three primary concerns: algorithmic discrimination, data sovereignty violations, and cultural appropriation. These risks compound existing inequalities while creating new forms of digital exclusion.33 However, Indigenous communities also recognise AI’s potential for cultural revitalisation.
The First Languages AI Reality project demonstrates how AI can support Indigenous language preservation when developed under Indigenous control. For instance, Te Hiku Media in New Zealand developed an automatic speech recognition model for Te Reo Māori with 92% accuracy, outperforming international techgiants through community-led language revitalisation efforts. This approach requires what researchers term“algorithmic sovereignty”, indigenous authority over AI systems affecting Indigenous communities.15 33 34
Free, Prior and Informed Consent in Digital Contexts
Traditional Free, Prior and Informed Consent (FPIC) protocols require adaptation for digital data environments. Digital data creation is continuous and often invisible to those generating the data. Indigenous communities may consent to specific research projects without understanding how their data will be reused in AI systems or linked with other datasets.17
The challenge intensifies with administrative data systems that collect information through routine government interactions. How can Indigenous communities exercise FPIC over data collected during hospital visits, school enrolment, or social service delivery? These transactions occur within systems of structural inequality where genuine refusal may compromise access to essential services.35
Emerging solutions focus on collective consent mechanisms that respect Indigenous governance structures.Some Indigenous organisations develop data governance protocols that establish community-wide standards for data collection and use. These protocols create frameworks for ongoing consent that can adapt to technological changes while maintaining community control.35 36
Capacity Building and Infrastructure Development
Effective Indigenous data sovereignty requires substantial investment in technical capacity and governance infrastructure. Many Indigenous communities lack reliable internet access, digital literacy training, and technical expertise necessary for meaningful participation in data governance.12 37
The digital divide compounds historical disadvantages while creating new forms of exclusion. Indigenous communities without robust digital infrastructure cannot participate effectively in online consultation processes or access digital services. This technological marginalisation reinforces broader patterns of political and economic exclusion.12 Successful capacity building programmes emphasise Indigenous leadership and cultural grounding.

Australia’s Framework acknowledges that technical training must be coupled with governance development that respects Indigenous decision-making processes. This holistic approach recognises that data sovereignty requires both technical capabilities and political authority.
The transformation of digital governance to respect Indigenous data sovereignty requires confronting centuries of colonial practice embedded within contemporary institutions. Australia and New Zealand’s experiences in 2025 demonstrate both the potential for genuine change and the persistence of structural barriers. Success depends not on technical solutions alone, but on fundamental shifts in power relations that position Indigenous communities as authorities over their own information.
The stakes extend beyond data governance to broader questions of justice and self- determination. As digital technologies reshape social relations, Indigenous data sovereignty represents a crucial test of whether societies can learn from colonial histories or remain trapped by them. The choice is not technical, it is political.