One Register to Rule them All: terminology
An explanation of terms
Integrated Data Infrastructure (IDI)
A large research database that holds de-identified admin data or microdata about people and households. It is rebuilt every quarter and contains data about life events like education, income, benefits, migration, justice, and health.
The data comes from government agencies, Stats NZ surveys, and non-government organisations (NGOs). It is linked together, or integrated, to form the IDI.
The IDI complements the Longitudinal Business Database (LBD), which holds linked microdata about businesses. The two databases are linked through tax data.
Researchers use the IDI to conduct cross-sector research that provides insight into our society and economy.
Integrated Statistical Data System (ISDS)
A register-based statistical system that enables the use of integrated data in a production setting. The ISDS integrates administrative data through up-to-date lists concerning population, places, and businesses, facilitating more effective and efficient data sourcing and ingestion. This system is foundational for utilising administrative data, including for the 2028 Census, ensuring the delivery of high-quality, integrated statistics.
Statistical Register
A register aims to be a complete list of units in a population. Each unit's identity should be available so that the register can be updated and expanded with new variable values for that unit. Complete coverage and known identities are the important features of a register. In New Zealand we have a Statistical Location Register (SLR) and a Statistical Business Register (SBR), and Stats NZ is building a Statistical Person Register (SPR) from the IDI.
Register-based Statistical System
The internationally recognised standard for integrating data across the social, economic, and environmental domains to produce official statistics and support research.
Integrated Person Spine (IPS)
The Integrated Person Spine (IPS) is the central dataset within the Integrated Data Infrastructure (IDI). It serves as the primary person-level dataset to which all other person-level datasets are linked. It is constructed by combining data from key sources, including Inland Revenue's client register, the Ministry of Health's National Health Index, and the Ministry of Business, Innovation and Employment's visa records. Together these sources form a comprehensive list of individuals, which allows other administrative data sources to be connected for research and statistical analysis.
Unique Identifier (UID)
A specific code assigned to an individual within a dataset, such as a tax number, passport number, or driver's licence number. These identifiers facilitate exact linkage between records across different datasets when the same UID is present.
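As a rough illustration of exact linkage, the sketch below joins two small sets of records wherever the same UID value appears in both. The record layouts, the tax_number field, and the exact_link function are all invented for this example; it is not how Stats NZ performs linkage, only a minimal picture of the idea.

```python
# Hypothetical records: field names and values are invented for illustration.
tax_records = [
    {"tax_number": "T001", "income": 52000},
    {"tax_number": "T002", "income": 61000},
    {"tax_number": "T003", "income": 48000},
]
education_records = [
    {"tax_number": "T002", "qualification": "Bachelor"},
    {"tax_number": "T003", "qualification": "Diploma"},
    {"tax_number": "T004", "qualification": "Certificate"},
]


def exact_link(left, right, uid_field):
    """Combine records from two datasets where the UID values match exactly."""
    # Index the right-hand dataset by UID for direct lookup.
    right_by_uid = {rec[uid_field]: rec for rec in right}
    linked = []
    for rec in left:
        match = right_by_uid.get(rec[uid_field])
        if match is not None:
            # Merge the two records into one linked record.
            linked.append({**rec, **match})
    return linked


linked = exact_link(tax_records, education_records, "tax_number")
for row in linked:
    print(row)
```

Only records whose UID is present in both datasets are linked. Records T001 and T004 stay unlinked because their UID appears in one dataset only.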
Persistent Unique Identifier (PUI)
Stats NZ defines a Persistent Unique Identifier (PUI) as a unique and enduring reference assigned to an entity within its data systems. Because the identifier does not change over time, the same entity can be tracked and linked consistently across datasets and statistical collections, which supports accurate analysis and reporting.
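The toy registry below sketches what "persistent" means in practice: an entity receives an identifier the first time it is seen, and every later lookup returns that same identifier, including when a record from another source is attached to the same entity. The PuiRegistry class, the PUI000001 format, and the same_as parameter are assumptions made for this example and do not describe Stats NZ's actual system.

```python
import itertools


class PuiRegistry:
    """Toy registry that gives each entity one enduring identifier (hypothetical design)."""

    def __init__(self):
        self._counter = itertools.count(1)
        self._by_source_id = {}  # (source, source_id) -> PUI

    def assign(self, source, source_id, same_as=None):
        """Return the PUI for a source record, creating one only if needed."""
        key = (source, source_id)
        if key in self._by_source_id:
            # Already known: the identifier never changes.
            return self._by_source_id[key]
        # Reuse an existing PUI when the record is known to be the same entity;
        # otherwise mint a new one.
        pui = same_as if same_as is not None else f"PUI{next(self._counter):06d}"
        self._by_source_id[key] = pui
        return pui


registry = PuiRegistry()
first = registry.assign("tax", "T002")
again = registry.assign("tax", "T002")                   # same record seen later
health = registry.assign("health", "H9", same_as=first)  # same person, other source
other = registry.assign("tax", "T003")                   # a different person
print(first, again, health, other)
```

The first three calls all return the same identifier, while the unrelated record receives a new one.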

