IDI Data Dictionary: Child, Youth and Family data

June 2015 edition

Crown copyright © This work is licensed under the Creative Commons Attribution 3.0 New Zealand licence. You are free to copy, distribute, and adapt the work, as long as you attribute the work to Statistics NZ and abide by the other licence terms. Please note you may not use any departmental or governmental emblem, logo, or coat of arms in any way that infringes any provision of the Flags, Emblems, and Names Protection Act 1981. Use the wording ‘Statistics New Zealand’ in your attribution, not the Statistics NZ logo. Liability While all care and diligence has been used in processing, analysing, and extracting data and information in this publication, Statistics New Zealand gives no warranty it is error free and will not be liable for any loss or damage suffered by the use directly, or indirectly, of the information in this publication. Citation Statistics New Zealand (2015). IDI Data Dictionary: Child, Youth and Family data (June 2015 edition). Available from www.stats.govt.nz. ISSN 2423-0952 (online) Published in June 2015 by Statistics New Zealand Tatauranga Aotearoa Wellington, New Zealand Contact Statistics New Zealand Information Centre: [email protected] Phone toll-free 0508 525 525 Phone international +64 4 931 4600 www.stats.govt.nz

Contents 1 Purpose of this data dictionary ....................................................................................5 2 Background ....................................................................................................................6 3 About the Child, Youth and Family data .....................................................................7 Coverage .........................................................................................................................7 Methodology ....................................................................................................................7 Quality information ...........................................................................................................7 Privacy, security, or confidentiality issues .......................................................................7 List of datasets.................................................................................................................8 4 Data dictionary for CYF abuse findings details table ................................................9 Dataset description ..........................................................................................................9 Dataset variables .............................................................................................................9 5 Data dictionary for CYF abuse findings events table ..............................................12 Dataset description ........................................................................................................12 Dataset variables ...........................................................................................................12 6 Data dictionary for CYF intakes table........................................................................17 Dataset description ........................................................................................................17 Dataset variables ...........................................................................................................17 7 Data dictionary for CYF intakes events table ...........................................................20 Dataset description ........................................................................................................20 Dataset variables ...........................................................................................................20 8 Data dictionary for CYF details about placement events table ..............................25 Dataset description ........................................................................................................25 Dataset variables ...........................................................................................................25 9 Data dictionary for CYF placement events table......................................................27 Dataset description ........................................................................................................27 Dataset variables ...........................................................................................................27 10 Data dictionary for CYF identity cluster table ..........................................................32 Dataset description ........................................................................................................32 Dataset variables ...........................................................................................................32 11 Data dictionary for sociocultural characteristics of a person table ......................36 Dataset description ........................................................................................................36 Dataset variables ...........................................................................................................36 Ethnicity classification ....................................................................................................43 12 Data dictionary for date information and ‘from dates’ of events table ..................44 3

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

Dataset description ........................................................................................................44 Dataset variables ...........................................................................................................44 Unknown, not applicable, and invalid records ...............................................................47 13 Data dictionary for date information and ‘to dates’ of events table ......................48 Dataset description ........................................................................................................48 Dataset variables ...........................................................................................................48 Appendix ...........................................................................................................................51

4

1 Purpose of this data dictionary IDI Data Dictionary: Child, Youth and Family data (June 2015 edition) documents the content of the Child, Youth and Family (CYF) datasets the Ministry of Social Development (MSD) provides to Statistics New Zealand to use in the Integrated Data Infrastructure (IDI). This dictionary gives information on the variables contained in the datasets from 1991 – including technical information and descriptions. Use this data dictionary if you are interested in understanding and accessing CYF data in the IDI.

5

2 Background The Ministry of Social Development (MSD) provides Child, Youth and Family (CYF) data to Statistics NZ for integration into the IDI. The CYF datasets contain data about a child or young person (CYP), where:  it is believed that a CYP is being or is likely to be harmed, ill-treated, abused, neglected, or deprived;  it is believed that a CYP is alleged to have committed an offence;  a concern is raised about the CYP’s behaviour or insecurity of care. This CYP is captured in the dataset where this concern is raised or report made to either CYF, the Police (other enforcement agency), Youth Court or Family Court. The datasets currently include facts about identities (ie person identities, identity characteristics, and relationships) and they capture these facts over time. They also capture information about the events in the life of a person at a specific time or over a specific period. These events can be one of the following:  proceedings  actions  measures  trials  procedures  dealings  occasions  happenings. The dataset is operational and is used by MSD to process business operations.

6

3 About the Child, Youth and Family data Coverage Reference period start: 30 September 1991 Reference period end: ongoing. Geographic coverage: New Zealand excluding Waiheke Island and the Chatham Islands. Target population: total New Zealand population. Observed population: people who have had any interaction with MSD Analysis unit: person, event, time, and cost.

Methodology Type of data: administrative data capture. Data collector: MSD captures the data from their operational systems. Frequency of data collection: MSD systems update data on a daily basis. Data in the IDI is refreshed regularly, approximately every three months, to include the most up-todate data available. Typically the data is extracted with a two-month lag to account for changes made by MSD process operations.

Quality information Editing: data cleansing has been done on the data where applicable according to MSD internal business rules. Other quality issues: known data issues, edits, or anomalies are detailed within relevant dataset sections in this document.

Privacy, security, or confidentiality issues The CYF tables that are accessible to researchers do not contain any name or address information to identify an individual. All researchers who have access to the CYF data have had their research proposals assessed using Statistics NZ’s microdata access protocols and only approved researchers who have been granted access by Statistics NZ and the Ministry of Social Development may view the CYF data. Read Statistics NZ’s microdata access protocols. All outputs produced from CYF data must be aggregated and counts suppressed if the underlying unrounded count is fewer than 6.

7

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

List of datasets CYF_ABUSE_DETAILS CYF_ABUSE_EVENT CYF_INTAKES_DETAILS CYF_INTAKES_EVENT CYF_PLACEMENTS_DETAILS CYF_PLACEMENTS_EVENT CYF_IDENTITY_CLUSTER CYF_SOCIOCULTURAL_CHARACTERISTICS CYF_FROM_DATE CYF_TO_DATE

8

4 Data dictionary for CYF abuse findings details table Dataset description Contents of dataset: holds the abuse findings details pertaining to the event data (CYF_ABUSE_DETAILS).

Dataset variables Variable group: cyf_abuse_details

9

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

Classification name

Variable definition

Source agency variable name

snz_uid *

Integer

A global unique identifier created by Statistics NZ. There is a snz_uid for each distinct identity in the IDI. This identifier is changed and reassigned each refresh.

snz_msd_uid

Integer

A local unique identifier derived by Statistics NZ from the source agency’s unique identifier(s). This identifier will remain the same for an identity across refreshes. Where Statistics NZ receives more information during a subsequent refresh that indicates that two or more identities represent the same identity, the identifier may change. The snz_msd_uid represents a distinct identity in the MSD benefits, CYF, and MSD data for student loans and Working For Families.

Integer

A unique identifier created by Statistics NZ for each distinct event. This variable is the primary key for each event table. Combining it with cyf_abe_event_from_datetime makes a row of information unique.

composite_event_id

Integer

A unique identifier created by Statistics NZ for each distinct perpetrator.

perp_system_prsn_id_ value

snz_composite_event_ uid

snz_perp_prsn_uid

Y

10

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

Classification name

Variable definition

Source agency variable name

snz_org_unit_uid

Integer

A unique identifier created by Statistics NZ for each distinct organisation unit identified by MSD.

organisation_unit_cod e

cyf_abd_extracted_dat etime*

Date time eg yyyy-mm-dd hh:mm:ss.fff

The date and time at which data was extracted from MSD systems. Note: mandatory field in MSD’s operational systems.

data_extracted_dateti me

cyf_abd_business_are a_type_code

Varchar (20)

Business area eg CNP = Care and Protection, YJU = Youth Justice

business_area_type_c ode

cyf_abd_social_work_ phase_id_nbr

Integer

Social work phases under which social work is conducted.

social_work_phase_id

business_area_type_co de

* Mandatory

11

Dictionary of Child, Youth and Family data in the Integrated Data Infrastructure

5 Data dictionary for CYF abuse findings events table Last updated on: 19 February 2015

Dataset description Contents of dataset: holds details pertaining to CYF abuse findings events (CYF_ABUSE_EVENT). An abuse finding event records the assessment that a social worker makes about whether or not a client has suffered abuse. This is a point-in-time event and so the cyf_abe_event_from_datetime and cyf_abe_event_to_datetime will be the same. This is the ‘assessment date’ of the related assessment. Historically, this date is entered by the social worker on the narrative held on the assessment record, but now it is the ‘date created’ in the same phase. There is one event for every combination of client, perpetrator, and abuse type. There will often be multiple abuse findings event records for a client because there may be multiple notifications for a client, each requiring an investigation. The same client may have more than one type of abuse within the same period (eg physically and sexually abused). Similarly, a client may have the same type of abuse more than once for the same notification, as a result of more than one perpetrator subjecting the client to the same abuse. For example, a child is neglected by both parents. Findings are recorded as past involvement types in PSTINVTP_CODE.

Past involvement types code SEX

Past involvement types

PHY

Physically abused by

EMO

Emotionally abused by

NEG

Neglected by

SHS

Self harm / suicidal

BRD

Behavioural / relationship difficulties

NTF

Not found

Sexually abused by

Dataset variables Variable group: cyf_abuse_event

12

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

Classification name

Variable definition A global unique identifier created by Statistics NZ. There is a snz_uid for each distinct identity in the IDI. This identifier is changed and reassigned each refresh.

Source agency variable name

snz_uid*

Integer

snz_msd_uid

Integer

A local unique identifier derived by Statistics NZ from the source agency’s unique identifier(s). This identifier will remain the same for an identity across refreshes. Where Statistics NZ receives more information during a subsequent refresh that indicates that two or more identities represent the same identity, the identifier may change. The snz_msd_uid represents a distinct identity in the MSD benefits, CYF, and MSD data for student loans and Working For Families.

current_unique_identit y_id

Integer

A unique identifier created by Statistics NZ for each distinct event. This variable is the primary key for each event table. Combining it with cyf_abe_event_from_datetime makes a row of information unique.

composite_event_id

snz_composite_event_ uid

Y

13

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

Classification name

Variable definition

Source agency variable name

snz_systm_prsn_uid

Integer

An identifier created by Statistics NZ based on the system person ID from MSD systems.

system_prsn_id_value

cyf_abe_event_type_wi d_nbr

Tinyint Numeric

The value will be ‘12’ for an abuse findings event.

event_type_wid

cyf_abe_source_uk_va r1_text

Varchar (50)

(Unique key variable 1 value) For this abuse findings table, this is the assessment_id. See Appendix for more information.

source_table_uk_var1 _value

cyf_abe_source_uk_va r2_text

Varchar (50)

(Unique key variable 2 value) For this abuse findings table, this is the assessment_finding_type_code. See Appendix for more information.

source_table_uk_var2 _value

snz_uk_var3_uid

Integer

Encrypted unique key variable 3 value. Contains the third variable in the event which makes up the unique identifying key to the event. In this abuse findings table this is the perpetrator_person_id Note: this variable is the same variable as snz_perp_prsn_uid (cyf_abuse_details).

source_table_uk_var3 _value

source_table_uk_var2_ name

14

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

Classification name

Variable definition

Source agency variable name

cyf_abe_source_uk_va r4_text

Integer

This variable will not contain any data.

source_table_uk_var4 _value

cyf_abe_event_from_d atetime

Datetime yyyy-mmddThh:mm:ss.f fffff

Event from date and time.

event_from_datetime

cyf_abe_event_to_date time

Datetime yyyy-mmddThh:mm:ss.f fffff

Event to date and time.

event_to_datetime

cyf_abe_event_from_d ate_wid_date

Date yyyy-mm-dd

Event from date. This variable can be used to join with the cyf_from_date table.

event_from_date_wid

cyf_abe_event_to_date _wid_date

Date yyyy-mm-dd

Event to date, this variable can be used to join with the cyf_to_date table.

event_to_date_wid

cyf_abe_number_of_da ys_nbr

Integer

Number of days in the event so far. Abuse events are a point-in-time event so cyf_abe_event_from_datetime and cyf_abe_event_to_datetime will be the same and this variable will always be equal to 1.

number_of_days

15

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

Classification name

Variable definition

Source agency variable name

cyf_abe_direct_daily_n ett_amt

Decimal (9,3)

Direct daily nett cost

direct_daily_nett_cost

cyf_abe_direct_daily_gr oss_amt

Decimal (9,3)

Direct daily gross cost

direct_daily_gross_co st

cyf_abe_indirect_daily_ nett_amt

Decimal (9,3)

Indirect daily nett cost

indirect_daily_nett_co st

cyf_abe_indirect_daily_ gross_amt

Decimal (9,3)

Indirect daily gross cost

indirect_daily_gross_c ost

cyf_abe_count_nbr

tinyint

Count

count

cyf_abe_extracted_dat etime

Datetime yyyy-mmddThh:mm:ss.f fffff

Data extraction datetime

data_extracted_dateti me

* Mandatory

16

Dictionary of Child, Youth and Family data in the Integrated Data Infrastructure

6 Data dictionary for CYF intakes table Last updated on: 19 February 2015

Dataset description Contents of dataset: holds the details pertaining to the intake event (CYF_INTAKES_DETAILS). A care and protection client intake event occurs when a person who believes a child or young person (CYP) is being (or is likely to be) harmed, ill-treated, abused, neglected, or deprived, reports the matter to CYF or the Police. CYF also receive reports when there are concerns regarding a child or young person's behaviour, or insecurity of care. A youth justice client intake event occurs when a child or young person is alleged to have committed an offence and the matter is referred by the Police (or other enforcement agency), Youth Court, or Family Court. Where a child or young person appears before the court, they may also be placed in the custody of CYF following arrest. The client intake event start date is the incident date of the notification to CYF and the end date is the end date of the client role in the intake phase.

Dataset variables Variable group: cyf_intakes_details

17

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name snz_composite_event_ui d

Primary key Y

Format

Classification name

Variable definition A unique identifier created by Statistics NZ for each distinct event. This variable is the primary key for each event table.

Integer

Source agency variable name composite_event_i d

cyf_ind_extracted_dateti me

Date time yyyy-mm-dd hh:mm:ss.fff

The date and time at which data was extracted from MSD.

data_extracted_dat etime

cyf_ind_social_work_ph ase_id_nbr

Integer

Social work phase. Social work phases under which social work is conducted.

social_work_phase _id

cyf_ind_rollfwd_sw_phas e_id_nbr

Integer

Rollforward social work phase ID is the phase ID into which the event is being rolled into.

rollforward_sw_pha se_id

cyf_ind_fgc_referral_ind

Varchar (3)

Family group conference indicator eg Y, N

source_fgc_referral _yn

cyf_ind_cnp_notification _ind

Varchar (3)

Care and protection notification e.g Y, N

cnp_notification_yn

cyf_ind_business_area_t ype_code

Varchar (20)

business_area_type_ code

Business area

business_area_typ e_code

cyf_ind_intake_type_cod e

Varchar (60)

intake_type_code

Intake type code

intake_type_code

cyf_ind_final_urgency_ty pe_code

Varchar (20)

final_urgency_type_c ode

Final urgency type code

final_urgency_type _code

18

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

Classification name

Variable definition

cyf_ind_final_outcome_a ctn_code

Varchar (20)

Final_outcome_action Final outcome action code _type_code eg FAR, NFA, ADD

cyf_ind_final_outcome_t ype_code

Varchar (20)

final_outcome_type_c ode

cyf_ind_crnt_org_unit_ui d

Source agency variable name final_outcome_acti on_type_code

Final outcome type code eg CRT, FAR, FARCFA

final_outcome_type _code

Integer

Current organisation unit uid. A unique identifier created by Statistics NZ for each distinct current organisation unit identified by MSD.

crnt_organisation_ unit_code

cyf_ind_rcvd_org_unit_ uid

Integer

Received organisation unit uid. A unique identifier created by Statistics NZ for each distinct current organisation unit identified by MSD.

rcvd_organisation_ unit_code

cyf_ind_refrd_org_unit_ uid

Integer

Referred organisation unit uid. A unique identifier created by Statistics NZ for each distinct current organisation unit identified by MSD.

refrd_organisation_ unit_code

cyf_ind_intake_ref_statu s_code

Varchar (20)

Intake referral status code

intake_referral_stat us_code

cyf_ind_contact_method _code

Varchar (20)

contact_method_cod e

Contact method code

contact_method_c ode

cyf_ind_notifier_role_typ e_code

Varchar (20)

notifier_role_type_co de

Notifier role type code

notifier_role_type_ code

19

Dictionary of Child, Youth and Family data in the Integrated Data Infrastructure

7 Data dictionary for CYF intakes events table Last updated on: 19 February 2015

Dataset description Contents of dataset: holds client intake event for the integrated person view (CYF_INTAKES_EVENT). A care and protection client intake event occurs when a person who believes a child or young person (CYP) is being (or is likely to be) harmed, ill-treated, abused, neglected, or deprived, reports the matter to CYF or the Police. CYF also receive reports when there are concerns regarding a child or young person's behaviour, or insecurity of care. A youth justice client intake event occurs when a child or young person is alleged to have committed an offence and the matter is referred by the Police (or other enforcement agency), Youth Court, or Family Court. Where a child or young person appears before the court, they may also be placed in the custody of CYF following arrest. The client intake event start date is the incident date of the notification to CYF, and the end date is the dnd date of the client role in the intake phase.

Dataset variables Variable group: cyf_intake_event

20

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

Classification name

Variable definition

snz_uid*

Integer

A global unique identifier created by Statistics NZ. There is a snz_uid for each distinct identity in the IDI. This identifier is changed and reassigned each refresh.

snz_msd_uid

Integer

A local unique identifier derived by Statistics NZ from the source agency’s unique identifier(s). This identifier will remain the same for an identity across refreshes. Where Statistics NZ receives more information during a subsequent refresh that indicates that two or more identities represent the same identity, the identifier may change. The snz_msd_uid represents a distinct identity in the MSD benefits, CYF, and MSD data for student loans and Working For Families.

Tinyint

Source agency variable name

current_unique_identit y_id

event_type_wid

cyf_ine_event_type_wi d_nbr

21

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

snz_composite_event_ uid

Y

Classification name

Variable definition

Source agency variable name

Integer

A unique identifier created by Statistics NZ for each distinct event. This variable is the primary key for each event table. Combining it with cyf_abe_event_from_datetime makes a row of information unique.

composite_event_id

snz_systm_prsn_uid

Integer

An identifier created by Statistics NZ based on the system person ID from MSD systems.

system_prsn_id_value

cyf_ ine _source_uk_var1_text

Varchar (50)

(Unique key variable 1 value) For this intakes table, this is the notification_record_id. See Appendix for more information.

source_table_uk_var1 _value

cyf_ ine _source_uk_var2_text

Varchar (50)

This variable will not contain any data.

source_table_uk_var2 _value

cyf_ ine _source_uk_var3_text

Varchar (50)

This variable will not contain any data.

source_table_uk_var3 _value

cyf_ ine _source_uk_var4_text

Varchar (50)

This variable will not contain any data.

source_table_uk_var4 _value

22

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

Classification name

Variable definition

Source agency variable name

cyf_ ine _event_from_datetime

Datetime yyyy-mmddThh:mm:ss.f fffff

Event from date and time

event_from_datetime

cyf_ ine _event_to_datetime

Datetime yyyy-mmddThh:mm:ss.f fffff

Event to date and time

event_to_datetime

cyf_ ine _event_from_date_wid _date

Date yyyy-mm-dd

Event from date, this variable can be used to join with the cyf_from_date table.

event_from_date_wid

cyf_ ine _event_to_date_wid_d ate

Date yyyy-mm-dd

Event to date, this variable can be used to join with the cyf_to_date table.

event_to_date_wid

cyf_ ine _number_of_days_nbr

Integer

Number of days in the event so far.

number_of_days

cyf_ ine _direct_daily_nett_amt

Decimal (9,3)

Direct daily nett cost

direct_daily_nett_cost

cyf_ ine _direct_daily_gross_a mt

Decimal (9,3)

Direct daily gross cost

direct_daily_gross_co st

23

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

Classification name

Variable definition

Source agency variable name

cyf_ ine _indirect_daily_nett_a mt

Decimal (9,3)

Indirect daily nett cost

indirect_daily_nett_co st

cyf_ ine _indirect_daily_gross_a mt

Decimal (9,3)

Indirect daily gross cost

indirect_daily_gross_c ost

cyf_ ine _count_nbr

tinyint

Count

count

cyf_ ine _extracted_datetime

Datetime yyyy-mmddThh:mm:ss.f fffff

Data extraction datetime

data_extracted_dateti me

* Mandatory

24

Dictionary of Child, Youth and Family data in the Integrated Data Infrastructure

8 Data dictionary for CYF details about placement events table Last updated on: 19 February 2015

Dataset description Contents of dataset: holds the details pertaining to the placement event (CYF_PLACEMENTS_DETAILS).

Dataset variables Variable group: cyf_placements_details

25

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name snz_composite_event_ uid

Primary key Y

Format

Classification name

Variable definition A unique identifier created by Statistics NZ for each distinct event. This variable is the primary key for each event table.

Integer

Source agency variable name composite_event_i d

cyf_pld_extracted_datet ime

Date time yyyy-mm-dd hh:mm:ss.fff

The date and time at which data was extracted from MSD.

data_extracted_dat etime

cyf_pld_open_suspensi on_date

Date yyyy-mm-dd

Latest open suspension date

open_suspension_ date

cyf_pld_placement_typ e_code

Varchar (20)

placement_type_code

Placement type code

placement_type_co de

cyf_pld_output_class_c ode

Varchar (20)

output_class_code

Output class code

output_class_code

snz_org_unit_uid

Integer

A unique identifier created by Statistics NZ for each distinct current organisation unit identified by MSD.

organisation_unit_c ode

cyf_pld_business_area _type_code

Varchar (20)

Business area

business_area_typ e_code

cyf_pld_social_work_ph ase_id_nbr

Integer

Social work phases under which social work is conducted.

social_work_phase _id

cyf_pld_open_suspensi on_ind

Varchar (1)

Suspension flag eg Y, N

open_suspension_ yn

cyf_pld_full_time_ind

Varchar (1)

Placement full-time flag eg Y, N

full_time_yn

business_area_type_co de

26

Dictionary of Child, Youth and Family data in the Integrated Data Infrastructure

9 Data dictionary for CYF placement events table Last updated on: 19 February 2015

Dataset description Contents of dataset: holds CYF placement event for the integrated client person/cost/time event table (CYF_PLACEMENTS_EVENT). A placement event occurs when a placement record is created for a client. Some placement records for a given client may overlap. An example of this is where a placement is in force but then a respite placement (perhaps for a few days) occurs for the same client, who then returns to the original placement after the respite placement.

Dataset variables Variable group: cyf_placements_event

27

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

snz_uid*

Integer

snz_msd_uid

Integer

cyf_ple_event_type_ wid_nbr

Tinyint

snz_composite_event _uid

Y

Classification name

Variable definition A global unique identifier created by Statistics NZ. There is a snz_uid for each distinct identity in the IDI. This identifier is changed and reassigned each refresh. A local unique identifier derived by Statistics NZ from the source agency’s unique identifier(s). This identifier will remain the same for an identity across refreshes. Where Statistics NZ receives more information during a subsequent refresh that indicates that two or more identities represent the same identity, the identifier may change. The snz_msd_uid represents a distinct identity in the MSD benefits, CYF, and MSD data for student loans and Working For Families.

Source agency variable name

current_unique_identit y_id

event_type_wid

Integer

A unique identifier created by Statistics NZ for each distinct event. This variable is the primary key for each event table. Combining it with cyf_abe_event_from_datetime makes a row of information unique.

28

composite_event_id

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

Classification name

Variable definition

Source agency variable name

snz_systm_prsn_uid

Integer

An identifier created by Statistics NZ based on the system person ID from MSD systems.

system_prsn_id_value

cyf_ple_source_uk_v ar1_text

Varchar (50)

(Unique key variable 1 value) For this placements table it is the placement_id. See Appendix for more information.

source_table_uk_var1 _value

cyf_ ple _source_uk_var2_tex t

Varchar (50)

This variable will not contain any data.

source_table_uk_var2 _value

cyf_ ple _source_uk_var3_tex t

Varchar (50)

This variable will not contain any data.

source_table_uk_var3 _value

cyf_ ple _source_uk_var4_tex t

Varchar (50)

This variable will not contain any data.

source_table_uk_var4 _value

cyf_ ple _event_from_datetim e

Datetime yyyy-mmddThh:mm:ss.f fffff

Event from date and time

event_from_datetime

29

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

Classification name

Variable definition

Source agency variable name

cyf_ ple_event_to_datetim e

Datetime yyyy-mmddThh:mm:ss.f fffff

Event to date and time

event_to_datetime

cyf_ ple_event_from_date _wid_date

Date yyyy-mm-dd

Event from date, this variable can be used to join with the cyf_from_date table.

event_from_date_wid

cyf_ ple_event_to_date_wi d_date

Date yyyy-mm-dd

Event to date, this variable can be used to join with the cyf_to_date table.

event_to_date_wid

cyf_ ple_number_of_days _nbr

Integer

Number of days in the event so far.

number_of_days

cyf_ ple_direct_daily_nett_ amt

Decimal (9,3)

Direct daily nett cost

direct_daily_nett_cost

cyf_ ple_direct_daily_gros s_amt

Decimal (9,3)

Direct daily gross cost

direct_daily_gross_co st

cyf_ ple_indirect_daily_net t_amt

Decimal (9,3)

Indirect daily nett cost

indirect_daily_nett_co st

30

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

Classification name

Variable definition

Source agency variable name

cyf_ ple_indirect_daily_gro ss_amt

Decimal (9,3)

Indirect daily gross cost

indirect_daily_gross_c ost

cyf_ ple_count_nbr

Tinyint

Count

count

cyf_ ple_extracted_dateti me

Datetime yyyy-mmddThh:mm:ss.f fffff

Data extraction datetime

data_extracted_dateti me

* Mandatory

31

Dictionary of Child, Youth and Family data in the Integrated Data Infrastructure

10 Data dictionary for CYF identity cluster table Last updated on: 19 February 2015

Dataset description Contents of dataset: holds the matched cluster groups and the distinctive characteristics for a person: gender, date of birth, given names, last name, and date of death (CYF_IDENTITY_CLUSTER). This table can be used for any CYF event to join to any other CYF event information for a ‘person’.

Dataset variables Variable group: cyf_identity_cluster

32

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

Classification name

Variable definition

snz_uid*

Integer

A global unique identifier created by Statistics NZ. There is a snz_uid for each distinct identity in the IDI. This identifier is changed and reassigned each refresh.

snz_msd_uid

Integer

A local unique identifier derived by Statistics NZ from the source agency’s unique identifier(s). This identifier will remain the same for an identity across refreshes. Where Statistics NZ receives more information during a subsequent refresh that indicates that two or more identities represent the same identity, the identifier may change. The snz_msd_uid represents a distinct identity in the MSD benefits, CYF, and MSD data for student loans and Working For Families. This is based on the current_unique_identity_id, which is assigned to a distinct person. Where a person’s identity record has been matched against other identity records, this source agency variable will contain the final identity ID to which this identity record belongs at a point in time. Therefore, records that had been identified by MSD as belonging to the same person will have the same snz_msd_uid. This snz_msd_uid will stay the same for a person across refreshes except where cross-referencing during a subsequent refresh has indicated that two records are the same person and then the uid may change.

33

Source agency variable name

current_unique_id entity_id

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

Classification name

Variable definition

Source agency variable name

snz_systm_ prsn_uid

Integer

The source system identity for each person.

system_prsn_id_v alue

snz_crnt_systm_prsn _uid

Integer

The current source system identity for each person. If a person’s identity has been matched against other identities, this column will contain the final identity to which this identity record belongs to at a point in time.

current_system_pr sn_id_value

snz_from_msd_uid

Varchar (20)

Contains the ID that is distinct to each record, ie records that have been identified as the official and alias identity of a person will have different snz_from_msd_uid (because their unique_identity_ids are different).

unique_identity_id

cyf_idc_identity_type_ text

Char (20)

Identity role type within the MSD system eg official, alias, duplicate, not applicable

identity_type

cyf_idc_role_type_tex t

Varchar (20)

Role type eg other, client

role_type

cyf_idc_source_syste m_code

Varchar (50)

The MSD source system where the identity originated eg C0YS

source_system_co de

cyf_idc_systm_persn_ id_var_text

Varchar (50)

The MSD source system information eg PERSON_ID

system_prsn_id_v ar_name

cyf_idc_sex_code

Varchar (100)

The sex of the person ie male = 1, female = 2, unknown

gender

34

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

Classification name

Variable definition

Source agency variable name

cyf_idc_birth_month_ nbr

Tinyint

The birth month of the person

dob

cyf_idc_birth_year_nb r

Smallint

The birth year of the person

dob

cyf_idc_death_month _nbr

Tinyint

The month of death

dod

cyf_idc_death_year_n br

Smallint

The year of death

dod

cyf_idc_extracted_dat etime

Datetime yyyy-mmddThh:mm: ss.ffffff

Data extraction datetime

* Mandatory

35

Dictionary of Child, Youth and Family data in the Integrated Data Infrastructure

11 Data dictionary for sociocultural characteristics of a person table Last updated on: 19 February 2015

Dataset description Contents of dataset: holds the sociocultural characteristics of a person (CYF_SOCIOCULTURAL_CHARACTERISTICS). Characteristics have been standardised across MSD systems and the number of characteristics are expanded as the need for them arises.

Dataset variables Variable group: cyf_sociocultural_characteristics

36

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

Classification name

Variable definition

snz_uid*

Integer

A global unique identifier created by Statistics NZ. There is a snz_uid for each distinct identity in the IDI. This identifier is changed and reassigned each refresh.

snz_msd_uid

Integer

A local unique identifier derived by Statistics NZ from the source agency’s unique identifier(s). This identifier will remain the same for an identity across refreshes. Where Statistics NZ receives more information during a subsequent refresh that indicates that two or more identities represent the same identity, the identifier may change. The snz_msd_uid represents a distinct identity in the MSD benefits, CYF, and MSD data for student loans and Working For Families.

Source agency variable name

unique_identity_id

For this table, the source agency variable used to create snz_msd_uid was the unique_identity_id (and not the current_unique_identity_id), as only the records where unique_identity_id = current_unique_identity_id (in identity_cluster table) were populated in the sociocultural table. cyf_soc_ethnic_1_co de

Varchar (20)

The primary ethnicity recorded for the person in the CYF data.

37

ethnicity_1_code

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

Classification name

Variable definition

Source agency variable name

cyf_soc_ethnic_2_co de

Varchar (20)

The 2nd ethnicity code recorded for the person identity in the CYF data.

ethnicity_2_code

cyf_soc_ethnic_3_co de

Varchar (20)

The 3rd ethnicity code recorded for the person identity in the CYF data.

ethnicity_3_code

cyf_soc_ethnic_4_co de

Varchar (20)

The 4th ethnicity code recorded for the person identity in the CYF data.

ethnicity_4_code

cyf_soc_ethnic_5_co de

Varchar (20))

The 5th ethnicity code recorded for the person identity in the CYF data.

ethnicity_5_code

cyf_soc_ethnic_6_co de

Varchar (20)

The 6th ethnicity code recorded for the person identity in the CYF data.

ethnicity_6_code

cyf_soc_ethnic_asian _ind

Varchar (1)

Yes/no flag for whether the person is part of the Asian ethnicity category. eg Y , N

ethnicity_asian_yn

cyf_soc_ethnic_europ ean_ind

Varchar (1)

Yes/no flag for whether the person is part of the European ethnicity category. eg Y , N

ethnicity_europea n_yn

cyf_soc_ethnic_maori _ind

Varchar (1)

Yes/no flag for Statistics NZ level 1 classification for Māori ethnicity category, where one of the six ethnicity codes will begin with a 2 eg Y , N

ethnicity_maori_yn

38

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

Classification name

Variable definition

Source agency variable name

cyf_soc_ethnic_mela a_ind

Varchar (1)

Yes/no flag for Statistics NZ level 1 classification for Middle Eastern / Latin American / African (MELAA) ethnicity category, where one of the six ethnicity codes will begin with a 5 eg Y , N

ethnicity_melaa_y n

cyf_soc_ethnic_other _ind

Varchar (1)

Yes/no flag for Statistics NZ level 1 classification for Other ethnicity category, where one of the six ethnicity codes will begin with a 6 eg Y , N

ethnicity_other_yn

cyf_soc_ethnic_pacifi c_ind

Varchar (1)

Yes/no flag for Statistics NZ level 1 classification for Pacific peoples ethnicity category, where one of the six ethnicity codes will begin with a 3 eg Y , N

ethnicity_pacific_p eople_yn

cyf_soc_ethnic_resid ual_ind

Varchar (1)

Yes/no flag for Statistics NZ level 1 classification for Māori ethnicity category, where one of the six ethnicity codes will begin with a 9. These are where ethnicity is not known or not stated. eg Y , N

ethnicity_residual_ yn

cyf_soc_extracted_da tetime

Datetime yyyy-mmddThh:mm:ss .ffffff

Data extraction datetime

39

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

Classification name

Variable definition

cyf_soc_ethnic1_snz _code

Varchar (6)

The primary ethnicity recorded for the person, using the Statistics NZ Standard Classification 2005 code.

cyf_soc_ethnic2_snz _code

Varchar (6)

The 2nd ethnicity code recorded for the person identity, using the Statistics NZ Standard Classification 2005 code.

cyf_soc_ethnic3_snz _code

Varchar (6)

The 3rd ethnicity code recorded for the person identity, using the Statistics NZ Standard Classification 2005 code.

cyf_soc_ethnic4_snz _code

Varchar (6)

The 4th ethnicity code recorded for the person identity, using the Statistics NZ Standard Classification 2005 code.

cyf_soc_ethnic5_snz _code

Varchar (6)

The 5th ethnicity code recorded for the person identity, using the Statistics NZ Standard Classification 2005 code.

cyf_soc_ethnic6_snz _code

Varchar (6)

The 6th ethnicity code recorded for the person identity, using the Statistics NZ Standard Classification 2005 code.

cyf_soc_ethnic_grp1_ snz_ind*

bit

An indicator created by Statistics NZ to categorise a person by their ethnicity group according to the Statistics NZ ethnicity classification. Where 1 appears in this column it indicates that the person has given their ethnicity as European. eg 1, 0

40

Source agency variable name

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

Classification name

Variable definition

cyf_soc_ethnic_grp2_ snz_ind*

bit

An indicator created by Statistics NZ to categorise a person by their ethnicity group according to the Statistics NZ ethnicity classification. Where 1 appears in this column it indicates that the person has given their ethnicity as Māori. eg 1, 0

cyf_soc_ethnic_grp3_ snz_ind*

bit

An indicator created by Statistics NZ to categorise a person by their ethnicity group according to the Statistics NZ ethnicity classification. Where 1 appears in this column it indicates that the person has given their ethnicity as Pacific eoples. eg 1, 0

cyf_soc_ethnic_grp4_ snz_ind*

bit

An indicator created by Statistics NZ to categorise a person by their ethnicity group according to the Statistics NZ ethnicity classification. Where 1 appears in this column it indicates that the person has given their ethnicity as Asian. eg 1, 0

cyf_soc_ethnic_grp5_ snz_ind*

bit

An indicator created by Statistics NZ to categorise a person by their ethnicity group according to the Statistics NZ ethnicity classification. Where 1 appears in this column it indicates that the person has given their ethnicity as Middle Eastern / Latin America / African (MELAA). eg 1, 0

41

Source agency variable name

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name cyf_soc_ethnic_grp6_ snz_ind*

Primary key

Format

Classification name

bit

Variable definition An indicator created by Statistics NZ to categorise a person by their ethnicity group according to the Statistics NZ ethnicity classification. Where 1 appears in this column it indicates that the person has given their ethnicity as Other ethnicity. eg 1, 0

* Mandatory

42

Source agency variable name

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

Ethnicity classification The Ethnicity New Zealand Standard Classification 2005 has been used by MSD and Statistics NZ to classify ethnicities.  The ethnicity code data is recorded in the Statistics NZ level 3 format.  The primary ethnicity for the person identity will be recorded in the cyf_soc_ethnic_1_code variable.  The other non-primary ethnicity codes will be recorded in the cyf_soc_ethnic_2_code – cyf_soc_ethnic_6_code variables.  The non-primary ethnicity codes will be sorted in ascending order.  There is a maximum of six ethnicity codes reported per person identity. In this table, MSD has supplied a cyf_soc_ethnic_code as 541, which does not exist in the Ethnicity New Zealand Standard Classification 2005. This code has therefore been coded to 611 ‘Other ethnicity’ in the cyf_soc_ethnic_snz_code. While this means that this record will have cyf_soc_ethnic_grp6_snz_ind = 1 (‘Other’ group), the original grouping from CYF is cyf_soc_ethnic_melaa_ind = 1 (‘MELAA’ group). We cannot code this record to another cyf_soc_ethnic_snz_code because we cannot break the MELAA category down any further then level 3 without knowing which ethnicity the identity is associated with.

43

12 Data dictionary for date information and ‘from dates’ of events table Last updated on: 19 February 2015

Dataset description Contents of dataset: holds the date information and contains the ‘from dates’ of each event (CYF_FROM_DATE).

Dataset variables Variable group: cyf_from_date

44

IDI variable name

Primary key

Format

Classification name

Variable definition

Source agency variable name date_wid

cyf_frm_date_wid_dat e

Date yyyy-mm-dd

The from date of the event

cyf_frm_calendar_dat e

Date yyyy-mm-dd

The from date of the event

calendar_date_co de

cyf_frm_date_number _format_nbr

Integer

The from date of the event in number format (not a SAS numeric representation) eg 20140101 (YYYYMMDD)

date_in_number_f ormat

cyf_frm_day_in_week _nbr

Tinyint

Day of the week

day_number_in_w eek

cyf_frm_day_name_t ext

Varchar (10)

Day name eg Thursday

day_name

cyf_frm_month_name _text

Varchar (10)

Month name eg January

month_name

cyf_frm_calendar_ye ar_nbr

Smallint

Calendar year eg 2014

calendar_year

cyf_frm_calendar_mo nth_nbr

Tinyint

Calendar month eg 1

calendar_month_n umber

cyf_frm_calendar_qu arter_nbr

Tinyint

Calendar quarter eg 2014Q1

calendar_quarter_ number

45

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

Classification name

Variable definition

Source agency variable name

cyf_frm_year_quarter _text

Varchar(7)

Calendar year quarter code eg 2014Q1

year_quarter

cyf_frm_fiscal_year_t ext

Varchar(7)

Fiscal year eg 2014/15

fiscal_year

cyf_frm_fiscal_month _nbr

Tinyint

Month number in fiscal year ie 1–12

fiscal_month

cyf_frm_fiscal_quarte r_nbr

Tinyint

Quarter number in fiscal year ie 1–4

fiscal_quarter_nu mber

cyf_frm_fiscal_year_q uarter_text

Varchar(9)

Fiscal year quarter code eg2014/15Q1

fiscal_year_quarte r

cyf_frm_weekend_ind

Varchar(1)

Weekend indicator, when the date is on a weekend (Saturday or Sunday) Eg Y,N

weekend_yn

cyf_frm_holiday_ind

Varchar(1)

Public holiday indicator, when the date falls on a public holiday eg Y, N

holiday_yn

cyf_frm_extracted_da tetime

Datetime yyyy-mmddThh:mm:ss.fffff f

Data extraction datetime

data_extracted_da tetime

46

Unknown, not applicable, and invalid records These values apply to both the ‘from_date’ and ‘to_date’ tables.

Date_wid 12DEC9999

calendar_date_code day_name Unknown UNK

day_number_in_week 6

13DEC9999

Not applicable

XXX

7

14DEC9999

Invalid

INV

1

15DEC9999

Exceeds

EXC

2

31DEC9999

Null

NUL

4

01JAN1900

Default

DEF

2

47

13 Data dictionary for date information and ‘to dates’ of events table Last updated on: 19 February 2015

Dataset description Contents of dataset: holds the date information and contains the ‘to dates’ of each event (CYF_TO_DATE).

Dataset variables Variable group: cyf_to_date

48

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

Classification name

Variable definition

Source agency variable name date_wid

cyf_to_date_wid_date

Date yyyy-mm-dd

‘To date’ of the event

cyf_to_calendar_date

Date yyyy-mm-dd

‘To date’ of the event

calendar_date_co de

cyf_to_date_number_fo rmat_nbr

Integer

The ‘to date’ of the event number format (not a SAS numeric representation) eg 20140101 (YYYYMMDD)

date_in_number_f ormat

cyf_to_day_in_week_nb r

Tinyint

Day of the week

day_number_in_w eek

cyf_to_day_name_text

Varchar (10)

Day name eg Thursday

day_name

cyf_to_month_name_te xt

Varchar (10)

Month name eg January

month_name

cyf_to_calendar_year_n br

Smallint

Calendar year eg 2014

calendar_year

cyf_to_calendar_month _nbr

Tinyint

Calendar month eg 1

calendar_month_n umber

cyf_to_calendar_quarte r_nbr

Tinyint

Calendar quarter eg 2014Q1

calendar_quarter_ number

cyf_to_year_quarter_te xt

Varchar(7)

Calendar year quarter code eg 2014Q1

year_quarter

49

IDI Data Dictionary: Child, Youth and Family data (June 2015 edition)

IDI variable name

Primary key

Format

Classification name

Variable definition

Source agency variable name

cyf_to_fiscal_year_text

Varchar(7)

Fiscal year eg 2014/15

fiscal_year

cyf_to_fiscal_month_nb r

Tinyint

Month number in fiscal year ie 1–12

fiscal_month

cyf_to_fiscal_quarter_n br

Tinyint

Quarter number in fiscal year ie 1–4

fiscal_quarter_nu mber

cyf_to_fiscal_year_quar ter_text

Varchar(9)

Fiscal year quarter code eg 2014/15Q1

fiscal_year_quarte r

cyf_to_weekend_ind

Varchar(1)

Weekend indicator, when the date is on a weekend (Saturday or Sunday) eg Y

weekend_yn

cyf_to_holiday_ind

Varchar(1)

Public holiday indicator, when the date falls on a public holiday eg Y, N

holiday_yn

cyf_to_extracted_dateti me

Datetime yyyy-mmddThh:mm:ss.f fffff

Data extraction datetime

data_extracted_da tetime

50

Appendix The event_type table below shows which code or ID the source_uk_varN_text variable represents in each of the placements_event, intakes_event, and abuse_event tables.

These are event_type_description the source variables as we do not put this table into the clean environmen t.event_typ e_wid 12 Client abuse finding event sourced from CYRAS source system

service_line_descript ion

source_table_u k_var1_name

source_table_u k_var2_name

source_table_u k_var3_name

source_table_u k_var4_name

Child, Youth and Family

assessment_id

assessment_find ing_type_code

perpetrator_pers on_id

NULL

2

Client Intake Events sourced from CYRAS source system

Child, Youth and Family

notification_recor d_id

NULL

NULL

NULL

14

Client Placement Event sourced Child, Youth and from CYRAS source system Family

placement_id

NULL

NULL

NULL

51