User Tools

Site Tools


family_relations

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
family_relations [2023/07/11 11:23]
simone
family_relations [2023/07/11 11:39]
simone
Line 5: Line 5:
 The basis of the family relations data within Lifelines consists of information from the municipal registries (Basisregistratie Personen (BRP)). Based on this information,​ researchers from the [[ugli|UGLI-consortium]] have optimized the family relations using recoded surname data (for privacy purposes) and data from questionnaires ([[family_composition|Family composition]]). The process of this optimization has been documented and can be found here: {{ :​lifelines_familyreconstruction.pdf | Family relations reconstruction}}. The basis of the family relations data within Lifelines consists of information from the municipal registries (Basisregistratie Personen (BRP)). Based on this information,​ researchers from the [[ugli|UGLI-consortium]] have optimized the family relations using recoded surname data (for privacy purposes) and data from questionnaires ([[family_composition|Family composition]]). The process of this optimization has been documented and can be found here: {{ :​lifelines_familyreconstruction.pdf | Family relations reconstruction}}.
  
-These optimized family relations were then verified by the UGLI-consortium using available genetic data. The most recent version of the family relations data (sec_family_relations_v3.1) has been verified by using genetic data from [[gwas|GWAS (CytoSNP)]],​ [[ugli|UGLI 1 (GSA), and UGLI 2 (Affymetrix)]]. ​ More detailed information on this process can be found in the quality control document of UGLI2-Affymetrix:​ QC_report_UGLI2_(release_1)-v1.pdf , page 6 - 9, section 6b and in the readme of the family relations file .+These optimized family relations were then verified by the UGLI-consortium using available genetic data. The most recent version of the family relations data (sec_family_relations_v3.1) has been verified by using genetic data from [[gwas|GWAS (CytoSNP)]],​ [[ugli|UGLI 1 (GSA), and UGLI 2 (Affymetrix)]]. More detailed information on this process can be found in the quality control document of UGLI2-Affymetrix: {{ :​QC_report_UGLI2_(release_1)-v1.pdf ​|QC_report_UGLI2_(release_1)-v1.pdf}}, page 6 - 9, section 6b and in the {{ :​readme_family_relations_v3.1.pdf |readme of the familyrelations}}.
  
-In the latest version of the family relations ​data, couples who share household, but don't have children, have also been included. This provides ​the opportunity to work on research involving environmental factors.+Note that the family relations ​file does not provide information on whether ​family member lives in the same house. Information ​on [[default_variables|postal code 4]] is available by default and data on [[household_composition|household composition]] can be requested in our [[https://​data-catalogue.lifelines.nl/​|catalogue]].
  
 ===== Data and structure ===== ===== Data and structure =====
  
-The data consists of 5 columns:+The data in sec_family_relations_v3.1 ​consists of 5 columns:
   * Participant identifiers   * Participant identifiers
   * Family id   * Family id
Line 19: Line 19:
   * Partner identifier   * Partner identifier
  
-The identifiers can be used to pair the family data to other phenotypic and genotypic data. The same participant identifier can also be found in the column for father or mother and partner. ​This depends on the participant'​s family size and if they are, for example, both a child and a parent. ​Participants who have indicated to have family members ​who are not participating ​in Lifelines have also received a unique identifier, ​but this identifier ​can not be used to pair the data to other Lifelines data. An example ​of this is when two siblings have indicated to have the same parent. The identifiers of these family members, who are not participating ​in Lifelines, start with an 'S' (i.e. S1031206). The family id is used to identify participants that are part of the same family.+The identifiers can be used to pair the family data to other phenotypic and genotypic data. The same participant identifier can also be found in the column for father or mother and partner. ​In which column the identifier can be found, ​depends on the participant'​s family size and if they are, for example, both a child and a parent. ​With the use of self reported data on family members ​like birthdays of parents and siblings, family members shared between siblings could be derived. These are family members that do not participate ​in Lifelines. Other than the family member connection, we do not have any other information on these persons. These derived family members ​have received a unique identifier ​(an S-numberi.e. S1031206). This identifier ​cannot ​be used to pair the data to other Lifelines data. An example is when two siblings have indicated to have the same parent; the (non-participating) parent has then received ​an S-number. 
 +The family id is used to identify participants that are part of the same family. Couples who share a household, but don't have children, also receive a family id. This provides the opportunity to work on research involving environmental factors. Note that partner information is based on [[1A|baseline data]].
  
-===== Genetic data verification ===== 
  
-The latest version of family relations data has been verified using the collected genetic data from CytoSNP, UGLI-GSA, and UGLI2-Affymetrix. In the image below you can see a flow diagram of how this process has been performed: 
- 
-{{::​pedigree_concordance_analysis.png?​direct&​600|}} 
- 
-More detailed information on this process can be found in the quality control document of UGLI2-Affymetrix:​ {{ :​QC_report_UGLI2_(release_1)-v1.pdf |QC_report_UGLI2_(release_1)-v1.pdf}},​ page 6 - 9, section 6b. 
family_relations.txt · Last modified: 2023/07/11 12:14 by simone