This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
family_relations [2023/07/11 11:30] simone |
family_relations [2025/09/02 12:45] (current) simone |
||
---|---|---|---|
Line 5: | Line 5: | ||
The basis of the family relations data within Lifelines consists of information from the municipal registries (Basisregistratie Personen (BRP)). Based on this information, researchers from the [[ugli|UGLI-consortium]] have optimized the family relations using recoded surname data (for privacy purposes) and data from questionnaires ([[family_composition|Family composition]]). The process of this optimization has been documented and can be found here: {{ :lifelines_familyreconstruction.pdf | Family relations reconstruction}}. | The basis of the family relations data within Lifelines consists of information from the municipal registries (Basisregistratie Personen (BRP)). Based on this information, researchers from the [[ugli|UGLI-consortium]] have optimized the family relations using recoded surname data (for privacy purposes) and data from questionnaires ([[family_composition|Family composition]]). The process of this optimization has been documented and can be found here: {{ :lifelines_familyreconstruction.pdf | Family relations reconstruction}}. | ||
- | These optimized family relations were then verified by the UGLI-consortium using available genetic data. The most recent version of the family relations data (sec_family_relations_v3.1) has been verified by using genetic data from [[gwas|GWAS (CytoSNP)]], [[ugli|UGLI 1 (GSA), and UGLI 2 (Affymetrix)]]. More detailed information on this process can be found in the quality control document of UGLI2-Affymetrix: {{ :QC_report_UGLI2_(release_1)-v1.pdf |QC_report_UGLI2_(release_1)-v1.pdf}}, page 6 - 9, section 6b and in the {{ :readme_family_relations_v3.1.pdf |readme of the familyrelations}}. | + | These optimized family relations were then verified by the UGLI-consortium using available genetic data. The most recent version of the family relations data (sec_family_relations_v3.1; July 2023) has been verified by using genetic data from [[gwas|GWAS (CytoSNP)]], [[ugli|UGLI 1 (GSA), and UGLI 2 (Affymetrix)]]. More detailed information on this process can be found in the quality control document of UGLI2-Affymetrix: {{ :QC_report_UGLI2_(release_1)-v1.pdf |QC_report_UGLI2_(release_1)-v1.pdf}}, page 6 - 9, section 6b and in the {{ :readme_family_relations_v3.1.pdf |readme of the familyrelations}}. |
- | In the latest version of the family relations data, couples who share a household, but don't have children, have also been included. This provides the opportunity to work on research involving environmental factors. | + | Note that the family relations file does not provide information on whether a family member lives in the same house. Information on [[default_variables|postal code 4]] is available in your data by default and data on [[household_composition|household composition]] can be requested through our [[https://data-catalogue.lifelines.nl/|catalogue]]. |
- | ===== Data and structure ===== | + | The family relations file is not available through our catalogue, but can be requested by sending an e-mail to data@lifelines.nl (this does not apply to UGLI consortium cluster users, who can already find the file in pheno_lifelines). If you do not already have access to the family relation file, please also complete our [[https://www.lifelines.nl/researcher/how-to-apply|amendment form]]. |
- | The data consists of 5 columns: | + | ===== Data and Structure ===== |
+ | |||
+ | The data in sec_family_relations_v3.1 consists of 5 columns: | ||
* Participant identifiers | * Participant identifiers | ||
* Family id | * Family id | ||
Line 19: | Line 21: | ||
* Partner identifier | * Partner identifier | ||
- | The identifiers can be used to pair the family data to other phenotypic and genotypic data. The same participant identifier can also be found in the column for father or mother and partner. This depends on the participant's family size and if they are, for example, both a child and a parent. Participants who have indicated to have family members who are not participating in Lifelines have also received a unique identifier, but this identifier can not be used to pair the data to other Lifelines data. An example of this is when two siblings have indicated to have the same parent. The identifiers of these family members, who are not participating in Lifelines, start with an 'S' (i.e. S1031206). The family id is used to identify participants that are part of the same family. | + | The identifiers can be used to pair the family data to other phenotypic and genotypic data. The same participant identifier can also be found in the column for father or mother and partner. In which column the identifier can be found, depends on the participant's family size and if they are, for example, both a child and a parent. With the use of self reported data on family members like birthdays of parents and siblings, family members shared between siblings could be derived. These are family members that do not participate in Lifelines. Other than the family member connection, we do not have any other information on these persons. These derived family members have received a unique identifier (an S-number, i.e. S1031206). This identifier cannot be used to pair the data to other Lifelines data. An example is when two siblings have indicated to have the same parent; the (non-participating) parent has then received an S-number. |
+ | The family id is used to identify participants that are part of the same family. Couples who share a household, but don't have children, also receive a family id. This provides the opportunity to work on research involving environmental factors. Note that partner information is based on [[1A|baseline data]]. | ||
- | ===== Genetic data verification ===== | + | ===== Family Duos and Trios ===== |
+ | From the family relations file the following numbers have been extracted: | ||
- | The latest version of family relations data has been verified using the collected genetic data from CytoSNP, UGLI-GSA, and UGLI2-Affymetrix. In the image below you can see a flow diagram of how this process has been performed: | + | {{:family_relations.png?600|}} |
- | {{::pedigree_concordance_analysis.png?direct&600|}} | + | ===== Twins and Triplets ===== |
+ | Based on birth and family records, the following numbers were identified within Lifelines: | ||
+ | * Twins: 632 pairs (1,264 unique individuals) | ||
+ | * Monozygotic (MZ): 152 pairs | ||
+ | * Dizygotic (DZ): 350 pairs | ||
+ | * Zygosity undetermined: 130 pairs | ||
+ | * Triplets: 6 sets (18 unique individuals). As far as could be ascertained, none of these sets included monozygotic pairs or trios. | ||
+ | The classification of twin zygosity was determined using a combination of DNA data, sex, and questionnaire responses (see Table 1). | ||
+ | ^ Method ^ DZ ^ MZ ^ | ||
+ | | DNA | 134 | 124 | | ||
+ | | Mixed gender | 187 | – | | ||
+ | | Questionnaire completed by both twins | 10 | 15 | | ||
+ | | Questionnaire completed by one twin | 19 | 13 | | ||
+ | | Total | 350 | 152 | |