User Tools

Site Tools


data_structure

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Next revision
Previous revision
data_structure [2021/02/19 12:14]
trynke created
data_structure [2021/02/19 12:22] (current)
trynke
Line 1: Line 1:
 ====== Data structure ====== ====== Data structure ======
 +All unique datapoints within the Lifelines dataset have three pieces of basic metadata attached: the WHO, WHAT, and WHEN metadata.
  
 +===== WHO =====
 +Each datapoint is collected for a specific [[cohort|participant]]. Each participant has certain characteristics,​ which are provided as [[default variables]] in your dataset.
  
 +===== WHAT =====
 +Each datapoint has a specific meaning. The meaning is provided as variable information,​ i.e. the variable code, the label, the datatype, the answer options (if any) en the [[sections|section]] and subsection under which the variable falls.
  
 ===== WHEN ===== ===== WHEN =====
-All datapoints are collected at a specific time and under a specific protocol. ​All the "​when" ​information is structured as follows:+Each datapoint is collected at a specific time and under a specific protocol. ​This information is structured as follows:
  
-  * An **assessment** is the complete collection of data for a given project (see: [[general assessments]] and [[additional assessments]]).  +  * An **assessment** is the complete collection of data for a given project (see: [[general assessments]] and [[additional assessments]]). Each assessment has at least 1 element (and often more). 
-  * An **element** is a part of an assessment for which participants are separately invited and that can be pinpointed to one specific date. Each assessment has at least one element (and often more). For example, general assessment [[1A]] (baseline) for adults has 4 elements: visit 1, visit 2, questionnaire 1 and questionnaire 2. +  * An **element** is a part of an assessment for which participants are separately invited and that can be pinpointed to one specific date. For example, general assessment [[1A]] (baseline) for adults has 4 elements: visit 1, visit 2, questionnaire 1 and questionnaire 2. Additional assessment [[COVQ]] has many elements: one for each consecutive sub-questionnaire. Each element has at least 1 variant (and often more)
-Additional assessment [[COVQ]] has many elements: one for each consecutive sub-questionnaire. +  * A **variant** is part of an element that contains a set of variables that is collected under a unique, coherent protocol. Significant changes to that protocol (i.e. the addition/​removal/​modification of variables, ​a new method used for the measurement,​ new selection criteria) results in a new variant. ​ 
-  * A **variant** is part of an element that contains a set of variables that is collected under a unique, coherent protocol. Significant changes to that protocol (i.e. the addition/​removal/​modification of variables, ​the method used for the measurement,​ new selection criteria) results in a new variant. ​Each element has at least one variant (and often more). ​Variant names are informative codes that will tell you about the context and uniqueness of the variant, as follows: assessment_element_number_description_agegroup_version. For example: ​1a_q_1_paper_18-65_index.+ 
 +Variant names are informative codes that will tell you about the context and uniqueness of the variant, as follows: assessment_element_number_description_agegroup_version ​(1a_q_1_paper_18-65_index).
data_structure.1613733291.txt.gz · Last modified: 2021/02/19 12:14 by trynke