Overview: Formalization Steps
This can be used as a checklist for your course report. The details for each step are explained in the linked slides.
- Set up technical infrastructure.
- Create a fresh Github repository and a folder on Google Drive (or a comparable collaborative document) for the tables.
- Link the latter in the README of the repository.
- Invite the lecturer as a collaborator: Settings → Collaborators → “Add people”
- Choose your starting point:
- Existing theory –> Narrative Theory Reconstruction (NTR)
- Robust phenomenon –> Theory Construction Methodology (TCM)
- Define the scope
- Define scope of original literature that is used as basis for the formalization.
- Limit the number of phenomena that your model is supposed to explain.
- Limit the number of constructs and their relationships.
- Collect robust phenomena
- Evaluate the generalizability separately for each UTOS dimension
- Evaluate the strength of evidence (Quality of studies? Number of studies? Large samples?)
- Create your Construct Source Table
- Create your VAST display (see slides on VAST and VAST application)
- Distill a working definition for each construct
- Add relationships between constructs
- Formalize model: Create your Variable Table (see slides on Mathematical Modeling)
- List all concepts of your VAST display that are not higher-order concepts (because higher-order concepts typically are not reducible to a single metric)
- Define the scale level, the possible range of values, and semantic anchors
- Formalize model: Create your Relationship Table (see slides on Mathematical Modeling)
- Develop a mathematical function for each endogenous variable in the VAST display.
- What functional form does the theory imply? What are free parameters that can “bend” the function? Which parameters can be fixed?
- Program the functions in
R
- Build atomic functions based on the Relationship Table
- Document the functions with roxygen2
- Combine atomic functions into one “super function” that takes all exogenous variables of the model as input and computes the focal output variable (typically, an observable behavior). Make sure that it can take vectors as input.
- Evaluate the model behavior
- Try different parameter settings. Under which conditions does the model produce the phenomenon?
- Plot the model behavior: Choose a focal predictor for the x-axis, focal outcome on y-axis, additional parameters as colors, shapes, or multiple plots.
- Simulate the model: Would a real study (with noise) be able to detect the phenomenon?
- Simulate data
- Do the statistical analysis on your simulated data: Is the focal effect significant?
- What is the necessary sample size to reliably detect the phenomenon (i.e., with a high statistical power)?