Synthetic Population Generation and Validation Proposal

Problem Statement

The Problem: We currently lack a real population dataset to test the synthetic population (synth-pop) method effectively.

Axioms

  1. A synthetic population should be as close to a twin of the target population as possible.
  2. The synthetic population generation method should be applicable to all populations.
  3. We can create a precise artificial population using deterministic methods.

Hypothesis

The synthetic population generation method should be able to create a population that is a twin of an artificial population

Methodology

Deterministic Rules for Artificial Population Generation:

  • Create sample LOAs with a mixed makeup.
  • Take a subsection of the artificial population for survey purposes.
  • Conduct a census on the full artificial population.

Comparison Methods:

  • Compare the results between:
    • IPF + Monte Carlo
    • Just Synthetic Annealing
    • GAN + Simulated Annealing (offer this to Daniel)

Proposed Steps

Step 1: Create a National Artifical Population

Expected Outcomes

Conclusion

This proposal outlines a comprehensive approach to generating and validating a synthetic population using deterministic methods and various comparison techniques. The results will provide valuable insights into the accuracy and applicability of the synthetic population generation method.