Bootstrap weights (1 day ago)
The method: a resampling bootstrap | Two modes | Where the weights are stored (DuckDB path) | Identifying rows | Stratified bootstrap weights | Estimating uncertainty | Incremental re-runs | Reuse — nothing to do | More replicates requested | Rows added to the survey table | Forcing a full regeneration | Multiple weight columns | Filtered input tables | Connection lifecycle | Inspecting and removing weights
canpumf Pipeline Architecture (1 day ago)
High-level flow | Stage 1 — Locate or download | Version resolution | Stage 2 — Parse metadata | Format detection | Parsers | SPSS monolithic (parse_spss_mono) | SPSS split-file (parse_spss_split) | SAS reading cards (parse_sas_cards) | LFS codebook CSV (parse_lfs_codebook) | CPSS variables CSV (parse_cpss_csv) | SPSS .sav (parse_spss_sav) | PDF Data Dictionary (parse_pdf_dictionary) | PDF frequency codebook (parse_pdf_codebook) | Metadata encoding | Merge | Stage 3 — Build DuckDB | Data file selection | FWF vs. CSV | Trailing junk row removal (FWF only) | Data fixups (pre-label) | Bootstrap weight join (BSW) | Numeric conversion | Code labels → factors | DuckDB write and ENUM enforcement | Multi-module surveys | LFS pipeline | Connection provenance registry | Registry configuration | Newest-sibling inheritance
Census (1 day ago)
LFS (1 day ago)
Timelines
Onboarding a new PUMF (1 day ago)
Naming conventions and where to put the files | Smart defaults and the newest-sibling fallback | When the automatic import fails | See what is actually in the directory | Parse the metadata in isolation | Start from an existing entry as a template | Tweak fixups for data-level issues | Build the full table | Promote the configuration into the registry | Summary
Working with canpumf (1 day ago)
Forced moves
Working with multi-module PUMF surveys (1 day ago)
Loading the primary module | Opening a sibling module | Joining modules for analysis | A second example: the Survey of Household Spending | Cleaning up | Database connections | Notes
Working with multi-module PUMF surveys (7 days ago)
Loading the primary module | Opening a sibling module | Joining modules for analysis | A second example: the Survey of Household Spending | Cleaning up | Database connections | Notes
Bootstrap weights (12 days ago)
The method: a resampling bootstrap | Two modes | Where the weights are stored (DuckDB path) | Identifying rows | Stratified bootstrap weights | Estimating uncertainty | Incremental re-runs | Reuse — nothing to do | More replicates requested | Rows added to the survey table | Forcing a full regeneration | Multiple weight columns | Filtered input tables | Connection lifecycle | Inspecting and removing weights
canpumf Pipeline Architecture (12 days ago)
High-level flow | Stage 1 — Locate or download | Version resolution | Stage 2 — Parse metadata | Format detection | Parsers | SPSS monolithic (parse_spss_mono) | SPSS split-file (parse_spss_split) | SAS reading cards (parse_sas_cards) | LFS codebook CSV (parse_lfs_codebook) | CPSS variables CSV (parse_cpss_csv) | SPSS .sav (parse_spss_sav) | PDF Data Dictionary (parse_pdf_dictionary) | PDF frequency codebook (parse_pdf_codebook) | Metadata encoding | Merge | Stage 3 — Build DuckDB | Data file selection | FWF vs. CSV | Trailing junk row removal (FWF only) | Data fixups (pre-label) | Bootstrap weight join (BSW) | Numeric conversion | Code labels → factors | DuckDB write and ENUM enforcement | Multi-module surveys | LFS pipeline | Connection provenance registry | Registry configuration | Newest-sibling inheritance
Census (12 days ago)
Working with canpumf (12 days ago)
Forced moves
LFS (18 days ago)
Timelines
Onboarding a new PUMF (18 days ago)
Naming conventions and where to put the files | Smart defaults and the newest-sibling fallback | When the automatic import fails | See what is actually in the directory | Parse the metadata in isolation | Start from an existing entry as a template | Tweak fixups for data-level issues | Build the full table | Promote the configuration into the registry | Summary
Demo (4 months ago)
Get a list of property tax related datasets | Get metadata for tax report | Get an overview of land and building values in RS zones | Get data for property tax report and property polygons | Compute and plot relative land values
Isolines (4 months ago)
Additional datasets: Annual T1FF taxfiler data (8 months ago)
Background | Example usage: constructing a multi-year series of families in low-income status
Additional datasets: Structural type of dwelling by document type (8 months ago)
Background | Example usage: buildings unoccupied vs not occupied by usual residents
cancensus (8 months ago)
Cancensus and CensusMapper | API Key | Installing cancensus | Accessing Census Data | Census Datasets | Census Regions | Census Geographic Levels | Working with Census Variables | Displaying available Census variables | Variable characteristics | Variable search | Managing variable hierarchy
Data discovery (8 months ago)
Census datasets | Variable vectors | View available Census variable vectors | Searching for Census variable vectors | Exact search | Keyword search | Semantic search | Census regions | Standard Geographical Classification | A note on Census Metropolitan Areas and Census Agglomerations | Aside: dissemination areas, blocks, and enumeration areas | Viewing available Census regions | Searching through named Census regions | Exploring Census variable vectors and regions interactively
Finding intersecting geometries from custom data (8 months ago)
A simple example | Addendum
Making maps with cancensus (8 months ago)
Spatial data in cancensus | Maps with base R graphics | Maps with ggplot2 | Interactive maps with leaflet
StatCan WDS (8 months ago)
Word of caution | Ukrainians by Federal Electoral Districts
Getting started with the cansim package (11 months ago)
About | Installing cansim | Usage | License | Attribution
Working with large tables (11 months ago)
Working with cached tables | parquet | feather | sqlite | Filtering and loading into memory | traditional | Working with the data | Partitioning | Repartitioning | Keeping track of cached data | Removing cached data
Partial table data download (1 year ago)
Using vectors instead of coordinates
Rental Universe (3 years ago)
StatCan attribute files (3 years ago)
Background | Match between Census Tracts and Census Subdivisions
Basic usage (4 years ago)
General TongFen (5 years ago)
Population change
TongFen for Canadian census data (5 years ago)
Aggregating up data across regions | Change in Vancouver children
Estimating Canadian data on custom geographies (5 years ago)
Polling Districts (5 years ago)
TongFen for US census data (6 years ago)