rendered from notes/Roofing/data/gold_variable_inventory.md

Gold Variable Inventory

Generated by scripts/roofing/inventory_v3.py against the Gold parquet on disk (data/sandbox/roofing_audit/gold_<FIPS>/). Role rules: see notes/Roofing/steps/01_variable_inventory.md. Spec ratified 2026-05-20 rev2.
Originator is the field's provenance per the BuildZoom data dictionary (buildzoom_permit_dictionary.md): permit = primary AHJ/owner source text; BuildZoom = BuildZoom-derived classification (a secondary machine guess); 8020rei-ETL = added by the pipeline. PROJECT_TYPE is BuildZoom-derived, not an AHJ field, so a signal-tier scheme must not rank it above DESCRIPTION.
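
The ranking constraint above can be checked mechanically. The sketch below is illustrative only: `LABEL_SIGNAL_ORIGINATOR` and `tier_violations` are hypothetical names, the originators are copied from the master role table, and flagging every BuildZoom-derived column (rather than PROJECT_TYPE alone) is an assumption of the sketch. Among the LABEL_SIGNAL columns the two readings coincide, because PROJECT_TYPE is the only BuildZoom-derived one.

```python
# Originator of each LABEL_SIGNAL column, copied from the master role table.
LABEL_SIGNAL_ORIGINATOR = {
    "BUSINESS_NAME": "permit",
    "DESCRIPTION": "permit",
    "JOB_VALUE": "permit",
    "PROJECT_NAME": "permit",
    "PROJECT_TYPE": "BuildZoom",
    "SUBTYPE": "permit",
    "TYPE": "permit",
}


def tier_violations(ranking):
    """Return BuildZoom-derived columns that are ranked above DESCRIPTION.

    `ranking` is a list of column names ordered from strongest to weakest
    signal (a hypothetical representation of a signal-tier scheme).
    """
    if "DESCRIPTION" not in ranking:
        raise ValueError("ranking must include DESCRIPTION")
    cutoff = ranking.index("DESCRIPTION")
    # Everything before the cutoff outranks DESCRIPTION.
    return [
        col
        for col in ranking[:cutoff]
        if LABEL_SIGNAL_ORIGINATOR.get(col) == "BuildZoom"
    ]


bad = tier_violations(["PROJECT_TYPE", "DESCRIPTION", "TYPE"])
good = tier_violations(["DESCRIPTION", "TYPE", "PROJECT_TYPE"])
```

Here `bad` lists PROJECT_TYPE as a violation, while `good` is empty because DESCRIPTION comes first.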

Role summary

| Role | Count |
|---|---|
| LABEL_SIGNAL | 7 |
| FEATURE | 23 |
| METADATA | 22 |
| DROP | 2 |
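
As a consistency check, the role counts can be re-tallied from the column-to-role assignments in the master role table. This is a minimal sketch, not how scripts/roofing/inventory_v3.py produces the summary; `ROLE_COLUMNS`, `role_of`, and `role_counts` are hypothetical names, and the column lists are transcribed by hand from this page.

```python
from collections import Counter

# Columns grouped by role, transcribed from the master role table.
ROLE_COLUMNS = {
    "LABEL_SIGNAL": [
        "BUSINESS_NAME", "DESCRIPTION", "JOB_VALUE", "PROJECT_NAME",
        "PROJECT_TYPE", "SUBTYPE", "TYPE",
    ],
    "FEATURE": [
        "APPLIED_DATE", "AUX_EFFECTIVE_STATUS_DATE", "AUX_PERMIT_STATUS",
        "CANCELLED_DATE", "CBSA_FIPS", "CBSA_NAME", "CITY", "COMPLETED_DATE",
        "CONTRACTOR_ID", "FEES", "HOMEOWNER", "INITIAL_STATUS",
        "INITIAL_STATUS_DATE", "ISSUED_DATE", "LATEST_STATUS",
        "LATEST_STATUS_DATE", "LATITUDE", "LONGITUDE", "PLACE_NAME",
        "SQUARE_FEET", "STREET", "SUBDIVISION", "ZIP_CODE",
    ],
    "METADATA": [
        "BUILDING_PERMIT_ID", "COUNTY_FIPS", "COUNTY_NAME", "DateSnapshot",
        "FA_PROPERTYID", "FIPS", "ID_PREFIX", "PARCEL_NUMBER",
        "PARTITION_YEAR", "PERMIT_JURISDICTION", "PERMIT_NUMBER", "PLACE_FIPS",
        "Partition_FIPS_Salting", "PropertyID", "STATE", "STREET4JOIN1",
        "_hoodie_commit_seqno", "_hoodie_commit_time", "_hoodie_file_name",
        "_hoodie_partition_path", "_hoodie_record_key", "ts",
    ],
    "DROP": ["STORIES", "UNITS"],
}

# Flatten to column -> role; a column listed under two roles would
# silently collapse here, so the length check below guards against that.
role_of = {col: role for role, cols in ROLE_COLUMNS.items() for col in cols}
assert len(role_of) == sum(len(cols) for cols in ROLE_COLUMNS.values())

role_counts = Counter(role_of.values())
```

The tally covers 54 columns in total and should agree with the counts in the role summary.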

Master role table

| column | dtype | role | originator | null_p50 | null_max | dist_p50 | n_fips |
|---|---|---|---|---|---|---|---|
| BUSINESS_NAME | VARCHAR | LABEL_SIGNAL | permit | 25.58% | 99.96% | 78324 | 31/31 |
| DESCRIPTION | VARCHAR | LABEL_SIGNAL | permit | 12.77% | 61.23% | 872488 | 31/31 |
| JOB_VALUE | DOUBLE | LABEL_SIGNAL | permit | 49.22% | 100.00% | 58109 | 31/31 |
| PROJECT_NAME | VARCHAR | LABEL_SIGNAL | permit | 89.67% | 100.00% | 106608 | 31/31 |
| PROJECT_TYPE | VARCHAR[] | LABEL_SIGNAL | BuildZoom | 11.43% | 39.09% | 30228 | 31/31 |
| SUBTYPE | VARCHAR | LABEL_SIGNAL | permit | 77.09% | 99.95% | 590 | 31/31 |
| TYPE | VARCHAR | LABEL_SIGNAL | permit | 1.40% | 49.85% | 979 | 31/31 |
| APPLIED_DATE | DATE | FEATURE | permit | 31.85% | 79.43% | 10819 | 31/31 |
| AUX_EFFECTIVE_STATUS_DATE | DATE | FEATURE | BuildZoom | 1.67% | 19.34% | 12524 | 31/31 |
| AUX_PERMIT_STATUS | VARCHAR | FEATURE | BuildZoom | 1.24% | 17.17% | 12 | 31/31 |
| CANCELLED_DATE | DATE | FEATURE | permit | 90.19% | 100.00% | 8827 | 31/31 |
| CBSA_FIPS | VARCHAR | FEATURE | BuildZoom | 1.39% | 7.25% | 28 | 31/31 |
| CBSA_NAME | VARCHAR | FEATURE | BuildZoom | 1.39% | 7.25% | 28 | 31/31 |
| CITY | VARCHAR | FEATURE | BuildZoom | 0.00% | 0.03% | 212 | 31/31 |
| COMPLETED_DATE | DATE | FEATURE | permit | 40.78% | 99.97% | 10028 | 31/31 |
| CONTRACTOR_ID | VARCHAR | FEATURE | BuildZoom | 39.80% | 100.00% | 22363 | 31/31 |
| FEES | DOUBLE | FEATURE | permit | 33.64% | 99.98% | 14310 | 31/31 |
| HOMEOWNER | VARCHAR | FEATURE | permit | 39.68% | 100.00% | 253426 | 31/31 |
| INITIAL_STATUS | VARCHAR | FEATURE | permit | 0.84% | 17.16% | 12 | 31/31 |
| INITIAL_STATUS_DATE | DATE | FEATURE | permit | 0.18% | 8.24% | 11717 | 31/31 |
| ISSUED_DATE | DATE | FEATURE | permit | 19.96% | 60.44% | 10704 | 31/31 |
| LATEST_STATUS | VARCHAR | FEATURE | permit | 0.84% | 17.16% | 13 | 31/31 |
| LATEST_STATUS_DATE | DATE | FEATURE | permit | 0.18% | 8.24% | 12479 | 31/31 |
| LATITUDE | DOUBLE | FEATURE | BuildZoom | 0.00% | 0.00% | 164886 | 31/31 |
| LONGITUDE | DOUBLE | FEATURE | BuildZoom | 0.00% | 0.00% | 181759 | 31/31 |
| PLACE_NAME | VARCHAR | FEATURE | BuildZoom | 1.59% | 47.13% | 108 | 31/31 |
| SQUARE_FEET | INTEGER | FEATURE | permit | 92.93% | 100.00% | 3254 | 31/31 |
| STREET | VARCHAR | FEATURE | BuildZoom | 0.00% | 0.00% | 272719 | 31/31 |
| SUBDIVISION | VARCHAR | FEATURE | permit | 97.40% | 100.00% | 2289 | 31/31 |
| ZIP_CODE | INTEGER | FEATURE | BuildZoom | 1.45% | 98.11% | 142 | 31/31 |
| BUILDING_PERMIT_ID | VARCHAR | METADATA | BuildZoom | 0.00% | 0.00% | 1924184 | 31/31 |
| COUNTY_FIPS | INTEGER | METADATA | BuildZoom | 0.00% | 0.00% | 1 | 31/31 |
| COUNTY_NAME | VARCHAR | METADATA | BuildZoom | 0.00% | 0.00% | 1 | 31/31 |
| DateSnapshot | DATE | METADATA | 8020rei-ETL | 0.00% | 0.00% | 9 | 31/31 |
| FA_PROPERTYID | INTEGER[] | METADATA | 8020rei-ETL | 0.00% | 0.00% | 219932 | 31/31 |
| FIPS | INTEGER | METADATA | 8020rei-ETL | 0.00% | 0.00% | 1 | 31/31 |
| ID_PREFIX | VARCHAR | METADATA | 8020rei-ETL | 0.00% | 0.00% | 44 | 31/31 |
| PARCEL_NUMBER | VARCHAR | METADATA | permit | 83.97% | 100.00% | 80437 | 31/31 |
| PARTITION_YEAR | INTEGER | METADATA | 8020rei-ETL | 0.00% | 0.00% | 39 | 31/31 |
| PERMIT_JURISDICTION | VARCHAR | METADATA | BuildZoom | 0.00% | 0.00% | 37 | 31/31 |
| PERMIT_NUMBER | VARCHAR | METADATA | permit | 0.00% | 0.02% | 1578637 | 31/31 |
| PLACE_FIPS | VARCHAR | METADATA | BuildZoom | 1.59% | 47.13% | 118 | 31/31 |
| Partition_FIPS_Salting | VARCHAR | METADATA | 8020rei-ETL | 0.00% | 0.00% | 5 | 31/31 |
| PropertyID | VARCHAR | METADATA | BuildZoom | 0.00% | 0.00% | 275792 | 31/31 |
| STATE | VARCHAR | METADATA | BuildZoom | 0.00% | 0.00% | 11 | 31/31 |
| STREET4JOIN1 | VARCHAR | METADATA | 8020rei-ETL | 0.00% | 0.00% | 272531 | 31/31 |
| _hoodie_commit_seqno | VARCHAR | METADATA | 8020rei-ETL | 0.00% | 0.00% | 1924184 | 31/31 |
| _hoodie_commit_time | VARCHAR | METADATA | 8020rei-ETL | 0.00% | 0.00% | 1 | 31/31 |
| _hoodie_file_name | VARCHAR | METADATA | 8020rei-ETL | 0.00% | 0.00% | 66 | 31/31 |
| _hoodie_partition_path | VARCHAR | METADATA | 8020rei-ETL | 0.00% | 0.00% | 1 | 31/31 |
| _hoodie_record_key | VARCHAR | METADATA | 8020rei-ETL | 0.00% | 0.00% | 1924184 | 31/31 |
| ts | TIMESTAMP WITH TIME ZONE | METADATA | 8020rei-ETL | 0.00% | 0.00% | 1 | 31/31 |
| STORIES | INTEGER | DROP | permit | 96.30% | 100.00% | 14 | 31/31 |
| UNITS | INTEGER | DROP | permit | 98.98% | 100.00% | 73 | 31/31 |
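
The statistic column names suggest per-FIPS aggregates, although this page does not define them. The sketch below shows one plausible reading, stated as an assumption: null_p50 and null_max as the median and maximum of the per-FIPS null percentage, dist_p50 as the median per-FIPS distinct count, and n_fips as the number of FIPS partitions in which the column is present. It is not taken from scripts/roofing/inventory_v3.py; `summarize_column` and the input values are hypothetical.

```python
from statistics import median


def summarize_column(per_fips, total_fips):
    """Aggregate per-FIPS stats for one column into a table-style row.

    `per_fips` maps a FIPS code to (null_pct, n_distinct) for the column;
    only FIPS partitions where the column exists are included.
    Assumed semantics, not the inventory script's confirmed definitions.
    """
    null_pcts = [null_pct for null_pct, _ in per_fips.values()]
    distincts = [n_distinct for _, n_distinct in per_fips.values()]
    return {
        "null_p50": f"{median(null_pcts):.2f}%",
        "null_max": f"{max(null_pcts):.2f}%",
        # int() truncates a .5 median when the number of FIPS is even.
        "dist_p50": int(median(distincts)),
        "n_fips": f"{len(per_fips)}/{total_fips}",
    }


# Made-up stats for three placeholder partitions (not real data).
example = {
    "fips_a": (10.0, 100),
    "fips_b": (20.0, 300),
    "fips_c": (60.0, 200),
}
row = summarize_column(example, total_fips=3)
```

Under this reading, a column with a low null_p50 but a high null_max (ZIP_CODE, for example) would be well populated in most FIPS partitions and nearly empty in at least one.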