update dockerfile to custom ghedesigner version #266

mpraprost · 2025-01-03T19:09:10Z

Pull request overview

Fixes #ISSUENUMBERHERE (IF RELEVANT)

Pull Request Author: Marley

This pull request makes changes to (select all the apply):

Author pull request checklist:

Review Checklist

This will not be exhaustively relevant to every PR.

Perform a code review on GitHub
All related changes have been implemented: data and method additions, changes, tests
If fixing a defect, verify by running develop branch and reproducing defect, then running PR and reproducing fix
Reviewed change documentation
Ensured code files contain License reference
Results differences are reasonable
Make sure the newly added measures has been added with tests and indexed properly
CI status: all tests pass

ComStock Licensing Language - Add to Beginning of Each Code File

# ComStock™, Copyright (c) 2023 Alliance for Sustainable Energy, LLC. All rights reserved.
# See top level LICENSE.txt file for license terms.

1. thermalzone initialize check 2. options_lookup clean up

…at-measures For test - fix bugs for df lighting and tstat measures

bring measure changes from fan_updates_6 branch

change to stds method

…ss sampling v2.

This commit should not be merged back to main after the SDR is complete.

Updates column definitions to match: 1. sampling-related changes (rentable_area -> building_area, floor_area_category) 2. column name changes in that have occurred in the reporting measure: (furnace -> gas_coil, gas_coil -> primary/secondary) 3. seasonal daily average emissions 4. disable reporting of columns unused by any ComStock systems.

… & savings and emission intensity & savings (#257) * initial implementation for test * add more * updates fix bugs in added functions, update naming conversion and unit conversion, update column_definition * update column definitions and downselect a few entries * initial implementation for test * add more * updates fix bugs in added functions, update naming conversion and unit conversion, update column_definition * update column definitions and downselect a few entries * not include these var by default * clean up unnecessary condition

Use future pandas behavior, which we handled by casting column type after .replace() calls. Suppresses warning messages when creating CBECS objects.

…error catching remediation and style cleanups.

… wenyi/euss_r3

Move the functionality to export the metadata in OEDI format from telescope into comstockpostproc to accommodate the improved geospatial resolution.

Removes unused or duplicative variables. Should not change calculation functionality.

A few geography columns above the tract level exist in the fkt after creation. Drop all but the tract column from fkt, and join on all the geography columns from the spatial lookup based on the tract after joining onto results for the geography.

Enable loading of apportionment data from parquet file which is written to disk after successful creation during an earlier run. Because apportionment can be slow, this can significantly speed up work on postprocessing.

Consolidated four individual unweighted savings column methods into one

Reduce info messages to make it easier to see warnings and errors

Now that multiple data sources have the same column name it is necessary to downselect to the unique set.

fkt creation is non-deterministic. Cache fkt so it can be reused when a data export is interrupted and needs to be restarted. Hive partition the ComStock wide data to enable much faster filtering down to the upgrade, which was a major bottleneck in exporting data for each geography.

Use the 'detailed' keyword to export all columns present in the raw data, primarily used for internal testing and debugging.

@wenyikuang

Actually compress gz file, code from @wenyikuang

Collect all geographic aggregates in one frame then split the collected frame into individual geographies. This speeds up the processing significantly for by_state aggregates.

PUMAs should be added to fkt during creation, so this change is temporary.

For aggregates, the full dataframe may be collected without memory issues. For non-aggregates, need to collect a dataframe per geography to avoid memory issues.

If there is no aggregation, collect the entire first-level geography at once and then sub-divide this to separate files later.

Allows an array of variables to be used when aggregating. Helpful when you need multiple geographic variables like state and climate zone included in the output files.

Fixes national-scale partitioning to work when no aggregation is supplied.

Reduce the bottleneck for metadata export by parallelizing writing metadata files

Updates postprocessing for new sampling / increased geographic resolution

…lures in the utility bill measure

…quirements.

JieXiong9119 and others added 30 commits November 4, 2024 03:02

fix bugs

7f3a04f

1. thermalzone initialize check 2. options_lookup clean up

fix rebound control bug

d1146a9

update options_lookup for tstat

cc2d9c5

Merge pull request #250 from NREL/jx/fix-bugs-for-df-lighting-and-tst…

3e69b40

…at-measures For test - fix bugs for df lighting and tstat measures

bring measure changes from fam_updates_6 branch

45b9ab5

Merge pull request #251 from NREL/ghp_measure_updates

74d3f07

bring measure changes from fan_updates_6 branch

change to stds method

89d8333

Merge pull request #253 from NREL/ghp_measure_updates

1f01ef4

change to stds method

drop the raising exception of downselect columns in comstock for bypa…

1270810

…ss sampling v2.

drop the estimated_size since we are using lazyframe now.

68aaae1

Fixed the column naming following the other column patterns.

d4b80f2

sampling changes to support new bucket development approach.

b721ee7

make sure outstanding changes are accounted for.

07f51b3

Merge branch 'rhorsey/more-euss-changes' into wenyi/euss_r3

f90b68b

Reverts to pre-emissions-fix emissions columns

b490cd3

This commit should not be merged back to main after the SDR is complete.

Filtered out known missing columns.

89181e4

Add weight to self.data no matter how.

a2e2ac6

Adding fix for Primary School Size Bin 0

c708b75

Handle Pandas deprecation warning for silent type downcasting

689c876

Use future pandas behavior, which we handled by casting column type after .replace() calls. Suppresses warning messages when creating CBECS objects.

Removing very small schools (under 2k sqft) where possible plus some …

935b893

…error catching remediation and style cleanups.

Merge branch 'wenyi/euss_r3' of https://github.com/nrel/comstock into…

65e9379

… wenyi/euss_r3

Add metadata_and_annual_results export capability

0dae8fa

Move the functionality to export the metadata in OEDI format from telescope into comstockpostproc to accommodate the improved geospatial resolution.

Cleans up unweighted savings calcs

15eed72

Removes unused or duplicative variables. Should not change calculation functionality.

Handle duplicate geography columns

ee8e32a

A few geography columns above the tract level exist in the fkt after creation. Drop all but the tract column from fkt, and join on all the geography columns from the spatial lookup based on the tract after joining onto results for the geography.

Enable apportionment to be reloaded from cache

ac9ab13

Enable loading of apportionment data from parquet file which is written to disk after successful creation during an earlier run. Because apportionment can be slow, this can significantly speed up work on postprocessing.

Combined unweighted savings calculations

2e4f061

Consolidated four individual unweighted savings column methods into one

Reduce terminal output

01a618c

Reduce info messages to make it easier to see warnings and errors

Downselect to unique columns for export

bad1408

Now that multiple data sources have the same column name it is necessary to downselect to the unique set.

asparke2 and others added 6 commits December 30, 2024 13:35

Export all columns with no downselection

b51f3d1

Use the 'detailed' keyword to export all columns present in the raw data, primarily used for internal testing and debugging.

Removes metadata_index and orders basic columns

4f54f8a

Fixes csv.gz creation

2c3fb15

Actually compress gz file, code from @wenyikuang

Collect all geographies at once

2e9658d

Collect all geographic aggregates in one frame then split the collected frame into individual geographies. This speeds up the processing significantly for by_state aggregates.

update dockerfile to custom ghedesigner version

a26285c

mpraprost requested a review from ChristopherCaradonna January 3, 2025 19:09

mpraprost assigned ChristopherCaradonna Jan 3, 2025

Praprost and others added 22 commits January 3, 2025 13:00

new update to dockerfile

fd01ec7

Fix raw un-aggregated and adds PUMA to fkt

60a8b7e

PUMAs should be added to fkt during creation, so this change is temporary.

Handle aggregate and non-aggregate differently

a4a7c73

For aggregates, the full dataframe may be collected without memory issues. For non-aggregates, need to collect a dataframe per geography to avoid memory issues.

Collect first-level geographies for multi-level no-aggregations

a24497d

If there is no aggregation, collect the entire first-level geography at once and then sub-divide this to separate files later.

Multi-variable aggregations in export_metadata

ccc3a86

Allows an array of variables to be used when aggregating. Helpful when you need multiple geographic variables like state and climate zone included in the output files.

Fixes national-scale partitioning

e91a230

Fixes national-scale partitioning to work when no aggregation is supplied.

Parallelize export_metadata

7c157c0

Reduce the bottleneck for metadata export by parallelizing writing metadata files

Merge pull request #275 from NREL/wenyi/euss_r3

2c3cf2b

Updates postprocessing for new sampling / increased geographic resolution

Create compare_comstock_to_cbecs.py

eaf134e

Minimal change to run the integrated test.

7b52294

reset unitary sys temp after sizing run

29b3c95

Merge branch 'sdr_2024_r2' into ghedesigner_fix

22ff942

revert this change because it was just for testing and is causing fai…

a827376

…lures in the utility bill measure

Merge branch 'sdr_2024_r2' into ghedesigner_fix

f225901

Appened serveral tests in integrated tests.

24a32dc

Set up the Dockerfile for ci.

17d2916

Loose the restriction of the format of csv.

c233870

Bumped utility bill measure and dockerfile to py3.11 to align with re…

1dddcc7

…quirements.

Don't reload the csv in CI system.

288073b

Merge branch 'sdr_2024_r2' into ghedesigner_fix

7d68b3b

Use the right ssl cert for updating python3.11 in build/Dockerfile.

3e84adc

Merge branch 'sdr_2024_r2' into ghedesigner_fix

2aef965

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

update dockerfile to custom ghedesigner version #266

update dockerfile to custom ghedesigner version #266

mpraprost commented Jan 3, 2025

update dockerfile to custom ghedesigner version #266

Are you sure you want to change the base?

update dockerfile to custom ghedesigner version #266

Conversation

mpraprost commented Jan 3, 2025

Pull request overview

Pull Request Author: Marley

Review Checklist

ComStock Licensing Language - Add to Beginning of Each Code File