Hierarchical Causal Models #236

adamrupe · 2024-09-06T16:17:50Z

Closes #278

This PR implements Hierarchical Causal Models (Weinstein and Blei, 2024).

This PR will be ready for review when the following items have been implemented and tested.

- Algorithm 1: Graphical algorithm for collapsing a hierarchical causal graphical model (HCGM). This algorithm transforms the graph of a hierarchical causal model (HCM) into the graph of its collapsed model, following Definition 4.
- Algorithm 2: Graphical algorithm for augmenting a collapsed model. This algorithm adds an augmentation variable to a collapsed HCGM, following Definition 6.
- Algorithm 3: Graphical algorithm for marginalizing an augmented model. This algorithm marginalizes out the parent(s) of an augmentation variable (Section 5.2).
- Causal query pipeline: Uses Algorithms 1-3 (as needed) to check whether a causal query is identifiable in the HCM. Whether Algorithms 2 and 3 are used depends on the causal query, i.e., whether a variable needs to be augmented in (Algorithm 2) and whether another variable then needs to be marginalized out (Algorithm 3).
- HSCM tests
- High-level example (with real-world motivation) that shows how to run a causal query on an HCM

codecov · 2024-09-06T19:42:27Z

Codecov Report

Attention: Patch coverage is 88.12500% with 19 lines in your changes missing coverage. Please review.

Project coverage is 81.27%. Comparing base (05a9456) to head (3af8c66).
Report is 4 commits behind head on main.

Files with missing lines	Patch %	Lines
src/y0/hierarchical.py	88.12%	9 Missing and 10 partials ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #236      +/-   ##
==========================================
+ Coverage   80.87%   81.27%   +0.39%     
==========================================
  Files          50       51       +1     
  Lines        4135     4314     +179     
  Branches      845      981     +136     
==========================================
+ Hits         3344     3506     +162     
- Misses        668      670       +2     
- Partials      123      138      +15

cthoyt · 2024-09-10T10:22:47Z

hi @adamrupe - can you add a checklist into the PR description with the tasks to complete for this PR before it needs review?

Copilot

Copilot reviewed 18 out of 19 changed files in this pull request and generated 1 comment.

Files not reviewed (1)

tox.ini: Language not supported

src/y0/hierarchical.py

pyproject.toml

tests/test_hierarchical.py

cthoyt

I did a major refactor to address the software issues from the last round. The next steps for @adamrupe and @djinnome are:

- Read through the code and familiarize yourselves with the new interface
- Comment on / address all TODOs I left in the code (there aren't many)
- Tests
  - Either implement tests for conversion to HSCM or delete the conversion code
  - Test augment_collapsed_model
- Check the notebook. It used to raise some exceptions, which I replaced with calls to the high-level identify_outcomes API. Please confirm that every case where no estimand is produced because the graph has a single c-component is correct
- Create a high-level, real-world example that demonstrates using all of the code in a story-driven workflow (i.e., do not explain the math, only explain which of the functions you implemented solve the problem). Use https://github.com/y0-causal-inference/y0/blob/main/notebooks/Counterfactual%20Transportability.ipynb as the gold standard for how a great notebook with applications looks

Along the way, please make sure that you check the CI/CD system for automated, objective feedback on code quality. @adamrupe if you're not familiar with how to do this, I am happy to show you

adamrupe · 2025-02-05T22:46:56Z

@cthoyt What's your recommendation for handling merge conflicts in Jupyter notebooks? I need to resolve them before I can pull your updates. I'm also not familiar with the CI/CD system, so if you could talk me through it that would be great.

cthoyt · 2025-02-05T22:58:06Z

@adamrupe before merging, copy your local notebook to your desktop. While merging, throw away everything from your repository's copy and overwrite it with remote. Then, you can think about manually inspecting your notebook on your desktop, and the new version from the remote repo side-by-side.

The best way to avoid this kind of thing is to never leave changes unpushed when you finish working, and to always pull before you start working again.

The short explanation of how to use the CI/CD system is: you can always scroll to the bottom of this pull request (#236) and look at the feedback given by GitHub running our unit tests, linting, and code quality checks.

This is what it looks like to me right now:

You can click on any of the rows with the red x's, and then it will bring you to the page that ran the tests for you. Right now, you will be able to see all of the output from running pytest. You have to scroll up a bit since unfortunately, pytest reports timings and warnings after test failures, but you can see https://github.com/y0-causal-inference/y0/actions/runs/13134504627/job/36646756591?pr=236#step:6:69 for the currently failing test.

Similarly, while you're still getting used to having code quality checks, you will probably see that the linting or type checking scripts also give errors, which you can view in the same way.

In a team setting, the expectation is that you push often and check the feedback CI gives each time. This will help you iteratively make your code better, with fully objective feedback that you don't have to wait on someone else to give you. As an alternative to CI/CD on GitHub, you can run tox, which also gives a reproducible run of the whole test suite.

There's documentation in the README on how to use all of the nice development tools built into this repo at https://github.com/y0-causal-inference/y0?tab=readme-ov-file#%EF%B8%8F-for-developers

If you get caught up on any parts of this that aren't self-explanatory, I'm happy to plan a video chat tomorrow, or sometime next before 6 PM Germany time.

adamrupe · 2025-02-06T20:13:43Z

Awesome, thanks @cthoyt! That makes sense, and Jeremy and Richard have already shown me how to use tox a bit. I've pulled your changes and I'm going through them now. I'll add a test_to_hscm.

adamrupe · 2025-02-28T20:51:30Z

@cthoyt I've refactored HSCMs and filled in the HCM.to_hscm() tests, so all tests are now passing. However, there is a deprecation warning I'm getting from another part of the codebase:

src/y0/examples/__init__.py:1173: FutureWarning: 
Downcasting behavior in `replace` is deprecated and will be removed in a future version. 
To retain the old behavior, explicitly call `result.infer_objects(copy=False)`. 
To opt-in to the future behavior, set `pd.set_option('future.no_silent_downcasting', True)`
    asia_df = pd.read_csv(ASIA_PATH).replace({"yes": 1, "no": -1})
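
One way to avoid this warning without touching global pandas options is to map the labels column by column and cast explicitly, instead of relying on `replace` to downcast. The sketch below is illustrative only: `encode_yes_no` is a hypothetical helper that is not part of y0, and it assumes every cell in the data is either "yes" or "no".

```python
import pandas as pd


def encode_yes_no(df: pd.DataFrame) -> pd.DataFrame:
    """Map "yes"/"no" labels to 1/-1 in every column (hypothetical helper)."""
    mapping = {"yes": 1, "no": -1}
    # Series.map avoids the silent downcasting that DataFrame.replace warns about;
    # the explicit cast makes the resulting integer dtype intentional.
    return df.apply(lambda column: column.map(mapping).astype("int64"))


# Small stand-in for the real data, which is assumed to contain only "yes"/"no"
example = pd.DataFrame({"smoke": ["yes", "no"], "xray": ["no", "no"]})
encoded = encode_yes_no(example)
```

Because labels outside the mapping become missing values, and the cast to `int64` rejects missing values, an unexpected label raises an error rather than passing through unnoticed.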

There is also a mypy error from graph.py:

mypy: commands[0]> mypy --ignore-missing-imports --strict src/ tests/test_hierarchical.py
src/y0/graph.py:480: error: Function is missing a type annotation for one or more arguments  [no-untyped-def]
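
Under `--strict`, mypy reports `no-untyped-def` when a function has parameters without annotations, and the usual fix is to annotate every argument and the return value. The function below is a made-up stand-in, not the function at `src/y0/graph.py:480`, and only shows what the before and after might look like.

```python
from collections.abc import Iterable

# Before (one argument is left unannotated, so `mypy --strict` reports
# "Function is missing a type annotation for one or more arguments"):
#
#     def count_prefixed(names, prefix: str = "Q") -> int:
#         return sum(1 for name in names if name.startswith(prefix))


# After: every argument and the return value are annotated.
def count_prefixed(names: Iterable[str], prefix: str = "Q") -> int:
    """Count the names that start with the given prefix (hypothetical example)."""
    return sum(1 for name in names if name.startswith(prefix))
```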

Since I didn't write this code, I didn't want to make edits to fix these, but they seem like straightforward fixes.

I'll add a test for augment_collapsed_model, and Jeremy and I have discussed a high-level notebook that I'll create as well. This should then complete your requested changes. I'll ping again when they are all complete.

cthoyt · 2025-03-01T01:28:31Z

@adamrupe you should be unblocked on the CI/CD pipeline now. looking forward to seeing a nice case study notebook, then we can finish this PR!

adamrupe linked an issue Sep 6, 2024 that may be closed by this pull request

Implement hierarchical causal models from figure 2 in pygraphviz #232

Closed

cthoyt added the hierarchical causal models label Oct 29, 2024

djinnome assigned adamrupe Jan 24, 2025

djinnome requested review from cthoyt and Copilot January 24, 2025 00:00

Copilot AI reviewed Jan 24, 2025

src/y0/hierarchical.py (outdated)

djinnome marked this pull request as ready for review January 24, 2025 00:01

cthoyt reviewed Jan 24, 2025

pyproject.toml (outdated)

This comment was marked as outdated.

cthoyt reviewed Jan 27, 2025

tests/test_hierarchical.py (outdated)

cthoyt force-pushed the HCM-fig2 branch 2 times, most recently from 3237de1 to ef54579 Compare February 3, 2025 08:42

cthoyt requested changes Feb 3, 2025

cthoyt force-pushed the HCM-fig2 branch from a53f77f to 18d2ae4 Compare February 3, 2025 10:07

cthoyt mentioned this pull request Feb 5, 2025

Add tutorial on working in a team cthoyt/cookiecutter-snekpack#41

Open

adamrupe added 10 commits February 27, 2025 18:43

functions for creating and querying HCMs with pygraphviz

3c27650

functionality to collapse phygraphviz HCMs to nxmixedgraphs

3975ba4

added __all__

134202c

added test_hierarchical

e643395

pygraphviz dep and lint exception for hierarchical in pyproject

4079a0d

auto lint hierarchical

e8fb669

auto lint test_hierarchical

c76a6f9

added docstrings to hierarchical.py

9134417

added docstrings to test_hierarchical.py

833ed32

more ignores in pyproject.toml

6828af3

cthoyt and others added 21 commits February 27, 2025 18:43

Remove warnings

1d96ea6

Clean up variable names

73659a7

Reuse some example graphs

3483304

Update Hierarchical Causal Models.ipynb

6827b85

Keep track of stochastic variables

a6da4e4

minor commit before pull

6990df3

merge nb

cf08dfa

finish nb merge

036c531

change HCM attribute name stochastic to deterministic

d72ae02

fix to_hscm and update to_pygraphviz for exogenous variables

c0c1b30

add collapse_hcm_function

91bcf5f

refactor with HSCM subclass

26de0c0

add HSCM examples

369c960

add tests for HCM.to_hscm and HSCM.to_hcm

37b433c

lint

9d9214e

minor lint and bug fixes

8e5f2ce

minor bug fix for collapse_hcm

5be8134

trying fix for mypy bug

7c400c7

lint

bbd7e8f

try again

2a16fb4

minor clean

6319378

adamrupe force-pushed the HCM-fig2 branch from e4a9139 to 6319378 Compare February 28, 2025 00:43

cthoyt added 2 commits March 1, 2025 02:18

Merge branch 'main' into HCM-fig2

cd95f2b

Update tox.ini

3d2a39a

adamrupe added 4 commits March 5, 2025 14:25

raise exception if collapsing HCM with unobserved subunits

7006267

allow strings for marginal_parents in marginalize_augmented_model

ee87ec5

minor clean to HCM nb

4ccd6ff

add tests for augment_collapsed_model and lint

9d9563f
