
Add weights option to event_study #920


Open · wants to merge 32 commits into base: master

Conversation

@shapiromh (Contributor) commented May 28, 2025

This PR adds an (analytic) weights option to the did/event_study function and to all the DiD subclasses it currently calls (twfe, did2s, and saturated_twfe). The related issue is #919.

Changes in this PR:

  • The DiD2S class already accepted weights, so the new event_study function simply passes weights through to it. No new test was added for this change.
  • The TWFE class was opened up to accept weights, which are passed through to the underlying FEOLS call. One test was added on top of the existing TWFE event study tests.
  • The saturated event study was similarly opened up to accept weights. This required a substantive change so that its aggregation method uses the weights. New tests were added comparing it against fixest's sunab with weights.
  • The DID abstract base class now accepts weights as an argument. LPDID is the only DID subclass that still does not allow weights.
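The pass-through described in the bullets above can be sketched with toy classes. This is an illustration only, not the actual pyfixest code; the class and argument names only loosely mirror the real ones:

```python
# Toy sketch of the weights pass-through this PR implements (NOT the
# actual pyfixest code): event_study forwards an optional weights column
# name to whichever DiD subclass handles the requested estimator.
class TWFE:
    def __init__(self, weights=None):
        # Newly opened up: stored and later forwarded to the feols()-style call.
        self.weights = weights

class DID2S:
    def __init__(self, weights=None):
        # Already accepted weights before this PR.
        self.weights = weights

def event_study(estimator="twfe", weights=None):
    estimators = {"twfe": TWFE, "did2s": DID2S}
    return estimators[estimator](weights=weights)

fit = event_study(estimator="twfe", weights="pop")  # "pop": a hypothetical weights column
```

The point is only that the dispatcher itself needs no weighting logic; it forwards the column name and each subclass decides what to do with it.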

Matthew Shapiro and others added 25 commits May 17, 2025 11:09
@shapiromh (Contributor, Author)

I also added a fix for the saturated event study not being able to call summarize because of an out-of-date check in the summarize code.


codecov bot commented May 28, 2025

Codecov Report

Attention: Patch coverage is 90.00000% with 2 lines in your changes missing coverage. Please review.

Files with missing lines        Patch %   Lines
pyfixest/report/summarize.py    0.00%     2 Missing ⚠️

Flag            Coverage Δ
core-tests      78.36% <35.00%> (-0.11%) ⬇️
tests-extended  ?
tests-vs-r      48.84% <85.00%> (+33.42%) ⬆️

Flags with carried forward coverage won't be shown.

Files with missing lines            Coverage Δ
pyfixest/did/did.py                 85.00% <100.00%> (+0.78%) ⬆️
pyfixest/did/did2s.py               90.47% <ø> (+0.85%) ⬆️
pyfixest/did/estimation.py          98.70% <100.00%> (+18.43%) ⬆️
pyfixest/did/saturated_twfe.py      75.00% <100.00%> (+58.44%) ⬆️
pyfixest/did/twfe.py                80.76% <100.00%> (ø)
pyfixest/estimation/estimation.py   94.35% <ø> (ø)
pyfixest/report/summarize.py        87.40% <0.00%> (-0.33%) ⬇️

... and 11 files with indirect coverage changes


@s3alfisc (Member) commented May 28, 2025

Awesome, thanks so much! Will review this first thing tomorrow morning!

@s3alfisc (Member)

pre-commit.ci autofix

if weights is not None and use_weights:
    post = (
        df[df[treatment] == 1]
        .groupby([cohort, period])[weights]
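For context, here is a minimal, self-contained sketch of what the (truncated) snippet above appears to compute; the data and column names are made up for illustration:

```python
import pandas as pd

# Hypothetical panel: "treat" flags treated units, "w" holds the weights.
df = pd.DataFrame({
    "treat":  [1, 1, 1, 0],
    "cohort": [2000, 2000, 2001, 0],
    "period": [2000, 2001, 2001, 2001],
    "w":      [2.0, 3.0, 5.0, 1.0],
})

# Sum the weights over treated observations within each cohort x period
# cell -- the quantity the snippet's groupby appears to build for the
# later aggregation step.
post = df[df["treat"] == 1].groupby(["cohort", "period"])["w"].sum()
# post[(2000, 2000)] -> 2.0, post[(2000, 2001)] -> 3.0, post[(2001, 2001)] -> 5.0
```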
@s3alfisc (Member)

Wouldn't this be frequency weights? I.e. we sum up weights over all treated by cohort and period? 🤔

@shapiromh (Contributor, Author)

Hm, this is also the implementation in fixest. I didn't test, but I would guess the std. errors wouldn't match if we treated these as frequency weights.

@s3alfisc (Member) commented May 29, 2025

Based on the fixest docs: if use_weights = TRUE, the aggregation uses the weights, and the weights are treated as frequency weights:

#' @param use_weights Logical, default is TRUE. If the estimation was weighted,
#' whether the aggregation should take into account the weights. Basically if the
#' weights reflected frequency it should be TRUE.

So my understanding would be - use_weights = FALSE -> analytical weights, use_weights = TRUE -> frequency weights?
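To illustrate why this distinction matters, here is a small numpy sketch (my own illustration, not code from fixest or this PR) showing that a weighted mean is identical under both readings while the standard errors differ; the aweight formula follows the Stata-style rescaling of weights to sum to n:

```python
import numpy as np

rng = np.random.default_rng(0)
y = rng.normal(size=6)
w = np.array([1, 2, 3, 1, 2, 3])
n = len(y)

# The weighted point estimate is the same under either interpretation.
mean = np.average(y, weights=w)

# Frequency weights: row i stands for w_i identical observations,
# so the effective sample size is sum(w) = 12.
y_rep = np.repeat(y, w)
se_freq = y_rep.std(ddof=1) / np.sqrt(len(y_rep))

# Analytic weights (Stata-style): rescale the weights to sum to n,
# keeping the sample size at n = 6.
w_a = w * n / w.sum()
var_aw = (w_a * (y - mean) ** 2).sum() / (n - 1)
se_aw = np.sqrt(var_aw / n)

# se_freq and se_aw differ in general, even though the mean is identical.
```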

@shapiromh (Contributor, Author) left a comment

Thanks for the comments, let me make sure the weighted test covers the saturated twfe function and update the PR.

The "use_weights" issue is down to your taste, I think. I added it to match fixest, but I also don't see the purpose of a separate argument if the user already supplied weights.

I think the question of frequency versus analytic weights comes back to the discussion we were having about guessing which one fixest uses. At least, feeding the weights to feols as analytic matched the std. errors from fixest.


@s3alfisc (Member)

pre-commit.ci autofix

@s3alfisc (Member)

Now I think everything looks good. The only things I'd like to clarify before merging:

  • Does use_weights = True trigger fweights or aweights? If fweights, we need to update the docs to make this behavior explicit.
  • If fweights, should we maybe a) deviate from the fixest syntax and call the argument "weights_type" instead of "use_weights", and b) default to analytical weights? Since aweights are the default in feols(), I think that is the behavior users would expect.

@shapiromh (Contributor, Author)

Thanks, Alex. I checked and verified that the tests would fail if we were using "fweights" instead of "aweights". I had expected this, since you had mentioned that fixest isn't explicit about the types of weights it uses, but that it should be aweights.

As far as I can see, the sunab code only applies additional weighting in the way I already implemented here. I agree that both the docs and the usage make it fairly clear that the weights at this stage are used as "fweights", though I don't think this stage is ever seen by the underlying estimation routine that fits the model.

My best current guess is that the weights argument is used in both ways in fixest. It is used as aweights when running the core estimation routine. Then for aggregating from the cohort-period estimates to whatever the user specifies, it is practically treated as a frequency weight.

Finally, I realized that the "aggregate" function is not really user-facing in fixest. It is just called behind the scenes when the user specifies the "agg" argument while summarizing the estimation result (https://lrberge.github.io/fixest/reference/sunab.html). In the codebase I didn't find a case where use_weights = FALSE, so I am guessing the weights, if supplied in the initial model, are always used in the aggregation.

If my understanding of all this is correct, then I don't think there is a way to both match fixest's output and be consistent in pyfixest's use of the weights. If you want to match the output, then I guess the right thing is to remove the "use_weights" option and force the aggregation to use the weights as sampling weights. If matching fixest's output is not a requirement, the whole PR might need a rethink.
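To make the second stage concrete, here is a small pandas sketch (my illustration of the behavior described above, with made-up numbers) of aggregating cohort-by-period estimates into event-time effects, weighting each cell by its summed treated weights:

```python
import pandas as pd

# Hypothetical cohort x period coefficients plus the summed (analytic)
# weights of the treated units in each cell.
cells = pd.DataFrame({
    "cohort": [2000, 2000, 2001, 2001],
    "period": [2000, 2001, 2001, 2002],
    "coef":   [0.10, 0.30, 0.20, 0.40],
    "w_sum":  [10.0, 10.0, 30.0, 30.0],
})
cells["event_time"] = cells["period"] - cells["cohort"]

# Weighted average of cell coefficients within each event time -- at this
# point the summed weights act, in effect, like frequency weights.
cells["wc"] = cells["coef"] * cells["w_sum"]
sums = cells.groupby("event_time")[["wc", "w_sum"]].sum()
att = sums["wc"] / sums["w_sum"]
# att[0] -> 0.175, att[1] -> 0.375
```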

@s3alfisc (Member) commented Jun 1, 2025

Sorry for the delayed response, my parents were visiting over the weekend =)

My best current guess is that the weights argument is used in both ways in fixest. It is used as aweights when running the core estimation routine. Then for aggregating from the cohort-period estimates to whatever the user specifies, it is practically treated as a frequency weight.

This is also my understanding, and I wonder whether this leads to errors, because frequency-weight and analytic-weight SEs are different. I.e., when computing the vcov matrix, fixest uses "analytical" errors? There's also a slight chance that analytical and frequency errors have the exact same form and I still misunderstand something.

Maybe the best next step here would be to open a PR in the fixest repo and ask Laurent directly?

@s3alfisc (Member) commented Jun 1, 2025

Also linking to the Stata documentation that might be helpful / I need to take a closer look at later: link

@shapiromh (Contributor, Author)

Thanks, I'll also try to read through and get more clarity before pushing an update. I should have time again this weekend.

@shapiromh (Contributor, Author)

Sorry for disappearing on this one, Alex. Let me take a look this weekend.

@s3alfisc (Member)

Sorry for disappearing on this one, Alex. Let me take a look this weekend.

Hi, no worries at all! Thanks for the update!
