
Add the option to slice out straws/tubes/layers/pixels #57

Closed · wants to merge 3 commits

Conversation

nvaytet (Member) commented Jan 29, 2024

A request from users: while playing around with the direct-beam iterations, it would be nice to be able to run the workflow on just a single layer, tube, etc., to speed up development.

This PR adds the option to slice out the data at the correct point in the workflow, in a way that retains identical results.
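For illustration, a minimal sketch of the kind of slicing referred to here, assuming the detector data is a scipp DataArray with logical dimensions such as 'layer', 'tube', and 'pixel' (the actual dimension names in the workflow may differ):

import scipp as sc

# Dummy detector-shaped data standing in for the masked workflow data.
data = sc.DataArray(sc.ones(dims=['layer', 'tube', 'pixel'], shape=[4, 32, 512]))

single_layer = data['layer', 2]   # select one layer, keeping 'tube' and 'pixel'
some_tubes = data['tube', 0:8]    # select a contiguous range of tubes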

Comment on lines 36 to +37

  def solid_angle(
-     data: CalibratedMaskedData[RunType],
+     data: SlicedMaskedData[RunType],
Member:

I am a bit unhappy about this change. I think the previous version, which said "we compute the solid angle based on calibrated data", clearly expressed the behavior and the intent of the pipeline. As it is now, it could be anything. If we later change the order of slicing and calibration, this will be completely broken.

Member Author:

Would simply changing the name to SlicedCalibratedMaskedData help or is that not what you mean?
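For concreteness, a sketch of what such a renamed domain type could look like, assuming the sciline Scope pattern used for run-generic types; the actual definition in the package may differ:

from typing import TypeVar

import sciline
import scipp as sc

RunType = TypeVar('RunType')

class SlicedCalibratedMaskedData(
    sciline.Scope[RunType, sc.DataArray], sc.DataArray
):
    """Calibrated, masked detector data restricted to the requested slice."""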

Member:

That would help partially. I still wonder, though, whether this correctly expresses the intent. The solid angle could be computed before or after slicing, so is this choice due to cost?

@@ -200,7 +201,7 @@ def calibrate_positions(
  # for RawData, MaskedData, ... no reason to restrict necessarily.
  # Would we be fine with just choosing one option, or will this get in the way for users?
  def detector_to_wavelength(
-     detector: CalibratedMaskedData[RunType],
+     detector: SlicedMaskedData[RunType],
Member:

See above: we are losing the information that this is calibrated.

Could the slicing be combined with masking, in terms of the graph structure, to avoid this?

Member Author:

I think the slicing has to be done after the masking: we need all the detector pixels to run the beam center finder, and the masking needs to be done before running the beam center finder.

If we run the beam center finder on a single layer we will get a different offset.
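To make the ordering constraint concrete, here is a hypothetical provider sketch in which slicing consumes data that is already calibrated and masked, so the beam center finder always sees the full detector. DetectorSlice, the import paths, and the slicing parameter are assumptions and may not match the PR's actual implementation:

from typing import NewType, Tuple

# Assumed to live in the package's domain-type module; paths may differ.
from esssans.types import CalibratedMaskedData, RunType
from esssans.types import SlicedMaskedData  # the type added by this PR

DetectorSlice = NewType('DetectorSlice', Tuple[str, int])  # e.g. ('layer', 2)

def slice_detector(
    data: CalibratedMaskedData[RunType],
    selection: DetectorSlice,
) -> SlicedMaskedData[RunType]:
    # Positional slicing along the requested logical dimension keeps the
    # calibration and masks of the selected layer/tube/straw intact.
    dim, index = selection
    return SlicedMaskedData[RunType](data[dim, index])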

Member:

Is there actually a compute cost reason for changing this in the workflow? Or could we consider, e.g., returning reduced data per pixel (or in event mode)? Maybe that would take too much memory?

Member Author:

As mentioned in the PR description, it was because of performance. When Judith asked for this, I said she could just inspect the results at the end for a single layer or tube, but she said it would be nice if it were faster when playing around with parameters.

I am open to alternatives, though.

Member (@SimonHeybrock), Jan 30, 2024:

Hmm, isn't loading the files generally the slowest part? If we slice in the workflow, it will mean the complete files are loaded N times if we want to look at N tubes?

Edit: If this is mainly about the direct-beam iteration I suppose file-load is not the performance bottleneck. Would it be possible to load files, make a selection, and call the direct-beam workflow on that?

Member Author:

You can always cache parts of the workflow by setting an intermediate result on the pipeline, e.g.:

res = pipeline.compute(CalibratedMaskedData[SampleRun])
pipeline[CalibratedMaskedData[SampleRun]] = res

But I am not sure whether this is dangerous and should not be recommended to users?

> If this is mainly about the direct-beam iteration I suppose file-load is not the performance bottleneck.

Note that we are calling the pipeline inside a loop (and in fact we are computing it twice inside the loop: once for the full wavelength range and once for the wavelength bands). I think that means we are loading all the files on every iteration.
We should probably have better caching inside the iteration loop?

Presumably we can cache the solid angle, the numerator, the transmission fraction and the monitors, and only evolve the direct beam?
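A rough sketch of what such caching inside the direct-beam iteration could look like, using sciline's mechanism of pinning a computed value on the pipeline. The domain-type names (SolidAngle, TransmissionFraction, DirectBeam, IofQ, SampleRun) and the update step are placeholders and may not match the actual workflow:

# Pin expensive intermediates once so they are not recomputed (nor the files
# reloaded) on every iteration of the direct-beam loop.
for key in (SolidAngle[SampleRun], TransmissionFraction[SampleRun]):
    pipeline[key] = pipeline.compute(key)

# direct_beam, n_iterations and update_direct_beam are placeholders for the
# actual iteration state and update step.
for _ in range(n_iterations):
    pipeline[DirectBeam] = direct_beam          # current direct-beam estimate
    iofq = pipeline.compute(IofQ[SampleRun])    # only the un-pinned part re-runs
    direct_beam = update_direct_beam(iofq)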

Member Author:

I'm guessing a similar thing can be done to speed up the beam_center_finder_from_iofq?

Member (@SimonHeybrock), Jan 30, 2024:

I am just trying that, but it only helps a bit (3 seconds per iteration). There is another source of bad performance in the wavelength-bands pipeline; I am trying to track that down now.

In other words: we should check whether this request from the scientists is actually an XY problem (this solution was suggested because performance is bad).

Member Author:

My guess is that the merge_spectra function is the bottleneck: https://github.com/scipp/esssans/blob/main/src/esssans/i_of_q.py#L238

Member:

Yes, you are correct. But it is not the event-mode one, just the histogram mode. I have some ideas for how to improve it and will try to look into that tomorrow.

@SimonHeybrock mentioned this pull request Jan 31, 2024

Member (@SimonHeybrock): I think this has been superseded; please reopen if I am wrong.
