Load monitors separately from the remainder of the files #104

jl-wynen · 2024-02-28T08:14:52Z

Fixes #99

I only did this for the Loki NeXus files because we cannot split loading of the ISIS files so easily. And I don't think those matter for performance in the long run.

src/esssans/loki/general.py

nvaytet · 2024-02-28T09:21:34Z

src/esssans/loki/io.py

@@ -79,121 +82,169 @@ def _merge_events(a, b):

 def _merge_runs(
    data_groups: sciline.Series[
-        Filename[ScatteringRunType], LoadedSingleFileContents[ScatteringRunType]
+        Filename[ScatteringRunType], LoadedSingleFileDetector[ScatteringRunType]


Below, the name is saying this can be either detector or monitor data, but here we are saying it only applies to detector data?
I know it's just an internal function whose types are not checked when building the pipeline, but I still got a bit confused when reading.

nvaytet · 2024-02-28T09:29:44Z

src/esssans/loki/io.py

+        events = DETECTOR_BANK_RESHAPING[data_name](events)
+
+    dg[f'{data_name}_events'] = events
+    return dg


 def load_nexus(


Suggested change

def load_nexus(

def load_nexus_detector(

nvaytet · 2024-02-28T09:31:26Z

src/esssans/loki/io.py

-    data_entries = (detector_name, incident_monitor_name, transmission_monitor_name)
+) -> LoadedSingleFileDetector[RunType]:
+    return LoadedSingleFileDetector[RunType](
+        _load_events(filename, detector_name, transform_path, source_name, sample_name)


Maybe we should use keyword args to make sure we don't mix things up, such as swapping source and sample names?

A type checker should flag those errors. But I can change to keyword args

nvaytet · 2024-02-28T09:31:34Z

src/esssans/loki/io.py

+    sample_name: Optional[NeXusSampleName],
+) -> LoadedSingleFileMonitor[RunType, MonitorType]:
+    return LoadedSingleFileMonitor[RunType, MonitorType](
+        _load_events(filename, monitor_name, transform_path, source_name, sample_name)


Also kwargs?

nvaytet · 2024-02-28T09:32:06Z

src/esssans/loki/io.py

-providers = (load_nexus, to_file_contents, to_path)
+providers = (load_nexus, load_nexus_monitor, to_detector, to_monitor, to_path)
+"""Providers for loading single files."""
+event_merging_providers = (


SimonHeybrock

Can't this be simplified further?

SimonHeybrock · 2024-02-29T04:30:15Z

src/esssans/isis/data.py

@jl-wynen The things moved into this file do not belong here, as it is for the Pooch registry, is in all our repos.

SimonHeybrock · 2024-02-29T04:31:53Z

src/esssans/loki/general.py

 def get_detector_data(
-    dg: LoadedFileContents[ScatteringRunType], detector_name: NeXusDetectorName
+    dg: LoadedDetector[ScatteringRunType], detector_name: NeXusDetectorName
 ) -> RawData[ScatteringRunType]:
-    da = dg[NEXUS_INSTRUMENT_PATH][detector_name][f'{detector_name}_events']
-    return RawData[ScatteringRunType](da)
+    return RawData[ScatteringRunType](dg[f'{detector_name}_events'])


Why does this exist?

SimonHeybrock · 2024-02-29T04:32:09Z

src/esssans/loki/general.py

 def get_monitor_data(
-    dg: LoadedFileContents[RunType], monitor_name: NeXusMonitorName[MonitorType]
+    monitor: LoadedMonitor[RunType, MonitorType],
+    monitor_name: NeXusMonitorName[MonitorType],
 ) -> CalibratedMonitor[RunType, MonitorType]:
-    mon_dg = dg[NEXUS_INSTRUMENT_PATH][monitor_name]
-    out = mon_dg[f'{monitor_name}_events']
-    out.coords['position'] = mon_dg['position']
+    out = monitor[f'{monitor_name}_events'].copy(deep=False)
+    out.coords['position'] = monitor['position']
    return CalibratedMonitor[RunType, MonitorType](out)


Why does this function still exist?

SimonHeybrock · 2024-02-29T04:32:46Z

src/esssans/loki/general.py

 def detector_pixel_shape(
-    dg: LoadedFileContents[ScatteringRunType], detector_name: NeXusDetectorName
+    dg: LoadedDetector[ScatteringRunType],
 ) -> DetectorPixelShape[ScatteringRunType]:
-    return DetectorPixelShape[ScatteringRunType](
-        dg[NEXUS_INSTRUMENT_PATH][detector_name]['pixel_shape']
-    )
+    return DetectorPixelShape[ScatteringRunType](dg['pixel_shape'])


 def detector_lab_frame_transform(
-    dg: LoadedFileContents[ScatteringRunType],
-    detector_name: NeXusDetectorName,
+    detector: LoadedDetector[ScatteringRunType],
    transform_path: TransformationPath,
 ) -> LabFrameTransform[ScatteringRunType]:
-    return LabFrameTransform[ScatteringRunType](
-        dg[NEXUS_INSTRUMENT_PATH][detector_name][transform_path]
-    )
+    return LabFrameTransform[ScatteringRunType](detector[transform_path])


Can't all this be done in the detector load function?

Depends on whether you want this to be visible in the graph. Or whether it may need to be customised.
Note that I wanted to modify the existing structure as little as possible because I don't have an overview of the workflow.

jl-wynen requested a review from nvaytet February 28, 2024 08:14

nvaytet reviewed Feb 28, 2024

View reviewed changes

nvaytet approved these changes Feb 28, 2024

View reviewed changes

jl-wynen added 9 commits February 28, 2024 15:11

Fix import

d329012

Split loading of monitor and detector

6be48b3

Fix imports

56adb71

Fix tests

66ebac1

Fix types

10bc7b2

Remove unused arg

146a627

Clarify name

47e5fca

Use keyword args

fdb6621

Fix typehint

d5af45b

jl-wynen force-pushed the split-monitor-loading branch from 2197d3b to d5af45b Compare February 28, 2024 14:12

jl-wynen enabled auto-merge February 28, 2024 14:12

Fix symbol path

20454de

jl-wynen merged commit e3815f2 into main Feb 28, 2024
3 checks passed

jl-wynen deleted the split-monitor-loading branch February 28, 2024 14:45

SimonHeybrock reviewed Feb 29, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Load monitors separately from the remainder of the files #104

Load monitors separately from the remainder of the files #104

jl-wynen commented Feb 28, 2024

nvaytet Feb 28, 2024

nvaytet Feb 28, 2024

nvaytet Feb 28, 2024

jl-wynen Feb 28, 2024

nvaytet Feb 28, 2024

nvaytet Feb 28, 2024

SimonHeybrock left a comment

SimonHeybrock Feb 29, 2024

SimonHeybrock Feb 29, 2024

SimonHeybrock Feb 29, 2024

SimonHeybrock Feb 29, 2024

jl-wynen Mar 1, 2024

Load monitors separately from the remainder of the files #104

Load monitors separately from the remainder of the files #104

Conversation

jl-wynen commented Feb 28, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

SimonHeybrock left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment