Add JRC-IDEES data processing pipeline #133

brynpickering · 2021-07-16T14:47:47Z

First step to fixing #17 & #102 and blocking #119.

Now that the JRC IDEES database is available via FTP, we can roll the processing of the data into Euro-Calliope. This allows the transport, industry, and tertiary (commercial) sector data to be processed for the components that we need down the line. The processing takes the largely machine-unreadable data from JRC-IDEES and turns into tidy dataframes for use elsewhere.

Checklist

Any checks which are not relevant to the PR can be pre-checked by the PR creator. All others should be checked by the reviewer. You can add extra checklist items here if required by the PR.

timtroendle

Only very few comments about functionality, but a lot about structuring the code and readability.

I still think readability is important here. These kinds of data processing scripts are in most cases super difficult to read but structure and descriptions can ease the pain to some degree. You may not agree with all my comments and that's fine of course.

CHANGELOG.md

environment.yaml

lib/eurocalliopelib/utils.py

rules/data-processing.smk

scripts/jrc-idees/tertiary.py

scripts/jrc-idees/transport.py

timtroendle

Just a few more minor changes.

CHANGELOG.md

lib/tests/test_utils.py

scripts/jrc-idees/industry.py

scripts/jrc-idees/transport.py

…ipeline

brynpickering · 2021-07-20T15:54:32Z

Need to fix some bugs found on running the script before it's ready.

Add jrc-idees download & processing scripts/rules

cbc33a4

brynpickering requested review from timtroendle and FLomb July 16, 2021 14:47

brynpickering mentioned this pull request Jul 16, 2021

Import annual transport demand, as in 2.0 #119

Closed

timtroendle requested changes Jul 16, 2021

View reviewed changes

brynpickering added 4 commits July 19, 2021 12:45

Update following review

a25ed10

Fix tests

be6a140

Remove dataset wildcard from tertiary sector rule

a172209

Update changelog to include #113 updates

8daf5c5

brynpickering requested a review from timtroendle July 19, 2021 16:39

timtroendle approved these changes Jul 20, 2021

View reviewed changes

CHANGELOG.md Outdated Show resolved Hide resolved

lib/tests/test_utils.py Outdated Show resolved Hide resolved

scripts/jrc-idees/industry.py Show resolved Hide resolved

scripts/jrc-idees/transport.py Show resolved Hide resolved

scripts/jrc-idees/transport.py Show resolved Hide resolved

brynpickering added 2 commits July 20, 2021 16:50

Merge branch 'feature-sector-coupling' into add-jrc-data-processing-p…

cd1de48

…ipeline

Update following review

fdefdc8

Fix rules; add multithreading

63b81c1

brynpickering mentioned this pull request Jul 22, 2021

Update Snakemake to use mamba by default #142

Merged

5 tasks

Add more inline comments; Sentence case CHANGELOG

6e4441a

brynpickering merged commit 029656f into feature-sector-coupling Jul 22, 2021

brynpickering deleted the add-jrc-data-processing-pipeline branch July 22, 2021 12:12

jnnr mentioned this pull request May 5, 2023

Feature sector coupling #260

Draft

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add JRC-IDEES data processing pipeline #133

Add JRC-IDEES data processing pipeline #133

brynpickering commented Jul 16, 2021

timtroendle left a comment

timtroendle left a comment

brynpickering commented Jul 20, 2021

Add JRC-IDEES data processing pipeline #133

Add JRC-IDEES data processing pipeline #133

Conversation

brynpickering commented Jul 16, 2021

Checklist

timtroendle left a comment

Choose a reason for hiding this comment

timtroendle left a comment

Choose a reason for hiding this comment

brynpickering commented Jul 20, 2021