Link INSDC to LOD federations #429

pbuttigieg · 2024-05-13T08:52:56Z

https://github.com/enasequence/ena-experiment-checklist/tree/main/data%2Fschema

Build ODIS-compatible specifications based on ENA's exploratory work, linked above.

pbuttigieg · 2024-06-13T13:22:32Z

@Woolly-at-EBI pinging you to follow up on our plan formed at DOF2024

Let's think about how to get ENA sequence metadata linked in via a schema.org/JSON-LD model.

Do you have example records that conform to your schemata?

Woolly-at-EBI · 2024-07-18T08:50:24Z

Yesterday, I talked with Colman (ENA product owner and also my line manager). He is happy for me to investigate and explore what needs to be done. So that is obviously good news.

At ENA there is a direction of travel to use much more JSON for input and output. The Swagger API allows JSON output. Bioschemas was created and is maintained by our sister group (we share offices).

The JSON schema for the sequence experiment metadata in the GitHub repository linked at the top was generated from a manually created JSON file (YAML would probably have been a better option!). That work is, as you note, exploratory. It is continuing in collaboration with the GA4GH team, and I will be running a short hackathon at the GSC conference to work more on the core terms. David B. and I think that we are almost there with the core terms now.

FYI: an aligned, small, active piece of development in the ENA ecosystem is a replacement checklist editor. It will make it easier to keep maintaining ENA sample checklists, with the requirement that we can also create and maintain other checklists, e.g. EGA sample checklists, Biosample sample checklists and, more relevant to this ticket, the ENA sequence experiment checklist. Indeed, I will be reviewing the requirements list again with the developers later today.

Woolly-at-EBI · 2024-07-18T08:57:59Z

So @pbuttigieg, I am thinking: what is the next useful step?
I have sequence experiment examples that pass this exploratory JSON-LD schema. The final solution may be similar or rather different. You are right, we have to start somewhere.
I will provide two mocked-up "real" examples here for 2 different experiment types, with the accompanying references to the actual sequence records. Aiming to do so by the end of 19th July.

pbuttigieg · 2024-07-18T17:14:50Z

@Woolly-at-EBI

> I am thinking what is the next useful step?

The steps to link to ODIS (and similar LOD-driven federations) are summarised here: https://book.oceaninfohub.org/gettingStarted.html

> I have sequence experiment examples that pass this exploratory JSON-LD schema. The final solution may be similar or rather different. You are right, we have to start somewhere.

I think we have to convert/adapt the JSON-LD schema you have to interoperate with schema.org semantics, as specified in the schema:Dataset type.

Some examples here, here and here.
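
As a starting point for discussion, here is a minimal sketch of what that adaptation could look like for a study record. The input field names (`study_accession`, `study_title`, `study_description`), the placeholder accession and the ENA browser URL pattern are assumptions made for illustration, not the agreed ENA output. Only the output terms (`Dataset`, `name`, `description`, `identifier`, `url`, `includedInDataCatalog`) are taken from vanilla schema.org.

```python
import json

# Assumed URL pattern for an ENA record landing page (illustrative only).
ENA_BROWSER = "https://www.ebi.ac.uk/ena/browser/view/"


def study_to_dataset(study: dict) -> dict:
    """Map a flat study record (hypothetical field names) to a schema:Dataset."""
    accession = study["study_accession"]
    landing_page = ENA_BROWSER + accession
    dataset = {
        "@context": {"@vocab": "https://schema.org/"},
        "@type": "Dataset",
        "@id": landing_page,
        "url": landing_page,
        "identifier": accession,
        "name": study["study_title"],
        "includedInDataCatalog": {
            "@type": "DataCatalog",
            "name": "European Nucleotide Archive",
        },
    }
    # Optional values are only emitted when present, so no placeholder
    # strings end up in the published record.
    description = study.get("study_description")
    if description:
        dataset["description"] = description
    return dataset


# Placeholder values, not a real ENA record.
example_study = {
    "study_accession": "PRJEB00000",
    "study_title": "Example marine metagenome study",
    "study_description": "",
}

print(json.dumps(study_to_dataset(example_study), indent=2))
```

Everything in the output is a plain schema.org type or property, so a generic harvester should not need any ENA-specific code to read it.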

> At ENA there is a direction of travel to use much more JSON for input and output. The Swagger API allows JSON output. Bioschemas was created and is maintained by our sister group (we share offices).

Bioschemas should be compatible with vanilla schema.org, but I've noticed some odd modelling in Bioschemas. I'd be a little careful, and use vanilla schema.org wherever possible.

> The JSON schema for the sequence experiment metadata in the GitHub repository linked at the top was generated from a manually created JSON file (YAML would probably have been a better option!). That work is, as you note, exploratory. It is continuing in collaboration with the GA4GH team, and I will be running a short hackathon at the GSC conference to work more on the core terms.

Perhaps we can align these activities: if we make sure the further development of this exploratory work dovetails with JSON-LD/schema.org compliance, we'll be creating something widely interoperable.

> FYI: an aligned, small, active piece of development in the ENA ecosystem is a replacement checklist editor. It will make it easier to keep maintaining ENA sample checklists, with the requirement that we can also create and maintain other checklists, e.g. EGA sample checklists, Biosample sample checklists and, more relevant to this ticket, the ENA sequence experiment checklist. Indeed, I will be reviewing the requirements list again with the developers later today.

This is both promising and a little concerning: if tooling is developed before a good data exchange model (i.e. data formats and semantics) is set, then organisations like the ENA are loth to change things, even if they don't interoperate with others. I would strongly encourage that we get the JSON-LD/schema.org modelling and templates settled first, and avoid INSDC- or ENA-specific types, properties, etc., which no other systems will understand without custom coding (xref the missing value story at the GSC).

Woolly-at-EBI · 2024-09-12T12:11:11Z

Finally started...
https://docs.google.com/document/d/19IoPj-Y0_J2ZRr6zr5jhJb438d_CjsNv9sjG53TEdhI/edit?usp=sharing

JSON generated for study(=project) and read_run objects.

Next steps:
- Make pilot JSON-LD for study
- Make pilot JSON-LD for read_run
- Generate sample-level metadata as JSON and then JSON-LD (will have to be selective about which metadata to include)
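
One possible shape for these pilots, as a rough sketch only: a read_run record becomes a schema.org `Dataset` that points back to its study via `isPartOf` and lists its files as `DataDownload` distributions, while sample metadata is first reduced to an explicit allow-list. The field names (`run_accession`, `study_accession`, `file_urls`), the allow-list contents, the placeholder accessions and the ENA browser URL pattern are all assumptions for illustration. How the selected sample fields map to schema.org is deliberately left open.

```python
import json

# Assumed URL pattern for an ENA record landing page (illustrative only).
ENA_BROWSER = "https://www.ebi.ac.uk/ena/browser/view/"

# Hypothetical allow-list: which sample fields to carry forward is undecided.
SAMPLE_FIELDS = ("sample_accession", "collection_date", "lat", "lon")


def select_fields(record: dict, allowed=SAMPLE_FIELDS) -> dict:
    """Keep only allow-listed fields that are present and non-empty."""
    return {k: record[k] for k in allowed if record.get(k) not in (None, "")}


def read_run_to_dataset(run: dict) -> dict:
    """Map a read_run record (hypothetical field names) to a schema:Dataset."""
    run_url = ENA_BROWSER + run["run_accession"]
    doc = {
        "@context": {"@vocab": "https://schema.org/"},
        "@type": "Dataset",
        "@id": run_url,
        "url": run_url,
        "identifier": run["run_accession"],
        "name": f"Read run {run['run_accession']}",
        # Link back to the study record by its @id.
        "isPartOf": {"@id": ENA_BROWSER + run["study_accession"]},
    }
    files = run.get("file_urls", [])
    if files:
        doc["distribution"] = [
            {"@type": "DataDownload", "contentUrl": url} for url in files
        ]
    return doc


# Placeholder values, not real ENA records.
example_run = {
    "run_accession": "ERR0000000",
    "study_accession": "PRJEB00000",
    "file_urls": ["https://example.org/ERR0000000_1.fastq.gz"],
}
example_sample = {
    "sample_accession": "SAMEA0000000",
    "collection_date": "2024-01-01",
    "lat": "",
    "depth": "10",
}

print(json.dumps(read_run_to_dataset(example_run), indent=2))
print(json.dumps(select_fields(example_sample), indent=2))
```

Using the study's landing page as the `isPartOf` target means the run and study documents link up by `@id` once both are harvested into the same graph.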

pbuttigieg · 2024-09-17T16:00:30Z

Thanks @Woolly-at-EBI - could we have examples with real values to develop from? I can then fit them into the right schema.org slots and they'll be more spottable.

pbuttigieg mentioned this issue May 13, 2024

Interoperability Bridge to ODIS and other web architectural systems enasequence/ena-experiment-checklist#4

Open

pbuttigieg mentioned this issue Jun 15, 2024

Discuss ASV portal and possible integration with ODIS architecture with NOAA/AOML researchers #433

Open

pbuttigieg added enhancement partners urgency-medium labels Jul 18, 2024

pbuttigieg mentioned this issue Sep 17, 2024

Create ENA examples for sequence data iodepo/odis-in#29

Open
