Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Epic: Support the 5 core persistent identifiers in Dataverse #19

Closed
mreekie opened this issue Jan 18, 2023 · 10 comments
Closed

Epic: Support the 5 core persistent identifiers in Dataverse #19

mreekie opened this issue Jan 18, 2023 · 10 comments

Comments

@mreekie
Copy link
Collaborator

mreekie commented Jan 18, 2023

This deliverable defines our support for the 5 core pids.

For most of the 5, we offer support already.

  • we will document what that support is.

For the others:

  • We will define how we intend to support it right now.
  • We will implement that support.

This came out of our meeting on Jan 31st, 2023. We discussed how we are going to define our support for each of these PIDs through a definition of "we support this type of data", "for this type of field"

image

Includes...


Just a reminder that this is a "deliverable" issue. It does not go on the sprint board. It's a group of other issues that represent the delivery of some objective for us to get to or functionality delivered.

@mreekie
Copy link
Collaborator Author

mreekie commented Jan 18, 2023

First step is to get with Julian and make sure that breaking this out is the right thing.

@mreekie mreekie changed the title Complete the implementation of the 5 core PIDs for powering Findability Meeting the principles of using the 5 core persistent identifiers in Dataverse Jan 20, 2023
@mreekie mreekie changed the title Meeting the principles of using the 5 core persistent identifiers in Dataverse Support the 5 core persistent identifiers in Dataverse Feb 7, 2023
@mreekie
Copy link
Collaborator Author

mreekie commented Feb 7, 2023

monthly report

  • We had a meeting on the general topic of providing support for the 5 types of PIDs.
  • This issue was the outcome of that meeting.

In terms of defining support, the most activity is surrounding ROR and Fundref so they will be the easiest to express our support for.

In terms I can understand it, the outcome of the meeting was that there is only so much generalization you can make about these different persistent identifiers (PIDs), so we will need to note our support for each one individually.

Some are general purpose and have the primary intent of providing a globally unique identifier to something, where that "thing" is just about anything.

In other cases, the identifier has an element that typically might go in a form such as a label from a controlled vocabulary that means something specific to a human, but then it may also have additional fields associated with that primary data. e.g. meta data on the meta data. Really meta. I think Orcid is an example where the human readable portion is just one piece. There is also a unique identifier associated with it, and I think perhaps a URL and even other fields.

The unique identifiers cannot be put in a single bucket either. Some are URIs, some are alphanumeric.

The human readable fields will also be internationalized.

In terms of usage in fields within Dataverse, you cannot assume too much either because of the pre-existing datasets. There are many cases where a field intended to be used by one particular PID is instead populated by a different one in existing data.

There are decisions to be made about in what fields on Dataverse forms in the UI that you expect PIDs to be used in. The field/form combination might impact what you display and what you save from form to form. e.g. Some entries may allow for the human readable portion of the information to go on the form, while a unique identifier and other data can be added to the DV database without being shown. In other cases, there will be no "extra" fields and only the human readable text will be saved to the form and to the database.

Which PIDs can interact with which fields on DV forms, can possibly be controlled by extending an existing system that uses JSON files to do the mapping.

@mreekie
Copy link
Collaborator Author

mreekie commented Feb 7, 2023

There is general agreement that we have "good" support for:

  • DOIs
  • ORCID

@mreekie mreekie transferred this issue from IQSS/dataverse Mar 3, 2023
@mreekie mreekie added the D: Dev Deliverable to be measured as defined for development label Mar 3, 2023
@mreekie mreekie changed the title Support the 5 core persistent identifiers in Dataverse Deliverable: Support the 5 core persistent identifiers in Dataverse Mar 14, 2023
@mreekie mreekie added bklog: Deliverable D: 5 Core PIDs and removed D: Dev Deliverable to be measured as defined for development labels Mar 22, 2023
@cmbz
Copy link
Contributor

cmbz commented May 1, 2023

Develop a plan to address the work needed to move this epic forward. Note: re-architecture work should take into account what is planned here (e.g., process used to change UI for metadata fields)

  • Identify & review different metadata fields related to the different PIDs
  • Scope work
  • Coordinate with relevant NIH-GREI subcommittees/working groups (Metadata and Metrics subcommittees) as needed (their recommendations may inform our approach)
  • Contributors include: Julian, Sonia, Gustavo

@jggautier
Copy link

A Google Slide at https://docs.google.com/presentation/d/1PtqmEzAamuM2__V8psOIetgNODPQxjqSEOuxL3kAV-Y summarizes what support means and which types of metadata are and aren't supported in some way. I'm hoping this helps scope the work.

@cmbz
Copy link
Contributor

cmbz commented Sep 25, 2023

2023/09/25: Will require a proposal to scope the investigation and identify a set of tasks. We will need to split this issue into some sub-issues that can be sized, prioritized, and implemented.
See issue: IQSS/dataverse.harvard.edu#230

@cmbz cmbz changed the title Deliverable: Support the 5 core persistent identifiers in Dataverse Epic: Support the 5 core persistent identifiers in Dataverse Jan 8, 2024
@cmbz
Copy link
Contributor

cmbz commented Jan 9, 2024

2024/01/08

@jggautier
Copy link

The NIH-GREI's metadata subcommittee and the Dataverse UX WG have done work to address the tasks outlined at #19 (comment), are continuing to work on those tasks, and that work is being tracked in other GitHub issues.

I'm going to close this GitHub issue.

@cmbz
Copy link
Contributor

cmbz commented Aug 19, 2024

@jggautier are these the additional issues that you/UX working group are addressing? If not, could you add them to the epic, please, and add a description to the issue itself? Thanks.

@jggautier
Copy link

jggautier commented Aug 19, 2024

Hi @cmbz. Yeah the UX WG is planning to address part of #127 during its second design sprint about improving metadata about research objects that are related to datasets, like related journal publications that often have DOIs and Handles.

I think other parts of extending and enhancing support for PIDs and Handles (#195) include PIDs for dataset versions and for funding awards and grants (as opposed to funding organizations). In the PR at IQSS/dataverse#9462 I see discussion and work about PIDs for dataset versions. I think PIDs for funding awards and grants requires more discussion and hasn't been a priority for the NIH-GREI groups and the Dataverse UX WG.

The first design sprint that the Dataverse UX WG is in the middle of now involves improving support of RORs and ORCIDs for describing people and organizations associated with datasets. I've been using the GitHub issue at #127 and the "Epics and Issues" listed there to describe that work.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants