Description
This is a continuation from #406, but with a slightly expanded scope.
I would like to add an open source datasets page to ORIX, similar to Kikuchipy or Pyxem. In particular, I'm thinking of three useful datasets:
-
the US Air Force Research Lab AF96 datasests, six 2100 by 1000 ebsd scans of a Martensitic steel which are often split up into a set of 90 overlapping 512 by 512 scans. Available through Globus, uses a CC-BY 4.0 license
-
the Dream3d IN100 dataset of serial sectioned 3d EBSD scans, 189x189x117 pixels in size, stored as 117 .ang files. Available through the BlueQuartz websit, has a BSD open source license
-
The MTEX ebsd files, used in all the MTEX examples. available through github, has GPL license
The EASY thing would be to just add an open_databases.rst page that looks something like this, but better, ideally with a few pictures (heavily copied from pyxem):
===========================
Open datasets and workflows
===========================
Here are some open datasets which are helpful for running some of the functions using real data:
#. `AF96 Martensitic Steel < https://doi.org/10.18126/iv89-3293>`_ A collection of 6 EBSD scans, all 2 million pixels in size, of AF96 Martensitic steel. Details on the exact composition, preperation, and collection of this data can be found in the following two publications:
https://doi.org/10.1016/j.dib.2019.104471
https://doi.org/10.1016/j.matchar.2019.109835
#. `Dream 3D's Inconel 100 serial section scans <http://dream3d.bluequartz.net/Data/>`_, A set of 117 2D ang files which represent a 3D 189x189x117 pixel cubical EBSD dataset.
#. 'MTEX's EBSD example ebsd files<https://github.com/mtex-toolbox/mtex/tree/develop/data/EBSD>`_
However, I think the problem here is new users want something they can learn with immediately, as opposed to learning ORIX's IO and having to fiddle with different import methods until they find the right way to import files into CrystalMap objects. Additionally, Globus is a massive pain to get downloads from (this is why i actually made the original PR, it was far too inconvenient to get the AF96 datasets for new users).
In this respect, as a new user, I loved how in MTEX I could just type `mtexdata ferrite' and I instantly had an MTEX ebsd object. Not an ang, or a .oim I had to then correctly import, but an actual pre-imported object.
To that end, for at least the first two examples, it think it would be useful to have ORIX .h5 versions as files hosted on Zenodo, then include a snippet of code in the download example that can download and then import those files. Bonus if it imports _fetcher
from orix.data so that duplicats of files aren't downloaded, and can just be quickly loaded from the local cache.
Thoughts? @hakonanes @pc494