Update docs #40
No reviewers
Labels
No labels
bug
documentation
duplicate
enhancement
good first issue
help wanted
invalid
question
wontfix
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
inm7/inm-icf-utilities!40
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "docs-rewrite"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
This is a rewrite of the documentation, which shifts the "dataladification" process from admin docs to user docs. This reflects a change in process compared to initial plans: while DataLad datasets can be created by ICF and placed alongside tarballs, this step will most likely be done by users on their institute local infrastructure.
Consequently, the user docs chapter is now organised into the following sections, and assumes only DICOMs are available from the ICF:
A new chapter, Reference, is added, and groups two sections:
Administrator docs chapter is shortened to a minimum, and describes dataladification steps as optional, linking to the reference parts above.
Majority of the old text is retained with minor changes. Apart from restructuring, the biggest change is the addition of "Generate DataLad datasets" from the point of view of a local user.
Note: please keep the PR open, aiming to finish it today.
It would be an awesome test of the updated docs if someone with access credentials to the test data, but a hazy memory of details, builds the docs and tries to generate and then clone a DataLad dataset from DICOMs.
@ -0,0 +63,4 @@Information required to create a DataLad dataset needs to be extractedfrom the tarball:Following through the docs sequentially, I don't think I've come across this singularity image before. I think it would make sense to link to its download page here.
@ -0,0 +1,142 @@.. _dl-generate:I believe it should go inside the local store?
@ -0,0 +1,142 @@.. _dl-generate:?
@ -0,0 +1,142 @@.. _dl-generate:Re-reading this paragraph many times, I feel like I'm not 100% sure what it is telling me. Maybe one introductory sentence in addition helps. Is the gist something like this?
@ -0,0 +1,142 @@.. _dl-generate:I think the command also misses the
--idparameter and placeholders? I'm getting this when running it:@ -0,0 +1,142 @@.. _dl-generate:@ -0,0 +1,142 @@.. _dl-generate:sorry for the flood of comments, I'm realizing more and more things as I'm walking through - I was expecting this to generate a dataset based on the heading, but it doesn't create a standard dataset on my system - just the lightweight representation. Maybe we can reflect this in the heading and description, eg with by placing "dataset" in air quotes or calling it lightweight dataset representation already at the start?
@ -0,0 +115,4 @@singularity run -B $STORE_DIR icf.sif catalogify_studyvisit_from_meta \--store-dir $STORE_DIR --id <project-ID> <visit ID>I think it would be nice to mention that this catalog needs to be subsequently served, or at least point to the README for further instructions - I naively expected the index.html page to display something and initially thought something was wrong.
@ -0,0 +36,4 @@.. code-block::'datalad-annex::?type=external&externaltype=uncurl&encryption=none&url=file:///data/group/groupname/local_dicom_store/my-study/P000123_{{annex_key}}'I think it would be nice to have an actual fully clone example given here:
@ -0,0 +36,4 @@.. code-block::'datalad-annex::?type=external&externaltype=uncurl&encryption=none&url=file:///data/group/groupname/local_dicom_store/my-study/P000123_{{annex_key}}'It may also be worth a note that this command essentially never fails. If I mistype the URL, cloning succeeds, but it tells me
Which makes it sound like its the dataset's issue, when it just stemmed from a non-existent URL
Sorry for missing this - I was neck-deep in too many projects when this PR was posted. I gave it a try as you suggested, and was able to build and clone a repo using my ICF credentials. I left comments along the way as a review.
In general though, this walk-through works, and I feel its a really great addition to have this info. Thanks much for taking the time to write it up!
Thanks a lot for a thorough review, I will go through the docs again myself and apply the suggestions into account.
@ -0,0 +63,4 @@Information required to create a DataLad dataset needs to be extractedfrom the tarball:The 3rd paragraph of this page says:
where "DataLad requirements" is a link to a page that describes things in greater details (and is actually positioned earlier in the User Guide), and links to containerized execution page.
However, your comment makes it apparent that I didn't do a good enough job when trying to compartmentalize the docs (to avoid repetition), and I will add a sentence of two to make up for it.
By the way, this points to a small design issue with the tooling. Initially, the Singularity image was just for ICF. ICF would only use DataLad through the scripts in this image. Users would not need the Singularity image, they would clone datasets from ICF using DataLad.
Now, users who want to dataladify datasets using the Singularity image still need to download the tarballs somehow. I decided to suggest
datalad downloadfor the task, because it interacts with DataLad credentials, that would also be needed for any subsequent dataset content retrieval from ICF. Alternatively, we could recommendcurl -ufollowed by Singularity (no need to install DataLad), ordatalad downloadfollowed by running scripts from this repo (no need for Singularity). The former seems unsatisfactory, because any further dataset interaction would need to happen through DataLad anyway. The latter seems unsatisfactory because the Singularity image was introduced to make the ICF tooling independent of changes in DataLad.@ -0,0 +1,142 @@.. _dl-generate:No need to apologize; thanks a lot for these comments. I agree with the points you make and will make changes accordingly (without using the suggestions directly).
@ -0,0 +1,142 @@.. _dl-generate:Will do that, but without mixing placeholders and values 😉
@ -0,0 +36,4 @@.. code-block::'datalad-annex::?type=external&externaltype=uncurl&encryption=none&url=file:///data/group/groupname/local_dicom_store/my-study/P000123_{{annex_key}}'Good point, clone from datalad-annex urls does that (related: https://github.com/datalad/datalad-next/issues/373). I'll add a note.
@ -0,0 +1,142 @@.. _dl-generate:Lol, the double space in the argument makes it download to
local_dicom_storeinstead oflocal_dicom_store.I am not a huge fan of how
datalad downloadworks with<path>|<url>|<url-path-pair>as an individual argument, but I guess it is a way to make it work with multiple pairs at once@ -0,0 +1,142 @@.. _dl-generate:oh no.... :o
I addressed the suggestions, I'll leave it open for a while and then mark conversations as resolved. Given previous approval, I understand I can merge then.