Bulk Data Import and Key:Value annotation
Posted: Mon Oct 22, 2018 4:35 pm
Hello everyone,
I have a question about the available tools and/or best practices for bulk import.
My use case: I have about 5000 images (each a 1-2gb slide scan) that I need to import into a new OMERO instance. These will fall into one Project, several hundred Datasets. I think I understand the Bulk Import option pretty well, and should be able to handle setting up import targets and creating the import configuration files programatically. What is less clear to me is how to handle annotating these files with key:value metadata (e.g., Species, tissue, gene, etc) in bulk. I haven't had much luck finding a clear solution in the documentation--the Populate Metadata script seems tailored toward annotation of screens.
There are several key:value pairs (falling under 2+ namespaces) that I could annotate files with at the same time as import, but there doesn't seem to be a straightforward way to include these annotations in the configuration file/settings, unless I am missing something. Nonetheless, I will still have to come back later and annotate all ~5000 images with an additional key:value pair in which the value with be unique to each image.
Are there any tools already available to, for example, take a tsv of image ids + key/value annotations and automatically populate? My backup plan is to write a tool, probably a shell script that uses the omero CLI, to do this very thing. But if I can save myself the work and/or follow best practices/conventions, I would prefer that.
I have a question about the available tools and/or best practices for bulk import.
My use case: I have about 5000 images (each a 1-2gb slide scan) that I need to import into a new OMERO instance. These will fall into one Project, several hundred Datasets. I think I understand the Bulk Import option pretty well, and should be able to handle setting up import targets and creating the import configuration files programatically. What is less clear to me is how to handle annotating these files with key:value metadata (e.g., Species, tissue, gene, etc) in bulk. I haven't had much luck finding a clear solution in the documentation--the Populate Metadata script seems tailored toward annotation of screens.
There are several key:value pairs (falling under 2+ namespaces) that I could annotate files with at the same time as import, but there doesn't seem to be a straightforward way to include these annotations in the configuration file/settings, unless I am missing something. Nonetheless, I will still have to come back later and annotate all ~5000 images with an additional key:value pair in which the value with be unique to each image.
Are there any tools already available to, for example, take a tsv of image ids + key/value annotations and automatically populate? My backup plan is to write a tool, probably a shell script that uses the omero CLI, to do this very thing. But if I can save myself the work and/or follow best practices/conventions, I would prefer that.