Bulk import of Perkin Elmer Operetta data
Posted: Mon May 13, 2019 7:54 am
Dear forum readers,
I have a problem importing Perkin Elmer Operetta data using the command line in-place import using bulk import and yaml.
My yaml file looks like this
My approach was to generate a filelist from the exported Operetta images folder, that contains the images as well as the metadata in the Index.idx.xml file.
My file list contains 3360 images and the Index.idx.xml file.
My screen consisted of 10 wells, 16 fields each, 7 z-slices, 3 colors. So the 3360 files are the correct number, and the imported files are correctly displayed inside the omero web gui.
When I start the import using:
Omero starts to import my images correctly. I get dataset that contains the images as well as plate layouts of the imported images / plate.
What is strange is that the importer does not stop once the dataset is imported, but imports the files repeatedly. I manually canceled the import now after 4 days. I got 22080 files imported in my dataset folder as well as the plate 138 times (names run 1 to 138).
Here is an image from my omero web interface:
[img]
[/img]
Inside my imported dataset I have entries that are called "Index.idx.xml [Well x, Filed x] that contain then many duplicated images.
Am I doing something wrong and do I have just to point the importer to the Index.idx.xml file instead to both the metadata and the images? Can I check somehow how this import was generated?
Thanks for the help,
Alex
I have a problem importing Perkin Elmer Operetta data using the command line in-place import using bulk import and yaml.
My yaml file looks like this
continue: "true"
transfer: "ln_s"
checksum_algorithm: "File-Size-64"
logprefix: "logs/"
output: "yaml"
path: "/OMERO/ManagedRepository/ipimp-54474.tsv"
columns:
- target
- path
My approach was to generate a filelist from the exported Operetta images folder, that contains the images as well as the metadata in the Index.idx.xml file.
My file list contains 3360 images and the Index.idx.xml file.
My screen consisted of 10 wells, 16 fields each, 7 z-slices, 3 colors. So the 3360 files are the correct number, and the imported files are correctly displayed inside the omero web gui.
When I start the import using:
- Code: Select all
/home/omero/OMERO.server/bin/omero import --bulk /OMERO/ManagedRepository/bulki-54474.yml --skip upgrade
Omero starts to import my images correctly. I get dataset that contains the images as well as plate layouts of the imported images / plate.
What is strange is that the importer does not stop once the dataset is imported, but imports the files repeatedly. I manually canceled the import now after 4 days. I got 22080 files imported in my dataset folder as well as the plate 138 times (names run 1 to 138).
Here is an image from my omero web interface:
[img]
[/img]
Inside my imported dataset I have entries that are called "Index.idx.xml [Well x, Filed x] that contain then many duplicated images.
Am I doing something wrong and do I have just to point the importer to the Index.idx.xml file instead to both the metadata and the images? Can I check somehow how this import was generated?
Thanks for the help,
Alex