A bit of an aside, this is a long-term goal I don't expect to be addressed right away. I have three use cases for Omero. I am optimistic that I can use Omero for my first two goals.
dpwrussell wrote:It sounds like an image from your perspective is very big and accessed by looking at one or many tiles, but from OMERO's perspective, it's still 1 logical image?
From Omero's perspective it would be a single huge image or volume.
To clarify my web URL does not serve tiles. The client asks for a region from a channel at an arbitrary resolution and the requested image or volume is returned. On my end I plan to explore storing the volumes on disk without using image file formats so I intentionally avoided the 2D tile interface.
dpwrussell wrote:If you don't generate these tiles up-front, presumably there is some primary data which is then used to generate the correct tiles depending on requests? I assume the primary data is also huge?
The primary data for a single section is ~1000 4096x4096 pixel images. Images overlap 15%. Transforms are generated for each image to create a single mosaic. Another registration transform is generated between adjacent mosaics to create a mosaic to volume transform. Then I generate a final transform to warp each original image into volume space.
An overview of registration:
http://nornir.github.io/overview/overview.html#mosaic-registrationdpwrussell wrote:Presumably you don't want to import the tiles into OMERO as if they were images in their own right? The tiles are just how you render the data? Is there anything really complicated about this tile generation which requires it be done in your system?
The tile generation is not complicated. It is possible for me to create a bunch of tiled tiffs and import those. The volume creation is complicated. Even after I finalize the volume registration if I decide to pre-generate the 256x256 tiles it takes weeks or months and the majority of full-res tiles never read. I farm calculations out to a primitive cluster where possible to speed the process.
My impression is that everyone manages their big data differently and the Omero server can't support every data pipeline. Long-term I feel that complementing BIO-Formats with a BIO-URL's initiative is the simplest solution. Interfaces that have reached a critical user mass is the feature I want to add. Putting my data on the server is only a means to that end.
Thanks