Dear David,
I've prepared an example of how to do this.  Please try downloading 
http://users.openmicroscopy.org.uk/~rle ... xample.zip and taking a look at the files here.  Also, please take a look at 
https://www.openmicroscopy.org/site/sup ... ml-fileset which explains some of the background to this solution.
In the zip, you'll see a collection of PNGs (sparse in T), and a corresponding set of TIFFs.  There is a script (gensparse) which generates this sparse PNG dataset to match your example (note: will only work on Linux, will need tweaking for other systems).  There's a brightfield channel at all timepoints, and then red and cyan channels every 4 timepoints, staggered by 2, starting at t1 and t3, respectively.
The script converts these PNG images into a multi-file OME-TIFF dataset.  Each PNG image is separately converted into a TIFF, and a small OME-XML fragment is embedded in each TIFF, containing a reference to the metadata file (multifile.companion.ome).  This file contains a complete record of each file in the dataset, and which timepoint and channel it refers to.  Note that each filename has a corresponding UUID which is used to reference it; we generate these in the script when we embed the OME-XML in each script, and then again when creating the companion metadata.  I included some ancillary metadata in the companion metadata as an example, which includes timestamps for each plane as well as exposure time and stage positions.  These are all optional; you can omit the plane information if you wish.
I also generated a blank plane for use in the red and cyan channels before the first acquired plane.  We then reuse the last acquired plane in each channel for timepoints between acquisitions for those channels, and because we refer to an already used TIFF this will be avoiding any duplication of pixel data when reusing it for following timepoints.  If you  load this OME-TIFF with bioformats showinf, or import into OMERO or ImageJ, you'll be able to navigate around the channels and timepoints and see how things are laid out.  In the example, I've overlaid text descriptions over each plane so you can see its position, and also added channel colours so they get false coloured in a viewer which supports this.
While the example I've used makes use of graphicsmagick to create the PNG files, and then to do the TIFF conversion and OME-XML embedding, these are just examples.  You can use any tool of your choice to do these steps, for example Bio-Formats bfconvert and tiffcomment.  I wrote this as a shell script so that the steps were clear, but again any scripting or other language of your choice could be used for these tasks.  There are also other possible solutions.  While this example uses a separate TIFF file per plane, you can adjust it to use one file per timepoint, or per channel, or have everything in a single file.  You just need to adjust the IFD numbers in the TiffData to point to the appropriate plane in the file.  If you go with a single file approach, you then don't need the companion metadata file; you can simply embed it in the single TIFF and the whole thing will be self-contained.
Also note that this example uses the 2016-06 data model, which is not yet in a released version of Bio-Formats.  You'll probably want to adjust this to use 2015-01 in your own files, which is the currently supported version (no other changes are required).
Though the script is unlikely to be useful to you without modification, I hope that this at least illustrates the concepts of how you could go about doing this, which should make it possible to adapt or reimplement to make something which works for your datasets.
If there's anything unclear or for which you would like some further explanation, or if this doesn't work for you, please do let us know and we can go into this in more detail.
Kind regards,
Roger