The CellMorph dataset
The CellMorph dataset is a publicly available dataset that supplements the paper Clustering phenotype populations by genome-wide RNAi and multiparametric imaging by Florian Fuchs, Gregoire Pau, Dominique Kranz, Oleg Sklyar, Christoph Budjan, Sandra Steinbrink, Thomas Horn, Angelika Pedal, Wolfgang Huber and Michael Boutros, published in Molecular Systems Biology, June 2010.
For more details on this paper, visit the online supplement.
A subset of this dataset has been prepared to serve as an example dataset, to demonstrate how Phaedra protocols are set up and how imports are performed.
This prepared dataset is 10GB in size, and can be downloaded here: cellmorph.zip
It contains, for a single plate readout named 001-01, the following data:
- Source images (16bit grayscale TIFF), one file per well per channel. E.g. Well B002.FITC.tif
- Segmentation overlays (8bit grayscale TIFF), one file per well. E.g. 001-01-B02_segmask.tif
- Measured cell data (tab-separated TXT), one file per well. E.g. 001-01-B02_ftrs.tab
Some additional notes:
- The images consist of multiple fields that are already stitched together in a rectangular mosaic.
- The folder structure has been arranged in a convenient format for Phaedra: a single folder per plate containing all data pertaining to that plate.
- The zip file also contains a file called cellmorph protocol template.txt. This is a protocol template, to be used when setting up the protocol (see next topic).
How to import
To import the dataset in a Phaedra environment, follow the steps outlined below.
- Download and unzip the dataset to a location on your computer
- Start Phaedra and log in to your environment
- Select File > Create new Protocol
- Select Template Wizard
- Select the Browse… button and select the file cellmorph protocol template.txt in the dataset
- Select Finish
If the protocol creation succeeds, you will get a message mentioning the name and ID of the new protocol. Now to import the dataset.
- Select Tools > Import Wizard. Make sure the two Prompt to create undefined features settings are selected.
- Select the Generic Importer
- In the first page (Select Import Type), just select Next
- In the second page (Select Source Folder), enter the path of the unzipped cellmorph folder and select Next
- In the third page (Select Experiment), select the CellMorph Demo protocol, then select the New… button to create a new experiment to import the dataset into
- Select Finish
The import will start and, depending on your hardware, take between 5 and 15 minutes to complete.
Configuring the protocol
With the protocol set up and the dataset imported, we could start exploring the data now. But the protocol’s configuration is still fairly minimal. Let’s improve the configuration a bit so we can access more advanced functionality later on.
Note: while we talk about configuring the protocol, we are actually editing the protocol class, which can be seen as the ‘blueprint’ of the protocol, and which contains all the configuration settings.
To edit a protocol class, follow these steps:
- In the Navigator view, open the Protocols section and double-click All Protocols
- In the list of protocols that appears, right-click CellMorph Demo and select Edit Protocol Class
Editing a protocol class can be a complex undertaking, as there are literally hundreds of settings to adjust. Here, we will focus on two things:
- Attach a calculation formula to the CellCount feature, so we can see a heatmap with the cell count of each well.
- Configure the ‘cell position features’, so Phaedra can calculate cell positions, allowing us to select cells directly on the well image.
Attach a calculation formula to the CellCount feature
- To show the list of well features, select the Well Features tab at the bottom of the page
- There should be two features listed after doing the import: CellCount and WellId. Select CellCount
- In the panel on the right, select the Calculation section
- Make sure the JEP language is selected, then select the Edit expression… button
- In the expression dialog, select the count(x) function, then select the int subwell feature. The formula should now look like this:
- Select Save to apply the formula and close the dialog
Note: actually you can select any subwell feature, because this formula only needs to know the number of values, not the values themselves.
Configure the ‘cell position features’
- Select the Subwell Features tab at the bottom of the page
- Select the x feature
- In the panel on the right, open the Position role list and select Center X
- Repeat these steps for the y feature and select Center Y for its position role
- Close the editor by clicking the close button on the tab at the top of the page. Select Yes when asked to save the changes
With the dataset imported and the protocol configured, you can now access a huge set of tools to inspect, calculate and validate the dataset. Take a look at the Application Tour to learn more about these functionalities!