Expression Data Import Service¶
The Differential Expression Import Service facilitates upload of user-provided, pre-processed differential expression datasets generated by microarray, RNA-Seq, or proteomic technologies to the user’s private workspace. From there the data sets can be analyzed using annotations and analysis tools for comparison with other expression data sets in PATRIC. Currently, PATRIC only supports differential gene expression data in the form of log ratios, generated by comparing samples/conditions/time points.
Using the Expression Import Service¶
The Expression Import submenu option under the Services main menu (Transcriptomics category) opens the Expression Import input form (shown below). Note: You must be logged into PATRIC to use this service. The Expression Data Import Service can also be accessed via the PATRIC Command Line Interface (CLI).
Expression Import Menu
Expression Import Input Form
Experiment Data File¶
Allows upload a data file containing differential gene expression values in the form of log ratios. The file should be in one of the supported formats described below. Optionally, you may also upload metadata related to sample comparisons in the prescribed format to provide additional context for data analysis. See Optional Metadata.
File Format: Currently, PATRIC allows you to upload your transcriptomics datasets in the form of differential gene expression measured as log ratios. Data can be uploaded in multiple file formats: comma separated values (.csv), tab delimited values (.txt), or Excel (.xls or .xlsx). Click to download the Sample Data template in Gene Matrix Format. Files should contain data in one of the following formats:
Gene Matrix: Gene IDs are represented in the first column with extra columns for each of the comparisons in the form of log ratio, i.e., log2(test/control). Below is an example of transcriptomics data in Gene Matrix format:
Sample Gene Matrix
Gene List: Data are represented in three columns: Gene ID, Comparison ID, and Log Ratio of expression value, i.e., log2(test/control). Below is an example of transcriptomics data in Gene List format:
Sample Gene List
Dropdown list specifying the the experiment type, either Transcriptomics, Proteomics, or Phenomics.
Option for specifying if the data set is for bacteria/archea, or alternatively, eukaryotic host.
Source ID Type¶
Dropdown list for specifying the source ID type, e.g., PATRIC ID, NCBI Gene ID, RefSeq Locus Tag. Used to map genes to genes in PATRIC, allowing integration of other PATRIC data in PATRIC into the analysis. Due to differences in annotation that may exist, some genes may go unmapped. Unmapped genes will be excluded from subsequent analysis.
- RefSeq Locus Tag
- PATRIC Feature ID
- NCBI GI Number
- NCBI Protein ID
- SEED ID
- PATRIC Legacy ID
Option for mapping the expression data to a particular genome in PATRIC. This will facilitate inclusion of other expression data in PATRIC for that genome in subsequent analysis.
Metadata to specify the experiment title, description, organism, and Pubmed ID (optional) for providing contextual information for the uploaded data in PATRIC.
Title for the experiment (required).
Description of the experiment (required).
Taxonomic name of the organism associated with the uploaded data.
Pubmed ID associated with the uploaded data, if available (optional).
Folder in the Workspace where the uploaded data file will be located. The default is /home/Experiments.
Allows upload of an additional file containing key metadata attributes about the comparisons. This will enhance your analysis when using the heatmap and clustering tools. The metadata table can be uploaded as a tab/comma-delimited or Excel file. Note: The column names should be the same as in the example template. Also, comparison IDs in the metadata file should match those in the data file. Click to download the Metadata Template.
Expression Import Service Output Files
The Expression Import Service generates several files that are deposited in the Private Workspace in the designated Output Folder. These include
- expression.json - JSON format file containing log-ratio expression values by feature.
- mapping.json - JSON format file containing details of the mapping of genes/proteins in the uploaded file to PATRIC Feature IDs.
- sample.json - JSON format file containing the experiment conditions.
View Experiment Conditions¶
Expression Import View Icon
Clicking on the View icon at the top right of the page will display a page with each of the conditions that are part of the experiment.
Expression Import Conditions Table
Selecting one of the Conditions and clicking the Genes icon in the vertical green Action Bar on the right side of the table will display a table of the genes and how they are regulated for that condition. See Transcriptomics Tab for more information on how to use this table and the associated heatmap.
View Experiment Results Across All Conditions¶
Expression Import Experiment Icon
Clicking on the Experiment icon at the top right of the page will display a page with all of the results for all of the conditions.
Expression Import Genes Table
See Transcriptomics Tab for more information on how to use this table and the associated heatmap.