Common Tasks Using the Web Interface

In this tutorial, we will present examples of common tasks and show how to accomplish them using the PATRIC Web Interface. To keep this page manageable, we link to instructions on separate pages.

Working with Sequences

Assemble a Set of Reads into Contigs

PATRIC provides an assembly service that allows you to choose between multiple assemblers, using either single-read libraries, paired-end reads, or reads stored online in the NCBI Sequence Read Archive. We have an introductory tutorial and a more detailed guide.

Create an Annotated Genome from a set of contigs and Estimate its Quality

PATRIC provides an annotation service that constructs a private genome from contigs and produces a quality report. See Genome Annotation for instructions on submitting the contigs, and Analyzing Genome Quality to read the quality report. A sample contigs input file and the resulting job output can be seen here. Note that most of the links in the Genome Report will not work because all annotated genomes are private to the annotating user.

Extract as Many Complete Genomes as Possible from a Meta-Genomic Sample

The /user_guides/services/metagenome_binning_service provides this function. For a tutorial, see Using the PATRIC Metagenomic Binning Service. Example input files for binning and their results can be found here.

Annotate a New Genome and Compare it to Similar Genomes in PATRIC

This combines several of the tasks listed elsewhere on this page. The Comprehensive Genome Analysis service takes either reads or contigs and creates a private genome integrated into the PATRIC database. It also creates a web page describing the genome and places it into a phylogenetic tree. There is a tutorial on how to use this service here. In Comparing Your New Genome to Existing Neighbors we show how to look at the results, find the closest genomes, and compare the protein families.

Working with Taxonomic Groups

Find Representative Genomes for a Taxon

NCBI identifies Reference and Representative genomes for each clade and species. To these lists, PATRIC adds UniProt reference genomes, and additional genomes selected by manual curation. Because of their high quality, they are a useful subset for doing certain types of analysis.

For the procedure to list the genomes in a taxonomic grouping, see Locating All Genomes for a Taxonomic Group. Instructions for isolating the Reference and Representative genomes are given on the same page under the heading Finding the Representative and Reference Genomes.

Find Genomes in a Taxon with Specific Attributes

For the procedure to list the genomes in a taxonomic grouping, see Locating All Genomes for a Taxonomic Group. Instructions for filtering the list are given on the same page under the heading Filtering the Genomes.

Working with Genomes

Find the Roles Present in a Genome

This is described at Finding the Roles in a Genome.

Compare the Proteomes for a Set of Genomes

This is described at Proteome Comparison: Comparing annotated proteins across genomes..

Visually Compare the Protein Families in a Set of Genomes

This is described at Comparing Protein Families across individual genomes and genome groups. A description of the output and how to manipulate it can be found here

Determine the Evidence of Quality for a Genome

A quality analysis is performed on every private genome created using Genome Annotation. The analysis results are stored in a file called GenomeReport.html in the annotation service output folder. A description of how to read this file is in the tutorial Analyzing Genome Quality.

Find the Closest N Genomes to a Particular Genome

Use the Similar Genome Finder. For more information read Similar Genome Finder Service.

Working with Protein Families

List the Features in a Protein Family and the Genomes Containing Them

This procedure is described in Finding a Protein Family by ID.

Find the Function of a Protein Family

First, get a list of the features in the protein family using Finding a Protein Family by ID or Finding a Feature’s Protein Family. Then use the procedure in Finding the Protein Family Function.

Working with Features

Find the Sequence, Protein Translation, and Other Known Attributes of a Gene

The many attributes of a feature are displayed on the Feature (Gene) Overview Tab.

Find the Upstream Region of a Gene

The upstream region can be seen most clearly using the Compare Region Viewer. The upstream region is shown to the left of the red arrow.

Find All the Genes in the Same Protein Family As A Feature

This procedure is described in Finding a Feature’s Protein Family.

Create an Alignment and the Associated Phylogenetic Tree from a Set of Features

This is described in Creating a Multiple-Sequence Alignment.

Find the Papers Relating to a Specific Feature

This information is displayed in the right-hand column of the Feature (Gene) Overview Tab.