KBase Documentation
  • KBase Documentation
  • KBase Terms & Conditions
  • Getting Started
    • Signing Up and Signing In
      • Step-by-Step Signup Guide
      • Authentication Update
    • Supported Browsers
    • Narrative Quick Start
    • Narrative Interface User Guide
      • Access the Narrative Interface
      • Tour the Narrative Interface
      • Narrative Navigator
      • Create a Narrative
      • Explore Data
      • Add Data to Your Narrative
      • Browse KBase Analysis Tools
      • Analyze Data Using KBase Apps
      • Job Browser
      • Revise Your Narrative
      • Format Markdown Cells
      • Share Narratives
      • Linking Static Narratives to ORCID
      • Access and Copy Narratives
      • Organizations
    • FAQs
  • Manage Your Account
    • Linking Accounts
    • Linking KBase to ORCiD
  • Working with Data
    • Data Upload and Download Guide
      • Data Types
      • Importing Data
        • Bulk Import Limitations
      • Assembly
      • Genome
      • FASTQ/SRA Reads
      • Flux Balance Analysis (FBA) Model
      • Media
      • Expression Matrix
      • Phenotype Set
      • Amplicon Matrix
      • Chemical Abundance Matrix
      • SampleSet
      • Compressed/Zipped Files
      • Bulk Import Specification
      • Downloading Data
    • Searching, Adding, and Uploading Data
    • Filtering, Managing, and Viewing Data
    • Linking Metadata
      • Ontologies and Validated Terms
    • Public Data in KBase
    • Transfer Data with Globus
  • Using Apps
    • Analysis Apps in KBase
      • Assembly & Annotation
      • Comparative Genomics
      • Metabolic Modeling
      • Metagenomics & Community Exploration
      • Data Matrices - Amplicon, Stats
      • Chemical Abundance
      • Expression & Transcriptomics
    • Apps in Beta
  • Running Common Workflows
    • Assembling & Annotating Microbial Genomes
      • FAQ: Assembly and Annotation
    • Comparative Genomics & Phylogenetic Analysis
      • FAQ: Comparative Genomics
    • Metagenomic & Community Analysis
      • FAQ: Metagenomics & Community Analysis
    • Transcriptomic Analysis
      • FAQ: RNA-seq Analysis
    • Constructing Metabolic Models
      • Constructing and Analyzing Metabolic Flux Models of Microbial Communities
      • FAQ: Metabolic Modeling
  • Community Developed Workflows and Tools
    • Functional Annotation
    • Functional and Taxonomic Profiling of MAGs
    • Taxonomy
    • Viral
    • Random Walk with Restart Toolkit
  • Troubleshooting
    • Problems with the User Interface
    • Help Board
    • How to Report Issues
    • Job Errors and Their Meanings
      • Common Job Errors
        • The Job Log
      • Import Job Errors
      • Assembly App Errors
      • Annotation App Errors
      • Functional Genomics App Errors
      • Modeling App Errors
  • Developing Apps
    • The KBase SDK
    • Create a KBase Developer Account
    • KBase GitHub Repository
  • External Links
    • KBase Narrative Interface
    • KBase web site
    • KBase App Catalog
  • kbase.us
Powered by GitBook
On this page
  • Formatting Amplicon Matrix files
  • Uploading and Importing
  • Tutorial Webinar

Was this helpful?

  1. Working with Data
  2. Data Upload and Download Guide

Amplicon Matrix

Generating and handling Amplicon Matrices to link to Samples

PreviousPhenotype SetNextChemical Abundance Matrix

Last updated 7 months ago

Was this helpful?

Formatting Amplicon Matrix files

Any approach to creating an amplicon matrix should be usable in KBase, but may require additional curation. Amplicon matrices or taxonomic abundance matrices include OTU (operational taxonomic units) and ASV (amplicon sequence variants).

We recommend to upload raw counts as analysis () will be tracked and provenance maintained on system. There are a number of .

Amplicon matrices can be uploaded from a TSV (tab-separated values) file with a .tsv file extension.

When uploading, ensure that rows are taxa and columns are samples. A column for taxonomy is optional.

Consensus sequences must also be included as a FASTA file with extension .fa, .fasta. Sequence names must match exactly with the taxa names in the amplicon matrix.

Two files are needed for uploading amplicon data:

  • FASTA file with consensus sequences

  • Taxonomic abundance matrix (OTU or ASV)

    • row = taxon

    • column = sample

    • optional column = taxonomy

Additional information to have on hand for parameters include:

  • Sample processing metadata (SampleSet), i.e. sequencing technology or platform, target gene or region

  • Bioinformatic processing metadata, i.e. clustering method, quality filtering steps

Uploading and Importing

Once the files are in your Staging Area, you can import the data into your Narrative.

Go to the APPS panel and open the Upload category to locate the "Import Amplicon Matrix from TSV/FASTA File in Staging Area" App. Click on the Import Amplicon App to add it to your Narrative.

In the first section for Input Objects, select the previously imported SamplesSet file from the dropdown menu for linking metadata.

Under Parameters, select the Amplicon Matrix or Taxonomic Abundance TSV file using the dropdown and then the corresponding sequences FASTA file.

Parameters that require inputs are the target gene, target subfragment, taxon calling method, i.e. denoising or clustering, and sequence error cutoff.

Additional processing metadata, i.e. primer sequences, library kit, sequencing center, the denoise or clustering methods can be input by clicking "show advanced" parameters. While these parameters are not required, they are recommended to enhance workflow documentation and reproducibility.

Fill in the Amplicon Matrix Object Name and click the "Run" button.

Once your amplicon matrix is imported into the Narrative, you can use the "View Matrix as Table" App to view your new AmpliconMatrix object.

Tutorial Webinar

Amplicon Matrices can be uploaded into KBase without linking metadata. For full functionality of analysis tools using metadata, first .

Using a file on your computer, open the . Then drag & drop the amplicon matrix file and the FASTA file with consensus sequences into the Staging Area.

The amplicon matrix may be applied to an existing and on-system to facilitate the interoperability of objects in KBase.

rarefaction, standardization, and normalization
tools in KBase to analyze Amplicon Matrices and metadata
upload and import the SampleSet
Import tab within the Data Browser
SampleSet
Attribute Mapping
Importing Samples and Amplicon
4KB
amplicon_example.zip
archive
Example of test amplicon matrix with consensus FASTA
Import Amplicon Matrix from TSV/FASTA File in Staging Area. Double click to view.