KBase Documentation
  • KBase Documentation
  • KBase Terms & Conditions
  • Getting Started
    • Signing Up and Signing In
      • Step-by-Step Signup Guide
      • Authentication Update
    • Supported Browsers
    • Narrative Quick Start
    • Narrative Interface User Guide
      • Access the Narrative Interface
      • Tour the Narrative Interface
      • Narrative Navigator
      • Create a Narrative
      • Explore Data
      • Add Data to Your Narrative
      • Browse KBase Analysis Tools
      • Analyze Data Using KBase Apps
      • Job Browser
      • Revise Your Narrative
      • Format Markdown Cells
      • Share Narratives
      • Linking Static Narratives to ORCID
      • Access and Copy Narratives
      • Organizations
    • FAQs
  • Manage Your Account
    • Linking Accounts
    • Linking KBase to ORCiD
  • Working with Data
    • Data Upload and Download Guide
      • Data Types
      • Importing Data
        • Bulk Import Limitations
      • Assembly
      • Genome
      • FASTQ/SRA Reads
      • Flux Balance Analysis (FBA) Model
      • Media
      • Expression Matrix
      • Phenotype Set
      • Amplicon Matrix
      • Chemical Abundance Matrix
      • SampleSet
      • Compressed/Zipped Files
      • Bulk Import Specification
      • Downloading Data
    • Searching, Adding, and Uploading Data
    • Filtering, Managing, and Viewing Data
    • Linking Metadata
      • Ontologies and Validated Terms
    • Public Data in KBase
    • Transfer Data with Globus
  • Using Apps
    • Analysis Apps in KBase
      • Assembly & Annotation
      • Comparative Genomics
      • Metabolic Modeling
      • Metagenomics & Community Exploration
      • Data Matrices - Amplicon, Stats
      • Chemical Abundance
      • Expression & Transcriptomics
    • Apps in Beta
  • Running Common Workflows
    • Assembling & Annotating Microbial Genomes
      • FAQ: Assembly and Annotation
    • Comparative Genomics & Phylogenetic Analysis
      • FAQ: Comparative Genomics
    • Metagenomic & Community Analysis
      • FAQ: Metagenomics & Community Analysis
    • Transcriptomic Analysis
      • FAQ: RNA-seq Analysis
    • Constructing Metabolic Models
      • Constructing and Analyzing Metabolic Flux Models of Microbial Communities
      • FAQ: Metabolic Modeling
  • Community Developed Workflows and Tools
    • Functional Annotation
    • Functional and Taxonomic Profiling of MAGs
    • Taxonomy
    • Viral
    • Random Walk with Restart Toolkit
  • Troubleshooting
    • Problems with the User Interface
    • Help Board
    • How to Report Issues
    • Job Errors and Their Meanings
      • Common Job Errors
        • The Job Log
      • Import Job Errors
      • Assembly App Errors
      • Annotation App Errors
      • Functional Genomics App Errors
      • Modeling App Errors
  • Developing Apps
    • The KBase SDK
    • Create a KBase Developer Account
    • KBase GitHub Repository
  • External Links
    • KBase Narrative Interface
    • KBase web site
    • KBase App Catalog
  • kbase.us
Powered by GitBook
On this page

Was this helpful?

  1. Working with Data
  2. Data Upload and Download Guide

Data Types

PreviousData Upload and Download GuideNextImporting Data

Last updated 9 months ago

Was this helpful?

Data within KBase covers a wide range of data types relevant to systems biology research, including genomes and their annotations, metagenomes, expression, and protein-protein interactions, inferred models of organismal and community metabolism and gene regulation, even geographical information about populations.

Remember to follow the KBase agreement.

When working with data in KBase and using Apps, remember that choosing the same output name for data objects will overwrite existing data objects of the same type with that name.

Data Type Descriptions

KBase handles data as objects versus files for interoperability with KBase Apps. The following are data types and their adjacent file types:

  • Assembly ⏤ FASTA files; extensions .fasta, .fna, .fa, .fas

  • FASTQ/SRA Reads ⏤ Interleaved, Non-interleaved, paired-end reads, single-end reads; extensions .fastq, .fq, .sra

  • Genome ⏤ GenBank or GFF3 with a FASTA file; extensions .genbank, .gb, .gbk, or .gbff, .gff, and .fasta, .fna, .fa, .fas

  • Metagenome ⏤ GFF Metagenome with a FASTA file; extensions .gbff, .gff, and .fasta, .fna, .fa, .fas

  • FBA Model ⏤ SBML, Excel, or TSV; extensions .sbml, .xml, .tsv, .xls, .xlsx

  • Media ⏤ Excel or TSV; .xls, .xlsx, .tsv

  • Expression Matrix ⏤ Excel or TSV; .xls, .xlsx, .tsv

  • Phenotype Set ⏤ Excel or TSV; .xls, .xlsx, .tsv .tab

  • Amplicon Matrix ⏤ Excel and FASTA; .xls, .xlsx, and .fasta, .fna, .fa, .fas

  • Chemical Abundance Matrix ⏤ Excel, CSV, or TSV; .xls, .xlsx, .csv, .tsv

  • SampleSet ⏤ Excel or TSV; .xls, .xlsx, .tsv

The lists and describes the key data types representing different classifications of biological data within KBase.

Data Policy and Sources
Data Type Catalog