Flux Balance Analysis (FBA) Model
Data handling to create a model to predict biomass yield given a certain amount of input nutrient.
Flux Balance Analysis (FBA) models can be uploaded into KBase as a Systems Biology Markup Language (SBML) file using the .sbml or .xml file extension, an Excel file using the .xls extension, or a tab-separated values (TSV) file using the .tsv extension.
When uploading an FBA model in the TSV format, there will be two files corresponding to chemical compounds and reactions.
Formatting Flux Balance Analysis Models
SBML
More SBML FBA Models for various organisms can be found here: http://systemsbiology.ucsd.edu/Downloads. For a description of how to create a COBRA-compliant SBML file that can be imported as an FBA model into KBase, please see SBML Level Three Specifications.
Excel
In the Excel format, the tables containing the information about the chemical compounds and reactions in an FBA model are stored in two separate tabs respectively. The most important requirement for the Excel file is naming the tabs “ModelCompounds” and “ModelReactions.”
TSV
For the Tab-Separated Values file format, the chemical compounds and reactions tables are saved as two separate files named “FBAModelCompounds.tsv” and “FBAModelReactions.tsv” that will be transformed
FBAModelCompounds
id | name | formula | charge | aliases |
cpd00113 | Isopentenyldiphosphate | C5H10O7P2 | -2 | |
cpd02590 | all-trans-Heptaprenyl diphosphate | C35H58O7P2 | -2 |
id: Compound identifier; see the KBase Biochemistry reference for a list of compounds [link]
name: Name of chemical compound
formula: Chemical formula using the Hill system
charge: Formal charge of the molecule
aliases: Alternative names for chemical compound
FBAModelReactions
Importing an FBA model into KBase requires specified reaction IDs for the biomass producing equations in the model. An individual organism FBA model needs a single biomass equation, while a community model may have multiple biomass producing equations. In the FBAModelReactions table, denote the biomass producing reaction of an individual organism, or the first in a community model, with an ID such as “bio1” so the reaction is easily identifiable. For subsequent biomass producing reactions in a community model, it is recommended that you use “bio2” for the second biomass producing reaction, “bio3” for the third, and so on.
Scroll table below from left to right to see complete format
id | direction | compartment | gpr | name | enzyme | pathway | reference | equation |
rxn00001_c0 | -> | c0 | fig|211586.9.peg.3751 | Pyrophosphate phosphohydrolase_c0 | (1) cpd00001[c0] + (1) cpd00012[c0] -> (2) cpd00067[c0] + (2) cpd00009[c0] | |||
rxn00056_c0 | <-> | c0 | fig|211586.9.peg.1038 | Fe(II):oxygen oxidoreductase_c0 | (4) cpd00067[c0] + (4) cpd10515[c0] + (1) cpd00007[c0] <-> (4) cpd10516[c0] + (2) cpd00001[c0] |
id: Reaction identifier; see the KBase Biochemistry reference for a list of reactions
direction: Directionality of chemical reaction specified as forward (->), reversed (<-), or equilibrium (<->)
compartment: Cellular compartment that the reaction takes place in; see the Metabolic Modeling FAQ for more information
gpr: Gene-Protein Reaction in PATRIC identifier format
name: Protein associated with the reaction
enzyme: Optional, can be left empty
pathway: Optional, can be left empty
reference: Optional, can be left empty
equation: Chemical reaction equation specified with compound coefficients in parentheses, compound identifiers, and directional arrows
Importing FBA Models
In order to successfully upload an FBA model into your KBase workspace, you first need to add the Genome that corresponds to referenced in the FBA Model you wish to upload. Add the Genome to the Narrative before importing the FBA model file.
Using a file on your computer, open the Import tab within the Data Browser. Then drag & drop the FBA model file (plus compound file if using TSV format) into your Staging Area. Open the pulldown menu to the right of the filename in the Staging Area and select “FBA Model."
Now click the import icon (up arrow) to the right of “FBA Model." The Data Browser slide-out will close and an app called “Import TSV/XLS/SBML File as an FBAModel from Staging Area” will be added to your Narrative.
Notice that the Model file path is filled in, as is a suggested name for the FBA Model data object that will be created by the import (you can edit this).
At this point, the corresponding Genome needs to be linked to the FBA Model. Add the name of the Genome using the dropdown to select the Genome that you added to the Data Panel. Although specifying the Genome is optional, adding a Genome with gene IDs that match genes IDs in the model is required to import gene-protein-reactions (GPRs). GPRs are necessary to perform gene KO in the Run Flux Balance Analysis App.
Ensure the correct Model file type is selected (SBML, Excel, Tab-separated values).
Click the "+" button to the right of Biomass and type in the name of the first biomass. If there are additional biomass values to enter, click the "+" button to open another parameter field for another biomass. Note: If the biomass reaction name starts with “R_”, do NOT include the “R_” when entering the name in this field.
If importing a TSV-formatted model, select the corresponding Compounds file path from the dropdown.
Edit the FBA Model object name for the output if necessary. Click the green "Run" button to start the import.
When the import is finished, your Data Panel will update to show the new FBA Model object, and a report will appear in the Import App.
Associating an FBA Model with Genome
For KBase Metabolic Modeling Apps, associating a genome with user uploaded models is necessary for ensuring fidelity between the genome annotations and gene-protein-reactions in the metabolic model. This means that the gene IDs between the model and genome need to match. Occasionally, researchers will use locus tags or gene IDs for constructing metabolic models that are different than those found commonly in reference genomes. If the gene IDs do not match, you will need to change either the model or the genome to ensure their gene IDs match.
One way to accomplish this is to locate the genome used as the reference for building the model – based on shared gene IDs – and to upload this genome into KBase before uploading the model and associating it with the genome. If you cannot locate the genome with the gene IDs used to build the model, the best option for reconciling the gene IDs is to edit the model file (as SBML or TSV) to change the gene IDs contained therein to match the gene IDs of the genome.
This public Help Board ticket has a detailed example of how to accomplish this.
Last updated