The program for finding pathways by tracking carbons is available as a jar file: CarbonSearch.jar The program currently requires a mysql database to access certain data, a mysqldump of this database can be downloaded here. Three sets of data files are also required to run the program: Mol files obtained from KEGG Cleaned RPAIR mappings from KEGG Cleaned reaction mappings based on RPAIR from KEGG To run the program (assuming you have Java installed): java -jar CarbonSearch.jar [input file] The input file contains all of the necessary information to run the program. It consists of a list of field names and field values separated by a tab. An example input file is available here.
Description of the field names in the input file | |
---|---|
DBHOST | where the mysql database is hosted |
DBPORT | port of the mysql database |
DBNAME | mysql database name |
DBUSER | mysql database user with access to the database |
DBPASS | mysql password for the database user |
MOLDIR | where the mol files are located |
RPAIRDIR | where the cleaned RPAIR mappings are located |
REACTIONDIR | where the cleaned reaction mappings are located |
STARTCID | the KEGG compound ID for the start compound |
TARGETCID | the KEGG compound ID for the target compound |
K | value of K for Eppstein’s algorithm (usually set quite large) |
NUMCARBON | number of carbons to conserve |
DELTALENGTH | the number added to the shortest path length to determine the seed pathways, for example if the shortest path is of length 10 and DELTALENGTH is set to 4, then all pathways up to length 14 will be used |
BRANCH | if set to 0, only find linear pathways, if set to 1, find branched pathways |
NUMRETURN | the maximum number of pathways to return |
OUTPUTDIR | where the output files (in dot format) will be written |