Difference between revisions of "Catapult"
From Marcotte Lab
Line 4: | Line 4: | ||
* A [http://www.cs.utexas.edu/~naga86/research/geneRecommender/ query interface], where you can submit your own gene lists and have the results emailed to you. | * A [http://www.cs.utexas.edu/~naga86/research/geneRecommender/ query interface], where you can submit your own gene lists and have the results emailed to you. | ||
* The project [http://www.marcottelab.org/users/mblom/catapult/WeightedWalks1.0.zip source code], if you want to test it all yourself. | * The project [http://www.marcottelab.org/users/mblom/catapult/WeightedWalks1.0.zip source code], if you want to test it all yourself. | ||
+ | * All [http://www.marcottelab.org/paper-pdfs/genes_phenotypes.txt gene-disease predictions] in a 300MB tab-delimited text file (or .zipped 120MB version [http://www.marcottelab.org/paper-pdfs/genes_phenotypes.zip here]). Rows represent human genes, indicating gene names in column 1 as NCBI Entrez genome/Locus link ID numbers. Columns represent human diseases or mouse phenotypes. Entries represent computed scores for gene-trait associations, where larger values indicate more likely associations. All values are scaled, such that sorting by values in a given column allows one to find the most likely genes for that disease, and sorting by values in a given row allows one to find the most likely linked traits for a given gene (obvious caveats being that the diseases/genes must be represented in this table, and the associations are subject to the reach and quality of the predictions captured by CATAPULT.) |
Revision as of 18:35, 2 March 2012
This is the homepage for Catapult. Here we provide:
- The predictions for all the OMIM diseases used.
- A query interface, where you can submit your own gene lists and have the results emailed to you.
- The project source code, if you want to test it all yourself.
- All gene-disease predictions in a 300MB tab-delimited text file (or .zipped 120MB version here). Rows represent human genes, indicating gene names in column 1 as NCBI Entrez genome/Locus link ID numbers. Columns represent human diseases or mouse phenotypes. Entries represent computed scores for gene-trait associations, where larger values indicate more likely associations. All values are scaled, such that sorting by values in a given column allows one to find the most likely genes for that disease, and sorting by values in a given row allows one to find the most likely linked traits for a given gene (obvious caveats being that the diseases/genes must be represented in this table, and the associations are subject to the reach and quality of the predictions captured by CATAPULT.)