Elementolab/ChIPseeqerGetPCGenes
From Icbwiki
(Difference between revisions)
| Revision as of 20:02, 19 November 2010 Eug2002 (Talk | contribs) ← Previous diff |
Current revision Eug2002 (Talk | contribs) |
||
| Line 17: | Line 17: | ||
| The following options are available: | The following options are available: | ||
| - | -projFile FILE This file is created from the ChIPseeqerModel script and contains the PC scores for all transcripts. | + | --projFile FILE This file is created from the ChIPseeqerModel script and contains the PC scores for all transcripts. |
| - | -pcNum INT Set this option to a number to specify the number of principal components you are going to extract the transcripts for. | + | --pcNum INT Set this option to a number to specify the number of principal components you are going to extract the transcripts for. |
| + | --thres NUMBER Set this threshold value to extract genes that have value > threshold | ||
| Example of the '''projFile'''. | Example of the '''projFile'''. | ||
| Line 38: | Line 39: | ||
| ... | ... | ||
| NM_001130845 0.0002 | NM_001130845 0.0002 | ||
| - | ... | ||
| - | NM_001134738 -0.7238 | ||
| - | NM_001006109 -1.3473 | ||
| '''IMPORTANT''': The scores are sorted numerically (from positive to negative values) | '''IMPORTANT''': The scores are sorted numerically (from positive to negative values) | ||
Current revision
Back to Elementolab/
Back to Elementolab/ChIPseeqerModel
[edit]
ChIPseeqerGetPCGenes
This script extracts for each principal component the transcripts and their scores (after PCA).
[edit]
How to run the script
To run the script directly from any folder, you need to add the $CHIPSEEQERDIR and $CHIPSEEQERDIR/SCRIPTS to your $PATH. Read How to set the CHIPSEEQERDIR variable.
1. To run the program, type the command:
ChIPseeqerGetPCGenes --projFile=MatPromBind.txt.proj
The following options are available:
--projFile FILE This file is created from the ChIPseeqerModel script and contains the PC scores for all transcripts. --pcNum INT Set this option to a number to specify the number of principal components you are going to extract the transcripts for. --thres NUMBER Set this threshold value to extract genes that have value > threshold
Example of the projFile.
GENE PC1 PC2 PC3 NM_018117 2.6802 1.0600 -0.0294 NM_001130845 13.8136 12.4116 -2.2314 NM_021107 4.3074 1.8483 1.4885
[edit]
How do the results look like
The output of this script is one file for each PC, named MatPromBind.txt.proj.PC1.transcripts.txt and so on.
Here's what these files will contain.
NM_001706 17.0907 NM_001029835 14.2258 NM_001518 12.1606 ... NM_001130845 0.0002
IMPORTANT: The scores are sorted numerically (from positive to negative values)
