![]() |
The Genomatix Mapping Algorithm is used for mapping the input sequences against a small RNA (non-coding RNA) library.
The Genomatix smallRNA library contains 0.25 million sequences of non-coding RNAs collected
from a number of pertinent public databases and stratified into 10 classes.
Details and numbers can be found on this page on the small RNA library
If small RNA sequencing data are mapped to the small RNA library, no linker removal is needed. This speciality is due to the fact that the smallRNA library is constructed in such a way that only the overlaps of the sequence reads with the small RNA is taken into consideration.
Input data | |
---|---|
Available Files |
Input data should contain the input reads / tags from the sequencing. The following formats are accepted:
You can use shift/ctrl-keys to select multiple files from the list. All selected files will then be used as input
(also see merge option below).
When adding a new file, a new window will open, asking you to either
Note that almost all browsers have a general upload limit of 2 GB, i.e. files bigger than this size should be zipped before uploading from your local computer. This restriction does not apply when using the direct import from the GGA. Optionally you can specify a name for saving uploaded files on the server, otherwise the name of the uploaded file will be used. If several files are uploaded, the string given here will be used as prefix for each file name. After one or several Read files were uploaded successfully, and after closing the popup window,
the list of available Read files will be automatically updated.
Uploaded Read files can be deleted from the project anytime via the project management. |
Merge results | If this checkbox is ticked, all mapping results from several input files will be merged into one single BAM output file. Otherwise, each input file will result in a single BAM file which will be stored separately into the result management. |
Genomatix Mapper Parameters | |
Sequencing |
Antisense directed
Select this checkbox in case of an antisense sequencing experiment
(Library type of the RNA-seq protocol.)
Specifies whether the strand generated during first strand synthesis (antisense strand) was sequenced. If this option is set, the reads will be reverse complemented before mapping. |
Seed Mapping | This parameter controls the seed search strategy:
Select fast for an exact search of seeds from the mapping library.
Select deep for a seed search with one error tolerance.
Note: The seed search mode should be selected according to the sequence length and sequencing error rate. The 'deep' mode should be used if the sequences are short (< 50 bps) or if the expected error rate is high (> 10%). Otherwise the 'fast' mode can be used without losing a significant amount of good hits compared to the deep mode. Note: For smallRNA mapping 'deep' mode is recommended.Please note that for sequences ≥ 75bps this parameter will be
automatically set to 'fast'.
|
Alignment | Alignment type
Please note that gapped alignment is required for subsequent Small Variant Detection
|
Alignment quality can be selected in two different ways:
|
|
Note: The choice of the alignment method depends on the
sequencer which generated the results:
|
|
Masking |
These parameters determine the number of base pairs at the ends
of the sequence reads which should be excluded from the mapping.
|
Output Options |
Reporting of matches:
If this parameter is set, a de novo splice junction detection via spliced alignment of the sequence reads will be calculated. |
Output |
Result | Here, you can edit the default name of the result file. |
Email address | Here you can choose between two methods for receiving
the results:
The results will be available for a limited time on our server. For details of how long your results will be kept please see the result-email. After that period they will be deleted unless protected in the project management! |
The output has a number of sections, depending on the input and parameters:
The result sections are described in detail below.
Here is a description of the categories for the Genomatix Mapper.
The number of reads that map (unique or multiple respectively) to the various small RNA classes are displayed in this view:
In this chart the Mapping Overview is displayed graphically. Here's an example:
Example:
The rightmost column shows the number and percentage of perfectly mapped reads
(alignment quality = 100).
Also shown are the reads mapping with one (alignment quality 97%) or two mismatches.
Moving the mouse pointer over one of the columns shows the numbers.
Note that you can zoom into the graphics by selecting a x-range with the mouse. To
show the default range, click the "reset zoom" button.
Format:
Column description | Explanation | Format | Example |
---|---|---|---|
Organism | Target organism (three letter format) | String | aga |
Chromosome | Chromosome name | String | chr2L |
start | start position on chromosome | Int | 1385844 |
end | end position on chromosome | Int | 1385865 |
strand | genomic strand | Char (+/-) | - |
GXN | Genomatix ncRNA ID, sequence based | String | GXN_2864 |
name | ncRNA name | String | mmu-miR-466f-3p |
total hits | unique + multiple hits for this ncRNA | Int | 321 |
relative hits | total hits divided by read count for this class | Float | 0.00456 |
annotation | ncRNA annotation | String | mmu-miR-466f-3p MIMAT0004882 Mus musculus miR-466f-3p |
© 2022 Precigen Bioinformatics Germany GmbH - All rights reserved |