RepeatMasker: screens DNA sequences for low complexity DNA sequences and interspersed repeats (Smit. A, Green. P)



your e-mail
( = required, = conditionally required)


Sequence File : please enter either :

  1. the name of a file:
  2. or the actual data here:

(sequence format)


Species options


Masking options


Running options


Output options


Species options

Masks rodent specific and mammalian wide repeats (-mus)
Masks repeats found in non-primate, non-rodent mammals (-mammal)
Masks repeats found in Arabidopsis thaliana (-arabidopsis)
Masks repeats found in Drosophilas (-drosophila)
masks repeats found in C. elegans (-elegans)

Custom library (e.g. from another species) (-lib) : please enter
either :
  1. the name of a file:
  2. or the actual data here:


only clips E coli insertion elements (-is_only)
clips IS elements before analysis (-is_clip)
skips bacterial insertion element check (-no_is)
only checks for rodent specific repeats (-rodspec)
only checks for primate specific repeats (-primspec)

[Return to the main part with your favorite browser's Back function]


Masking options

Does not mask low_complexity DNA or simple repeats (-nolow)
Only masks low complex/simple repeats (no interspersed repeats) (-noint)
Only masks Alus (and 7SLRNA, SVA and LTR5)(only for primate DNA) (-alu)
Masks only those repeats that are less than this percent diverged from the consensus sequence (-div)
Sets cutoff score for masking repeats when using -lib (-cutoff)

[Return to the main part with your favorite browser's Back function]


Running options


Program search speed
use WU-blast, rather than cross_match as engine (-w)
Use matrices calculated for this percentage background GC level (-gc)
Program calculates the GC content even for batch files/small seqs (-gccalc)
Maximum sequence length masked without fragmenting (-frag)
Skips the steps in which repeats are excised (-nocut)
Prints cross_match progress report to screen (-noisy)

[Return to the main part with your favorite browser's Back function]


Output options

Shows the alignments in a .align output file (-a)
Alignments are presented in the orientation of the repeat (with option -a) (-inv)
Saves a sequence (in file.cut) from which full-length repeats are excised (-cut)
Returns complete masked sequence in lower case (-small)
Returns repetitive regions in lowercase (rest capitals) rather than masked (-xsmall)
Returns repetitive regions masked with Xs rather than Ns (-x)
Reports simple repeats that may be polymorphic (in file.poly) (-poly)
Creates an additional output file in ACeDB format (-ace)
Creates a General Feature Finding format output (-gff)
Creates an untouched annotation file besides the manipulated file (-u)
Creates an additional output file in cross_match format (for parsing) (-xm)
Creates an (old style) annotation file with fixed width column (-fixed)
leaves out final column with unique ID for each element (-no_id)
calculates repeat densities excluding runs of >25 Ns in query (-e)

[Return to the main part with your favorite browser's Back function]


your e-mail

Some explanations about the options


Main parameters
enter either the name of a file or the actual data
if you are using Netscape 2.x or later, you can select a file by typing its name, or better, by selecting it with the Netscape file browser (Browse button)
OR you can type your data in the next area, or cut and paste it from another application.
(but not both)


Running options
Program search speed
Quick search: 5-10% less sensitive, 3-4 times faster than default. Rush job: about 10% less sensitive. Slow search: 0-5% more sensitive, 2.5 times slower than default.
Sequence format
The sequence will be automatically converted in the format needed for the program
providing you enter a sequence either:
in plain (raw) sequence format or in one of the following known formats:
IG,GenBank,NBRF,EMBL,GCG,DNAStrider,Fitch,fasta,Phylip,PIR,MSF,ASN,PAUP,CLUSTALW
You may enter in the text area a database entry code, or an accession number, in this form:
database:entry_name
or:
database:accession.

References:

We still haven't published a paper on RepeatMasker yet but would appreciate it if you could refer to the web page (Smit,AFA and Green,P RepeatMasker at http://repeatmasker.genome.washington.edu/cgi-bin/RM2_req.pl) or otherwise to Smit, AFA and Green, P., unpublished results.

Pise form generator version: 5.a (30 Oct 2009 13:37)