MAST results in plain text format.
MAST results in XML format.
The sequence that would achieve the best possible match score (and its reverse complement for nucleotide motifs).
MAST computes the pairwise correlations between each pair of motifs. The correlation between two motifs is the maximum sum of Pearson's correlation coefficients for aligned columns divided by the width of the shorter motif. The maximum is found by trying all alignments of the two motifs.
Motifs with correlations below 0.60 have little effect on
the accuracy of the E-values computed by MAST. Motifs with higher
correlations with other motifs should be removed from the query. You can
also request MAST to remove redundant motifs from its analysis
under Advanced options from the MAST web page,
or by specifying
when running MAST on your own computer.
This diagram shows the normal spacing of the motifs specified to MAST.
MAST will calculate larger p-values for sites that diverge from the order and spacing in the diagram.
The E-value of the sequence.
If strands were scored separately then there will be two E-values for the sequence separated by a slash (/). The score for the provided sequence will be first and the score for the reverse-complement will be second.
The block diagram shows the best non-overlapping tiling of motif matches on the sequence. These motif matches are the ones used by MAST to compute the E-value for the sequence. Hovering the mouse cursor over a motif match causes the display of the motif name, position p-value of the match and other details in the hovering text.
The combined p-value of the sequence.
If strands were scored separately with a complementable alphabet then there will be two p-values for the sequence separated by a slash (/). The score for the given sequence will be first and the score for the reverse-complement will be second.
This indicates the offset used for translation of the DNA.
The annotated sequence shows a portion of the sequence with the matching motif sequences displayed above.
The displayed portion of the sequence can be modified by sliding the two buttons below the sequence block diagram so that the portion you want to see is between the two needles attached to the buttons. By default the two buttons move together but you can drag one individually by holding shift before you start the drag.
If the strands were scored separately then overlaps in motif sites may occur so you can choose to display only one strand at a time. This is done by selecting "Matches on given strand" or "Matches on opposite strand" from the drop-down list.
The sequence p-value of a score is defined as the probability of a random sequence of the same length containing some match with as good or better a score.
The combined p-value of a sequence measures the strength of the match of the sequence to all the motifs and is calculated by
The E-value of a sequence is the expected number of sequences in a random database of the same size that would match the motifs as well as the sequence does and is equal to the combined p-value of the sequence times the number of sequences in the database.
If you use MAST in your research, please cite the following paper:
Timothy L. Bailey and Michael Gribskov, "Combining evidence using p-values: application to sequence homology searches", Bioinformatics, 14(1):48-54, 1998. [pdf]
Motifs with a pale red background are very similar to other earlier
specified motifs and may be biasing the results.
It is recommended that you re-run MAST and request it to remove redundant motifs.
Motifs which are grayed-out were very similar to other earlier specified motifs and were removed from the scan as you requested.
The following sequence databases were supplied to MAST.
|Database||Sequence Count||Residue Count||Last Modified|
The following motif databases were supplied to MAST.
The expected order and spacing of the motifs (as specified by you).
|Max Sequence E-value|
|Adjust Hit p-value|
|Displayed Weak Hits|