The number of training set positive sequences matching the motif / the number of training set positive sequences.

Note these counts are made after erasing sites that match previously found motifs.

[close ]

The number of training set positive sequences matching the motif.

Note these counts are made after erasing sites that match previously found motifs.

[close ]

The number of training set negative sequences matching the motif / the number of training set negative sequences.

Note these counts are made after erasing sites that match previously found motifs.

[close ]

The number test set positive sequences matching the motif / the number of test set positive sequences.

Note these counts are made after erasing sites that match previously found motifs.

[close ]

The number of test set positive sequences matching the motif.

Note these counts are made after erasing sites that match previously found motifs.

[close ]

The number of test set negative sequences matching the motif / the number of test set negative sequences.

Note these counts are made after erasing sites that match previously found motifs.

[close ]

The mean distance from the center of the best match to the sequence center, averaged over all training set sequences with a match.

[close ]

The mean distance from the center of the best match to the sequence center, averaged over all test set sequences with a match.

[close ]

For determining if a motif is statistically significant, you should use the value in the E-value column. If there is no E-value column, that means that either the positive or negative hold-out set would have been too small (fewer than 5 sequences). For very small sequence sets, it is not practical for STREME to compute an accurate E-value. In that case, you can determine if your motif is significant by running STREME twenty or more times on shuffled versions of your positive dataset, and seeing if the Score is always larger than the Score using the original sequences. You can make shuffled sequence datasets using the MEME Suite command-line utility fasta-shuffle-letters) if you have installed the MEME Suite on your own computer.

The statistical test used in computing the Score is either the Fisher Exact Test, the Binomial Test, or the Cumulative Bates distribution. (See Inputs and Settings for the particular test being used.) The Fisher Exact Test and the Binomial Test both estimate the enrichment of the motif in the positive sequences compared to the the negative sequences. (The Binomial Test is used when the positive and negative sequences have different average lengths.) The Cumulative Bates distribution measures the tendency of motif to be near the center of the input sequences.

[close ]

The statistical test used in computing the p-value is either the Fisher Exact Test, the Binomial Test, or the Cumulative Bates distribution. (See Inputs and Settings at the bottom of this document for the particular test being used.) The Fisher Exact Test and the Binomial Test both measure the enrichment of the motif in the positive test sequences compared to the the negative test sequences. (The Binomial Test is used when the positive and negative sequences have different average lengths.) The Cumulative Bates distribution measures the tendency of motif to be near the center of the sequences.

[close ]

The score threshold for determining if a potential site is a match to the motif. The same threshold is applied when determining matches in the training and test sequences. The threshold is in bits.

The match score of a position in a sequence is determined by converting the motif to a base-2 log-odds matrix using the formula log2(prob[a][i]/background[a]). Here, prob[a][i] is the probability of the letter 'a' at position 'i' of the motif, and background[a] is the probability of the letter 'a' according to the background.

[close ]

The names of the files containing the positive (primary) and negative (control) sequences input to STREME.

If you did not provide a file containing the negative (e.g., control) sequences, STREME created them using N-order shuffling. 0-order shuffling preserves 1-mer frequencies (i.e., the letter frequencies), 1-order shuffling preserves 2-mer frequencies, etc.

[close ]

The name of the alphabet of the sequences.

[close ]

The number of sequences.

[close ]

The total length of the sequences.

[close ]

The name of the alphabet symbol.

[close ]

The frequency of the alphabet symbol in the negative sequences.

[close ]

The frequency of the alphabet symbol as defined by the background model.

[ close ]

STREME

Sensitive, Thorough, Rapid, Enriched Motif Elicitation

For further information on how to interpret these results please access https://meme-suite.org/meme/doc/streme.html.
To get a copy of the MEME software please access https://meme-suite.org.

Your browser does not support canvas!

Sequences

Role	Source	Alphabet	Sequence Count	Total Size
Positive (primary) Sequences
Negative (control) Sequences

Background Model

Other Settings

Strand Handling	This alphabet only has one strand. Only the given strand is processed. Both the given and reverse complement strands are processed.
Objective Function
Statistical Test
Motif Selection Criterion
Minimum Motif Width
Maximum Motif Width
Sequence Shuffling
Test Set
Word Evaluation
Seed Refinement
Refinement Iterations
Minimum Score
Refinement Match Subsets
Minimum Palindrome Ratio
Maximum Palindrome Edit Distance
Print Candidate Motifs?
Random Number Seed
Trimming of Control Sequences
Total Length
Maximum Motif
Maximum Motifs to Find
Maximum Run Time

Details

STREME

Sensitive, Thorough, Rapid, Enriched Motif Elicitation

Your browser does not support canvas!

Discovered Motifs

Inputs & Settings

Sequences

Background Model

Other Settings

STREME version

Command line