 MCC ranges from -1 to +1, where a +1 result indicates that the occurrence
- of a best match to the motif in the reported region perfectly discriminates positive
+ of the best match to the motif in the reported region perfectly discriminates positive
 sequences from negative sequences.
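For reference, a minimal sketch of the Matthews correlation coefficient using its standard confusion-matrix definition. The function name `mcc` and the way the counts map onto sequences are assumptions for illustration (a positive sequence whose best match falls in the region is counted as a true positive, a negative sequence whose best match falls in the region as a false positive); the diff does not show how CentriMo itself computes the value.

```python
import math


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts.

    tp: positive sequences with their best match in the region (assumed mapping)
    fn: positive sequences with their best match elsewhere
    fp: negative sequences with their best match in the region
    tn: negative sequences with their best match elsewhere
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        # Conventional choice when a row or column of the table is empty.
        return 0.0
    return (tp * tn - fp * fn) / denom
```

With this mapping, a region that contains the best match of every positive sequence and of no negative sequence gives +1, and the reverse situation gives -1.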

 where p is the uncorrected p-value. The number of multiple tests is the number of regions considered times the number of score thresholds considered. It depends on the motif length, sequence length,
- and the type of optimizations being done (central enrichment, local enrichemnt, score
+ and the type of optimizations being done (central enrichment, local enrichment, score
 optimization).
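The formula that this sentence refers to did not survive in the excerpt. As a hedged sketch only, the code below assumes the adjustment has the common form 1 - (1 - p)^n, where n is the number of tests; that form and the names `count_tests` and `adjust_pvalue` are assumptions, not taken from the diff. The test count (regions times score thresholds) follows the text.

```python
import math


def count_tests(n_regions, n_thresholds):
    """Number of multiple tests: regions considered times score thresholds considered."""
    return n_regions * n_thresholds


def adjust_pvalue(p, n_tests):
    """Adjust an uncorrected p-value for n_tests tests.

    Assumed form: 1 - (1 - p) ** n_tests, evaluated with log1p/expm1 so that
    very small p-values do not lose precision.
    """
    if p <= 0.0:
        return 0.0
    if p >= 1.0:
        return 1.0
    return -math.expm1(n_tests * math.log1p(-p))
```

For small p the result is close to `p * n_tests`, which is why the number of regions and thresholds considered matters for the reported significance.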

Sort & Filter

Filter & Sort

Filters

Reference

]+ - and may have comment lines beginning with "#" in column 1. + File must have format: + [


The binomial test compares the number of sequences that had their best matches in the region to the expected number of sequences under the assumption that matches would be randomly distributed.
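A minimal sketch of such a test, assuming "randomly distributed" means each sequence's best match lands in the region with a fixed probability `p_region` (for example, the fraction of possible match positions that the region covers). The function name and that interpretation of `p_region` are assumptions for illustration.

```python
from math import comb


def binomial_upper_tail(k, n, p_region):
    """P(X >= k) for X ~ Binomial(n, p_region).

    k: sequences observed with their best match in the region
    n: total sequences that have a best match
    p_region: assumed chance that a randomly placed best match falls in the region
    """
    return sum(
        comb(n, i) * p_region ** i * (1.0 - p_region) ** (n - i)
        for i in range(k, n + 1)
    )
```

A small tail probability means more sequences have their best match in the region than random placement would explain.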

The Fisher exact test compares the number of sequences which have their
- best matches in the region to the number found in a discriminative set.

Adding a second comparative dataset allows use of the Fisher exact test
+ (FET) to sort the regions found by the binomial test. The FET compares the
+ number of sequences that have their best matches in the region to the
+ number found in the comparative set.
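A minimal sketch of the comparison as a one-sided Fisher exact test on a 2x2 table. The one-sided direction (enrichment in the primary set) and the function name are assumptions; the diff does not show which tail CentriMo uses.

```python
from math import comb


def fisher_exact_greater(a, b, c, d):
    """One-sided Fisher exact test p-value for a 2x2 table.

    a: primary sequences with their best match in the region
    b: primary sequences with their best match elsewhere
    c: comparative sequences with their best match in the region
    d: comparative sequences with their best match elsewhere

    Returns the hypergeometric probability of seeing a or more in-region
    primary sequences when the row and column totals are held fixed.
    """
    n_primary = a + b
    n_compar = c + d
    in_region = a + c
    total_ways = comb(n_primary + n_compar, in_region)
    upper = min(n_primary, in_region)
    # math.comb returns 0 when the second argument exceeds the first,
    # so impossible tables contribute nothing to the sum.
    ways = sum(
        comb(n_primary, x) * comb(n_compar, in_region - x)
        for x in range(a, upper + 1)
    )
    return ways / total_ways
```

Regions can then be sorted by this p-value, smallest first, which matches the description of using the FET to order regions found by the binomial test.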


@@ -42,7 +44,7 @@


Select a file of FASTA formatted DNA sequences or paste in actual FASTA formatted DNA sequences to compare to the other sequence set.

@@ -124,12 +126,6 @@


This option enables you to graph a comparative set of sequences against
- the set under test without using the Fisher exact test.


This option enables you to store the sequence identifiers which have their best match in the best region for each motif. This option can
@@ -185,13 +181,13 @@

Select how enrichment is determined

- - Binomial Test - - Fisher Exact Test (discriminative) -

+ + Binomial Test + + Binomial Test + Fisher Exact Test (comparative) +

@@ -211,18 +207,18 @@ -

+ @@ -324,12 +320,6 @@ name="ethresh" size="5" value="10" min="0" step="any">

Allow comparative sequences for binomial test?

- - Always allow comparative sequences -

Include sequence IDs

Include a list of matching sequence ids
diff -r e77390759cae -r 61325860cd46 website/cgi-bin/centrimo_verify.tmpl
--- a/website/cgi-bin/centrimo_verify.tmpl Thu Nov 22 18:53:34 2012 +1100
+++ b/website/cgi-bin/centrimo_verify.tmpl Tue Jan 22 18:14:22 2013 +1000
@@ -27,9 +27,9 @@
Central regions - + Enrichment Test - Fisher Exact Test (discriminative) + Binomial Test + Fisher Exact Test (comparative) Enrichment Test Binomial Test
diff -r e77390759cae -r 61325860cd46 website/cgi-bin/meme-chip.pl
--- a/website/cgi-bin/meme-chip.pl Thu Nov 22 18:53:34 2012 +1100
+++ b/website/cgi-bin/meme-chip.pl Tue Jan 22 18:14:22 2013 +1000
@@ -193,6 +193,8 @@
   # get the maximum central enrichment E-value to report
   $d{CENTRIMO_ETHRESH} = $utils->param_num($q, 'centrimo_ethresh', 'CentriMo E-value threshold', 0, undef, 10);
   # get if the seq IDs should be stored
+  $d{CENTRIMO_LOCAL} = $utils->param_bool($q, 'centrimo_local');
+  # get if the seq IDs should be stored
   $d{CENTRIMO_STORE_IDS} = $utils->param_bool($q, 'centrimo_store_ids');
   return \%d;
@@ -224,6 +226,7 @@
   push(@args, '-centrimo-score', $data->{CENTRIMO_SCORE});
   push(@args, '-centrimo-maxreg', $data->{CENTRIMO_MAXREG}) if (defined($data->{CENTRIMO_MAXREG}));
   push(@args, '-centrimo-ethresh', $data->{CENTRIMO_ETHRESH});
+  push(@args, '-centrimo-local') if $data->{CENTRIMO_LOCAL};
   push(@args, '-centrimo-noseq') unless $data->{CENTRIMO_STORE_IDS};
   # sequences and motif dbs
   push(@args, $data->{SEQ_NAME}, $data->{DBMOT_DBS});
@@ -279,6 +282,7 @@
   $template->param(centrimo_score => $data->{CENTRIMO_SCORE});
   $template->param(centrimo_maxreg => $data->{CENTRIMO_MAXREG});
   $template->param(centrimo_ethresh => $data->{CENTRIMO_ETHRESH});
+  $template->param(centrimo_local => $data->{CENTRIMO_LOCAL});
   $template->param(centrimo_store_ids => $data->{CENTRIMO_STORE_IDS});
   return $template->output;
diff -r e77390759cae -r 61325860cd46 website/cgi-bin/meme-chip.tmpl
--- a/website/cgi-bin/meme-chip.tmpl Thu Nov 22 18:53:34 2012 +1100
+++ b/website/cgi-bin/meme-chip.tmpl Tue Jan 22 18:14:22 2013 +1000
@@ -179,6 +179,13 @@


This option causes all regions up to the maximum region size to be
+ considered even if they are not in the center. This can be useful with
+ non-symmetric data.


This option enables you to store the sequence identifiers which have their best match in the best region for each motif. This option can
@@ -287,7 +294,7 @@

Scan both DNA strands?

- scan given strand only

@@ -439,6 +446,12 @@ id="centrimo_ethresh">

Find uncentered regions

+ + Run CentriMo in local mode to find uncentered regions +

Include sequence IDs

diff -r e77390759cae -r 61325860cd46 website/cgi-bin/meme-chip_verify.tmpl
--- a/website/cgi-bin/meme-chip_verify.tmpl Thu Nov 22 18:53:34 2012 +1100
+++ b/website/cgi-bin/meme-chip_verify.tmpl Tue Jan 22 18:14:22 2013 +1000
@@ -76,11 +76,13 @@
 Minimum Site Score
- Maximum Central Region
+ Maximum Region
 E-value Threshold
+ Allow Uncentered Regions
+ EnabledDisabled
 Store Sequence IDs
 EnabledDisabled
diff -r e77390759cae -r 61325860cd46 website/html/centrimo.js
--- a/website/html/centrimo.js Thu Nov 22 18:53:34 2012 +1100
+++ b/website/html/centrimo.js Tue Jan 22 18:14:22 2013 +1000
@@ -6,9 +6,9 @@
 function on_page_show() {
   pasted_sequences_enable($('use_pasted').checked);
-  discr_pasted_sequences_enable($('use_discr_pasted').checked);
+  compar_pasted_sequences_enable($('use_compar_pasted').checked);
   upload_secondaries_enable($('motif_db').value == 'upload');
-  update_discr();
+  update_compar();
   $("max_region").disabled = !$("use_max_region").checked;
   toggle_class($('adv_opts'), 'expanded', adv_changed());
 }
@@ -58,15 +58,13 @@
   $('pasted_sequences').style.display = (enable ? 'block' : 'none');
 }
-function update_discr() {
-  var enable;
-  enable = $('negs_always').checked || $('discr_on').checked;
-  $('discr_sequences_area').style.display = (enable ? 'block' : 'none');
+function update_compar() {
+  $('compar_sequences_area').style.display = ($('compar_on').checked ? 'block' : 'none');
 }
-function discr_pasted_sequences_enable(enable) {
-  $('discr_sequences').disabled = enable;
-  $('discr_pasted_sequences').style.display = (enable ? 'block' : 'none');
+function compar_pasted_sequences_enable(enable) {
+  $('compar_sequences').disabled = enable;
+  $('compar_pasted_sequences').style.display = (enable ? 'block' : 'none');
 }
 function check() {
@@ -81,14 +79,14 @@
       return false;
     }
   }
-  if ($("discr_on").checked || $("negs_always").checked) {
-    if (!$('use_discr_pasted').checked) {
-      if ($('discr_sequences').value == '') {
+  if ($("compar_on").checked) {
+    if (!$('use_compar_pasted').checked) {
+      if ($('compar_sequences').value == '') {
         alert("Please input a file of FASTA formatted sequences for the comparative set.\n");
         return false;
       }
     } else {
-      if ($('discr_pasted_sequences').value == '') {
+      if ($('compar_pasted_sequences').value == '') {
         alert("Please input FASTA formatted sequences for the comparative set.\n");
         return false;
       }
@@ -146,7 +144,6 @@
   if ($("opt_score").checked) return true;
   if ($("use_max_region").checked) return true;
   if (!/^\s*10\s*$/.test($("ethresh").value)) return true;
-  if ($("negs_always").checked) return true;
   if (!$("store_ids").checked) return true;
   return false;
 }
@@ -164,8 +161,7 @@
   $("max_region").disabled = true;
   $("max_region").value = 200;
   $("ethresh").value = 10;
-  $("negs_always").checked = false;
   $("store_ids").checked = true;
   toggle_class($('adv_opts'), 'modified', false);
-  update_discr();
+  update_compar();
 }
diff -r e77390759cae -r 61325860cd46 website/html/downloads.html
--- a/website/html/downloads.html Thu Nov 22 18:53:34 2012 +1100
+++ b/website/html/downloads.html Tue Jan 22 18:14:22 2013 +1000
@@ -31,7 +31,7 @@

Download MEME Suite Software and Databases

Installation Guide

Copyright

Commercial Licenses

diff -r e77390759cae -r 61325860cd46 website/html/meme-chip.js
--- a/website/html/meme-chip.js Thu Nov 22 18:53:34 2012 +1100
+++ b/website/html/meme-chip.js Tue Jan 22 18:14:22 2013 +1000
@@ -198,6 +198,7 @@
   if (!/^\s*5\s*$/.test($("centrimo_score").value)) return true;
   if ($("centrimo_maxreg_enable").checked) return true;
   if (!/^\s*10\s*$/.test($("centrimo_ethresh").value)) return true;
+  if ($("centrimo_local").checked) return true;
   if (!$("centrimo_store_ids").checked) return true;
   return false;
 }
@@ -211,7 +212,8 @@
   $("centrimo_maxreg").value = 200;
   $("centrimo_maxreg").disabled = true;
   $("centrimo_ethresh").value = 10;
-  $("centrimo_store_ids").checked = false;
+  $("centrimo_local").checked = false;
+  $("centrimo_store_ids").checked = true;
   toggle_class($('centrimo_opts'), 'modified', false);
 }
@@ -258,6 +260,17 @@
   $("meme_sites").style.display = ($('meme_dist').value == 'oops' ? 'none' : 'block');
 }
+function on_ch_norc() {
+  var norc = $("norc").checked;
+  var centrimo_local = $("centrimo_local").checked;
+  if (norc && !centrimo_local) {
+    if (confirm("CentriMo's localized search works well with this option. Enable it now?")) {
+      $("centrimo_local").checked = true;
+      toggle_class($('centrimo_opts'), 'expanded', true);
+    }
+  }
+}
+
 function on_ch_bfile() {
   if (window.File && window.FileReader && window.FileList) {
     var input = $("bfile");
diff -r 5863bee1d071 -r b6a55daaf534 website/html/meme-download.html
--- a/website/html/meme-download.html Fri Dec 07 10:34:33 2012 +1000
+++ b/website/html/meme-download.html Fri Jan 25 11:34:30 2013 +1000
@@ -27,46 +27,40 @@
Javascript doesn't seem to be available on your browser. - -

Download Software / View Command Line Options (Man Pages)

Download Software

The MEME Suite software is available for FREE interactive use via the web.

- Alternatively, you can download the software for installation and + +

Alternatively, you can download the software for installation and non-profit use on your own computer. Please read the Copyright for terms and - conditions before downloading the software. For-profit licenses - are also available; click - - here for details.

- The downloadable software includes the complete source code for the MEME suite, - and instructions on how to install and test them. The online installation - guide is Installation instructions.

+ conditions before downloading the software. For-profit + licenses are also available; click here for details.

When you install the MEME Suite software on your own - computer, you can use many features not available with the interactive - versions. You can click on the "Man Page" buttons below to see all of the - features of programs available in the downloadable versions.

The downloadable software includes the complete source code for + the MEME suite, and instructions on how to install and test them. + The online installation guide is Installation instructions.

+ +

When you install the MEME Suite software on your own computer,
+ you can use many features not available with the interactive
+ versions. Refer to the command-line documentation for information on these
+ features.

diff -r 5863bee1d071 -r b6a55daaf534 website/html/meme-suite-menu.in
--- a/website/html/meme-suite-menu.in Fri Dec 07 10:34:33 2012 +1000
+++ b/website/html/meme-suite-menu.in Fri Jan 25 11:34:30 2013 +1000
@@ -33,7 +33,7 @@
   ['Downloads', html_path+'downloads.html',
     ['Download MEME Suite Software', html_path+'meme-download.html'],
     ['Copyright', html_path+'COPYRIGHT.html'],
-    ['Commercial Licenses', 'http://invent.ucsd.edu/technology/cases/2010/SD2010-808.shtml']
+    ['Commercial Licenses', 'http://invent.ucsd.edu/technology/cases/2010/documents/MEME4_0_Jun_28_2011.pdf']
   ],
   ['User Support', html_path+'resources.html',
     ['Q&A Forum', 'https://groups.google.com/forum/#!forum/meme-suite'],
diff -r a8846bbb72a3 src/compute-prior-dist.c
--- a/src/compute-prior-dist.c Wed Oct 03 10:59:07 2012 +1000
+++ b/src/compute-prior-dist.c Tue Jan 22 12:51:53 2013 -0800
@@ -35,7 +35,10 @@
   // Read each prior, find max and min of distribution.
   DATA_BLOCK_READER_T *psp_reader = NULL;
-  psp_reader = new_prior_reader_from_psp(filename);
+  psp_reader = new_prior_reader_from_psp(
+    FALSE, // Don't try to parse genomic coord.
+    filename
+  );
   DATA_BLOCK_T *psp_block = new_prior_block();
   double min_prior = 1.0;
   double max_prior = 0.0;
diff -r a8846bbb72a3 src/compute-uniform-priors.c
--- a/src/compute-uniform-priors.c Wed Oct 03 10:59:07 2012 +1000
+++ b/src/compute-uniform-priors.c Tue Jan 22 12:51:53 2013 -0800
@@ -37,8 +37,10 @@
   char *seq_name = NULL;
-  DATA_BLOCK_READER_T *prior_reader
-    = new_prior_reader_from_psp(filename);
+  DATA_BLOCK_READER_T *prior_reader = new_prior_reader_from_psp(
+    FALSE, // Don't parse genomic coord.
+    filename
+  );
   DATA_BLOCK_T *prior_block = new_prior_block();
   if (uniform_prior < 0.0L) {
diff -r a8846bbb72a3 src/fasta-io.c
--- a/src/fasta-io.c Wed Oct 03 10:59:07 2012 +1000
+++ b/src/fasta-io.c Tue Jan 22 12:51:53 2013 -0800
@@ -327,9 +327,11 @@
   *sequences = (SEQ_T**)mm_malloc(sizeof(SEQ_T*) * num_allocated);
   // Allocate the DATA_BLOCK_READER
-  DATA_BLOCK_READER_T *fasta_reader
-    = new_seq_reader_from_fasta(alph, fasta_filename);
-
+  DATA_BLOCK_READER_T *fasta_reader = new_seq_reader_from_fasta(
+    TRUE, // parse genomic coord.
+    alph,
+    fasta_filename
+  );
   /* Read the sequences one by one. */
   i_seq = 0;
   while (
diff -r a8846bbb72a3 src/fimo.c
--- a/src/fimo.c Wed Oct 03 10:59:07 2012 +1000
+++ b/src/fimo.c Tue Jan 22 12:51:53 2013 -0800
@@ -65,6 +65,7 @@
   BOOLEAN_T allow_clobber; // Allow overwritting of files in output directory.
   BOOLEAN_T compute_qvalues; // Compute q-values
+  BOOLEAN_T parse_genomic_coord;// Parse genomic coord. from seq. headers.
   BOOLEAN_T text_only; // Generate only plain text output
   BOOLEAN_T max_strand; // When scores available for both strands
   // print only the max of the two strands.
@@ -162,6 +163,7 @@
   {"o", REQUIRED_VALUE},
   {"oc", REQUIRED_VALUE},
   {"no-qvalue", NO_VALUE},
+  {"parse-genomic-coord", NO_VALUE},
   {"psp", REQUIRED_VALUE},
   {"prior-dist", REQUIRED_VALUE},
   {"qv-thresh", NO_VALUE},
@@ -188,6 +190,7 @@
   " --norc\n"
   " --o (default=fimo_out)\n"
   " --oc (default=fimo_out)\n"
+  " --parse-genomic-coord\n"
   " --psp (default none)\n"
   " --prior-dist (default none)\n"
   " --qv-thresh\n"
@@ -205,6 +208,7 @@
   options->allow_clobber = TRUE;
   options->compute_qvalues = TRUE;
   options->max_strand = FALSE;
+  options->parse_genomic_coord = FALSE;
   options->threshold_type = PV_THRESH;
   options->text_only = FALSE;
   options->scan_both_strands = TRUE;
@@ -294,6 +298,9 @@
     options->output_dirname = option_value;
     options->allow_clobber = TRUE;
   }
+  else if (strcmp(option_name, "parse-genomic-coord") == 0){
+    options->parse_genomic_coord = TRUE;
+  }
   else if (strcmp(option_name, "thresh") == 0){
     options->output_threshold = atof(option_value);
   }
@@ -994,6 +1001,12 @@
   );
   fprintf(
     out,
+    "%s\n",
+    "parse genomic coord.",
+    boolean_to_string(options->parse_genomic_coord)
+  );
+  fprintf(
+    out,
     "%3.2g\n",
     "pseudocount",
     options->pseudocount
@@ -1201,14 +1214,19 @@
   int num_motif_names = 0;
   fimo_read_motifs(&options, &num_motif_names, &motifs, &bg_freqs);
-  DATA_BLOCK_READER_T *fasta_reader
-    = new_seq_reader_from_fasta(options.alphabet, options.seq_filename);
+  DATA_BLOCK_READER_T *fasta_reader = new_seq_reader_from_fasta(
+    options.parse_genomic_coord,
+    options.alphabet,
+    options.seq_filename
+  );
   DATA_BLOCK_READER_T *psp_reader = NULL;
   if (options.psp_filename != NULL) {
-    psp_reader = new_prior_reader_from_psp(options.psp_filename);
+    psp_reader = new_prior_reader_from_psp(
+      options.parse_genomic_coord,
+      options.psp_filename
+    );
   }
-
   PRIOR_DIST_T *prior_dist = NULL;
   if (options.prior_distribution_filename) {
     prior_dist = new_prior_dist(options.prior_distribution_filename);
diff -r a8846bbb72a3 src/prior-reader-from-psp.c
--- a/src/prior-reader-from-psp.c Wed Oct 03 10:59:07 2012 +1000
+++ b/src/prior-reader-from-psp.c Tue Jan 22 12:51:53 2013 -0800
@@ -19,6 +19,7 @@
 typedef struct psp_data_block_reader {
   BOOLEAN_T at_start_of_line;
+  BOOLEAN_T parse_genomic_coord;
   int current_position;
   char* filename;
   size_t filename_len; // Includes trailing '\0'
@@ -33,7 +34,6 @@
 // Forward declarations
-DATA_BLOCK_READER_T *new_prior_reader_from_psp(const char *filename);
 void free_prior_reader_from_psp(DATA_BLOCK_READER_T *reader);
 BOOLEAN_T close_prior_reader_from_psp(DATA_BLOCK_READER_T *reader);
 BOOLEAN_T reset_prior_reader_from_psp(DATA_BLOCK_READER_T *reader);
@@ -58,9 +58,10 @@
  * This function creates an instance of a data block reader UDT for reading
  * priors from a MEME PSP file.
  *****************************************************************************/
-DATA_BLOCK_READER_T *new_prior_reader_from_psp(const char *filename) {
+DATA_BLOCK_READER_T *new_prior_reader_from_psp(BOOLEAN_T parse_genomic_coord, const char *filename) {
   PSP_DATA_BLOCK_READER_T *psp_reader = mm_malloc(sizeof(PSP_DATA_BLOCK_READER_T) * 1);
   psp_reader->at_start_of_line = TRUE;
+  psp_reader->parse_genomic_coord = parse_genomic_coord;
   int filename_len = strlen(filename) + 1;
   psp_reader->filename = mm_malloc(sizeof(char)* filename_len);
   psp_reader->filename_len = filename_len;
@@ -268,7 +269,7 @@
   if (c == '>') {
     BOOLEAN_T found_genomic_coordinates = FALSE;
     result = read_seq_header_from_prior_reader_from_psp(psp_reader);
-    if (result == TRUE) {
+    if (result == TRUE && psp_reader->parse_genomic_coord) {
       // Look for genomic coordinates in header
       found_genomic_coordinates = parse_genomic_coordinates(psp_reader);
     }
diff -r a8846bbb72a3 src/prior-reader-from-psp.h
--- a/src/prior-reader-from-psp.h Wed Oct 03 10:59:07 2012 +1000
+++ b/src/prior-reader-from-psp.h Tue Jan 22 12:51:53 2013 -0800
@@ -17,6 +17,8 @@
  * This function creates an instance of a data block reader UDT for reading
  * priors from a MEME PSP file.
  *****************************************************************************/
-DATA_BLOCK_READER_T *new_prior_reader_from_psp(const char *filenae);
-
+DATA_BLOCK_READER_T *new_prior_reader_from_psp(
+  BOOLEAN_T parse_genomic_coord,
+  const char *filename
+);
 #endif
diff -r a8846bbb72a3 src/seq-reader-from-fasta.c
--- a/src/seq-reader-from-fasta.c Wed Oct 03 10:59:07 2012 +1000
+++ b/src/seq-reader-from-fasta.c Tue Jan 22 12:51:53 2013 -0800
@@ -22,6 +22,7 @@
 typedef struct seq_reader_from_fasta {
   BOOLEAN_T at_start_of_line;
+  BOOLEAN_T parse_genomic_coord;
   int current_position;
   char* filename;
   size_t filename_len; // Includes trailing '\0'
@@ -37,7 +38,6 @@
 // Forward declarations
-DATA_BLOCK_READER_T *new_seq_reader_from_fasta(ALPH_T alph, const char *filename);
 void free_seq_reader_from_fasta(DATA_BLOCK_READER_T *reader);
 BOOLEAN_T close_seq_reader_from_fasta(DATA_BLOCK_READER_T *reader);
 BOOLEAN_T reset_seq_reader_from_fasta(DATA_BLOCK_READER_T *reader);
@@ -60,9 +60,14 @@
  * This function creates an instance of a data block reader UDT for reading
  * sequence segments from a FASTA file.
  *****************************************************************************/
-DATA_BLOCK_READER_T *new_seq_reader_from_fasta(ALPH_T alph, const char *filename) {
+DATA_BLOCK_READER_T *new_seq_reader_from_fasta(
+  BOOLEAN_T parse_genomic_coord,
+  ALPH_T alph,
+  const char *filename
+) {
   SEQ_READER_FROM_FASTA_T *fasta_reader = mm_malloc(sizeof(SEQ_READER_FROM_FASTA_T) * 1);
   fasta_reader->at_start_of_line = TRUE;
+  fasta_reader->parse_genomic_coord = parse_genomic_coord;
   int filename_len = strlen(filename) + 1;
   fasta_reader->filename = mm_malloc(sizeof(char)* filename_len);
   fasta_reader->filename_len = filename_len;
@@ -501,7 +506,7 @@
   if (c == '>') {
     BOOLEAN_T found_genomic_coordinates = FALSE;
     result = read_seq_header_from_seq_reader_from_fasta(fasta_reader);
-    if (result == TRUE) {
+    if (result == TRUE && fasta_reader->parse_genomic_coord == TRUE) {
       // Look for genomic coordinates in header
       found_genomic_coordinates = parse_genomic_coordinates(fasta_reader);
     }
diff -r a8846bbb72a3 src/seq-reader-from-fasta.h
--- a/src/seq-reader-from-fasta.h Wed Oct 03 10:59:07 2012 +1000
+++ b/src/seq-reader-from-fasta.h Tue Jan 22 12:51:53 2013 -0800
@@ -18,7 +18,11 @@
  * This function creates an instance of a data block reader UDT for reading
  * sequence segments from a FASTA file.
  *****************************************************************************/
-DATA_BLOCK_READER_T *new_seq_reader_from_fasta(ALPH_T alph, const char *filename);
+DATA_BLOCK_READER_T *new_seq_reader_from_fasta(
+  BOOLEAN_T parse_genomic_coord,
+  ALPH_T alph,
+  const char *filename
+);
 /******************************************************************************
  * This function parses a FASTA sequence header and returns the

MEME Suite System Release Notes


Input the comparative sequences

Enter DNA sequences for comparing motif locality -
