The EhecRegNet 1.0p prediction workflow

Transfer of gene regulatory networks from the model organism E. coli K12 to EHEC organisms

Workflow

Workflow description

| Step | Description | Publication |
|---|---|---|
| EhecRegNet 1.0e - experimental | We used all validated datasets of E. coli from the EhecRegNet database. | |
| MoRAine | All known transcription factor binding sites are re-adjusted (strand re-annotation only, no position shifts, i.e. l=r=0; method: Cluster growing/Motif-seed similarity). Position weight matrices (PWMs) are then constructed from the adjusted sites. | Baumbach J, Wittkop T, Weile J, Kohl T, Rahmann S (2008) MoRAine - A web server for fast computational transcription factor binding motif re-annotation. Journal of Integrative Bioinformatics 5(2):91. |
| PoSSuMsearch | We used PoSSuMsearch to scan the upstream/promoter sequences of all transcription units of the EHEC organisms (extracted from EhecRegNet) for putative transcription factor binding sites, using the MoRAine-adjusted binding sites of E. coli. | Beckstette M, Homann R, Giegerich R, Kurtz S (2006) Fast index based algorithms and software for matching position specific scoring matrices. BMC Bioinformatics 7:389. |
| TransClust | All amino acid sequences of EHEC are extracted from EhecRegNet and clustered into clusters of orthologous/conserved proteins. We consider two genes orthologous/conserved if (1) the corresponding proteins are in the same cluster and (2) at least one of the surrounding genes also shares a cluster. | Wittkop T, Emig D, Lange SJ, Rahmann S, Albrecht M, Morris JH, Boecker S, Stoye J, Baumbach J (2010) Partitioning biological data with Transitivity Clustering. Nature Methods 7(6):419-20. |
| EhecRegNet 1.0p - predicted | We consider a gene regulatory interaction conserved between E. coli and another organism if (1) the transcription factor is conserved (see TransClust), (2) its binding sites are conserved (see MoRAine/PoSSuMsearch), and (3) the putative target genes are conserved as well (see TransClust). The corresponding gene regulation is then added to the EhecRegNet 1.0p database. | |
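
The conservation criteria from the workflow steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build EhecRegNet 1.0p: the `Gene` class, the function names, the gene names, and the cluster ids are all hypothetical. The reading of "surrounding genes also shares a cluster" as "the two neighborhoods have at least one cluster id in common" is our interpretation. The binding-site criterion is reduced to a boolean that is assumed to come from a separate PWM scan.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Gene:
    """Hypothetical record: a gene, its protein cluster, and its neighbors' clusters."""
    name: str
    cluster: int                         # cluster id of the encoded protein
    neighbor_clusters: Tuple[int, ...]   # cluster ids of the surrounding genes


def genes_conserved(a: Gene, b: Gene) -> bool:
    """Criterion (1): same protein cluster; criterion (2): a shared neighbor cluster."""
    same_cluster = a.cluster == b.cluster
    shared_neighbor = bool(set(a.neighbor_clusters) & set(b.neighbor_clusters))
    return same_cluster and shared_neighbor


def interaction_conserved(tf_pair, target_pair, binding_site_hit: bool) -> bool:
    """An interaction is transferred only if TF, binding site, and target are all conserved."""
    return (genes_conserved(*tf_pair)
            and binding_site_hit
            and genes_conserved(*target_pair))


# Made-up example data: one regulator and one target gene in each organism.
tf_k12 = Gene("tfA_k12", cluster=1, neighbor_clusters=(10, 11))
tf_ehec = Gene("tfA_ehec", cluster=1, neighbor_clusters=(11, 12))
target_k12 = Gene("geneX_k12", cluster=2, neighbor_clusters=(20, 21))
target_ehec = Gene("geneX_ehec", cluster=2, neighbor_clusters=(21, 22))

transferred = interaction_conserved(
    (tf_k12, tf_ehec), (target_k12, target_ehec), binding_site_hit=True
)
print(transferred)  # True
```

In this sketch, an interaction with a conserved regulator and target is still rejected when `binding_site_hit` is `False`, which mirrors the requirement that all three criteria hold at once.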