Gene ECH74115_3102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3102 
Symbol 
ID6971228 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2875832 
End bp2879461 
Gene Length3630 bp 
Protein Length1209 aa 
Translation table11 
GC content51% 
IMG OID643386930 
Productmolybdate metabolism regulator MolR homolog 
Protein accessionYP_002271398 
Protein GI209400731 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.772316 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAAGG AATTACCGTG GCTGGCGGAT AACGCCCAAC TGGAACTGAA ATATAAAAAA 
GGCAAAACGC CGCTCAGTCA TCGTCGCTGG CCGGGCGAAC CAGTGTCCGT TATCACTGGA
AGTCTCATCC AGACATTGGG AGATGAATTA CTACAACAAG CAGGCCAGAA GGAAAATATC
ACCTGGAATT ATGATAAATG CTCACTGGAG TGGCAGTCCG CCATCCAGCA GGCAATCAAT
TTGACTGGCG AGCATAAACC GTCAATCCCA GCCCTTACCA TGGCGGCTCT GATCTGTATC
GCGCAAAATG ACAGCCAGCA GTTACTTGAT GAAATCGTGC AACAAGAGGG GCTGGAATAT
GCGACGGATG TAGTGATATC GCGTCAGTGC ATTGCGCGGC GTTATGAGAG TGACTCACTG
GTCGTCACTT TGCAGTATCA GGACGAGGAC TATGGCTACG GTTATGGCTC CGCGACCTAT
AACGATTTCG ATCTCCGTTT GCGTAAGCAT CTGTCGCTGG CAGAGGAAAG TTGCTGGCAG
CGTTGTGCCG ATAAACTCAT TGCCGCACTG CCAGGAATAC CCAAAATTCG CCGTCCTTTT
ATTGCGCTGA TCCTCCCGGA AAAACCTGAA ATAGCCAATG AACTCGCCAG TCTTGAAAGC
TCCCGGTCAA GTCTCCATTC AAAGGAGTGG TTAAAGGTCG TTGCTACTGA TAATACGGCG
GTAAAAAAAC TCGAACGATA CTGGGGCCTG GACGTATTTA GCGATCGCGA AGCCAGCTAC
ATGTCACAAG AAAACCGCTT CGGCTACGCG GCCTGCGCCT CCCTTTTGCG CGAACAAGGA
CTGGCAGCCG TTCCGCGCCT GGCAATGTAT GCTCATAAAG AAGATTGCGG CAGTCTGCTG
GTACAAATTA ACCATCCGCA AGTCATCCGC ACCTTGCTGT TAGTTGCTGA TAAAAACAAA
CCCAGCCTGC AACGTGTGGC TAAATACAGT AAAAACTTCC CCCATGCGAC GCTCGCCGCA
CTGGCAGAAC TGCTGGCGTT AAAAGAACCA CCAGCCCGCC CTGGTTATCC AATCATCGAA
GACAAAAAGC TGCCTGCACA GCAAAAAGCT CGAGAAGAAT ACTGGCGTAC GCTGTTACAA
ACGCTGATGG CATCGCAGCC ACAACTGGCA GCAGAAGTGA TGTCGTGGTT AAGTACTCAA
GCCAGGGCAG TGCTGAAGAG TTATTTATCG GCACTGCCAA AACCGGTTAT TGATGGCACC
GATAACAGCA ATCTGCCTGA AATCCTCGTT TCACCACCGT GGCGTAGTAA GAAAAAAATG
ACAGCTCCAC GTCTTGATTT GGCACGGCTC GAATTAACTC CGCAAGTTTA CTGGCAACCA
GGCGAACGAG AGAGGCTTGC CGCCACTGAG TCTGCCCGTT ATTTCAGCAC GGAATCTCTT
GCGCAACGCA TGGAACAAAA AAGTGGGCGA GTTGTATTAC AGGAACTGGG TTTTGGGGAT
GATGTATGGC TGTTTTTGAA TTATATACTC CCTGGAAAAC TGGATGCTGC ACGCAATTCA
CTCATTGTTC AGTGGCATTA CTACCAGGGG CGGGTTGAAG AGATCCTGAA TGGCTGGAAC
TCCCCGGAAG CACAATTAGC AGAACAGGCG CTCCGCAGCG GTCACATAGA AGCGTTAATT
AACATATGGG AAAATGACAA CTACTCACGT TATCGTCCGG AAAAGAGTGT CTGGAACCTG
TATTTATTGG CACAGTTGCC GCGTGAGATG GCTTTGACTT TCTGGCTGCG TATCAATGAG
AAAAAGCATC TGTTCGCGGG AGAGGACTAT TTTCTCAGTA TCCTCGGATT GGATGCGCTA
CCAGGTCTGC TGTTGGCTTT TTCACATCGT CCAAAAGAAA CATTTCCGTT AATTTTAAAT
TTCGGCGCAA CAGAACTGGC CCTGCCCGTT GCCCGCGTCT GGCGACGTTT CGCGGCGCAG
CGTGATCTGG CTCGCCAGTG GATTTTACAA TGGCCGGAAC ATACGGCTAC TGCACTTATC
CCTCTTGTCT TCACCAAATC TAGCGATAAA AGCGAAGCTG CATTACTTGC CCTGCGTTTA
CTTTACGAAC ATGGGCATGG CGAATTGCTA CAAACTGTAG CCAATCGTTG GCAGCGTACA
GATGTATGGC CTGCCCTGGA GCATTTACTG AAACAGGGTC CCATGGAAAT TTACCCGGCA
CGCATTCCAA AAGCCCCTGA TTTCTGGCAT CCGCAAATGT GGTCCAGGCC GCGCCTTATC
ACTAATAATC AACCTGTTAC CGATGACGCT CTGGAAATTA TCGGCGAAAT GCTGCGCTTT
ACCCAGGGGG GACGTTTTTA TAGCGGGCTG GAACAACTGA AAACGTTCTG CCAGCCACAA
ACGCTGGCAG CTTTTGCCTG GGATCTCTTC ACTGCGTGGC AACAAGCTGG TGCCCCCGCA
AAAGACAACT GGGCATTTCT GGCGTTAAGT CTCTTTGGTG ACGAAAGCAC GGCACGGGAT
CTAACAACAC AGATCCTCGC CTGGCCACAA GGCAAATCTG CCCGTGCTGT CAGCGGCCTG
AACATCCTTA CCCTGATGAA TAATGATATG GCGCTGATAC AGCTGCATCA TATATCGCAA
CGGGCTAAAT CCCGCCCCTT ACGTGATAAC GCGGCGGAAT TTCTTCAGGT AGTCGCAGAA
AATCGCGGGC TAAGCCAGGA AGAGCTAGCG GACAGATTAG TCCCAACCCT GGGCCTTGAT
GATCCGCAGG CGTTGAGTTT TGATTTTGGT CCCCGGCAGT TTACCGTTCG CTTCGATGAA
AACCTCAACC CGGTTATCTT TGATCAGCAA AACGTTCGCC AGAAAAGCGT TCCCCGGTTG
CGCGCCGATG ACGATCAACT GAAAGCGCCC GAGGCACTGG CCCGACTAAA AGGGCTAAAA
AAAGATGCTA CTCAGGTGAG CAAAAACCTG CTCCCGCGTC TTGAAGCTGC CCTACGTACC
ACCCGACGCT GGTCGCTGGC AGATTTTCAT TCTCTGTTTG TTAATCATCC CTTTACCCGT
CTGGTTACCC AGCGATTAAT ATGGGGGGTT TATCCGGCAA ATGAACCGCG TCGTTTACTC
AACGCCTTTC GTGTGGCCGC AGAGGGGGGG TTCTGCAATG CGCAAGATGA GCCAATTGAC
CTGCCTGCGG ACGCTCTGAT TGGCATTGCC CACCCGTTAG AAATGACAGC AGAAATGCGC
AGTGAATTTG CACAGCTTTT TGCCGATTAC GAAATTATGC CGCCTTTTCG CCAGTTGGCG
CGCCGCACGG TGCTGCTCAC ACCTGATGAG TCAACCAGTA ACAGCCTGAC TCGCTGGGAA
GGTAAATCCG CTACCGTTGG GCAACTTATG GGAATGCGAT ACAAAGGCTG GGAGTCAGGT
TATGAGGACG CATTTGTCTA TGACCTGGGT GAGTACCGGC TGGTCCTTAA GTTTTCACCC
GGTTTTAACC ACTACAATGT TGATAGCAAA GCGTTAATGA GCTTCCGTTC TCTTCGAGTG
TACCGTGACA ATAAATCCGT CACTTTTGCC GAACTTGATG TGTTTAATTT GAGTGAGGCG
TTAAGCGCAC CTGACGTCAT TTTCCATTAA
 
Protein sequence
MDKELPWLAD NAQLELKYKK GKTPLSHRRW PGEPVSVITG SLIQTLGDEL LQQAGQKENI 
TWNYDKCSLE WQSAIQQAIN LTGEHKPSIP ALTMAALICI AQNDSQQLLD EIVQQEGLEY
ATDVVISRQC IARRYESDSL VVTLQYQDED YGYGYGSATY NDFDLRLRKH LSLAEESCWQ
RCADKLIAAL PGIPKIRRPF IALILPEKPE IANELASLES SRSSLHSKEW LKVVATDNTA
VKKLERYWGL DVFSDREASY MSQENRFGYA ACASLLREQG LAAVPRLAMY AHKEDCGSLL
VQINHPQVIR TLLLVADKNK PSLQRVAKYS KNFPHATLAA LAELLALKEP PARPGYPIIE
DKKLPAQQKA REEYWRTLLQ TLMASQPQLA AEVMSWLSTQ ARAVLKSYLS ALPKPVIDGT
DNSNLPEILV SPPWRSKKKM TAPRLDLARL ELTPQVYWQP GERERLAATE SARYFSTESL
AQRMEQKSGR VVLQELGFGD DVWLFLNYIL PGKLDAARNS LIVQWHYYQG RVEEILNGWN
SPEAQLAEQA LRSGHIEALI NIWENDNYSR YRPEKSVWNL YLLAQLPREM ALTFWLRINE
KKHLFAGEDY FLSILGLDAL PGLLLAFSHR PKETFPLILN FGATELALPV ARVWRRFAAQ
RDLARQWILQ WPEHTATALI PLVFTKSSDK SEAALLALRL LYEHGHGELL QTVANRWQRT
DVWPALEHLL KQGPMEIYPA RIPKAPDFWH PQMWSRPRLI TNNQPVTDDA LEIIGEMLRF
TQGGRFYSGL EQLKTFCQPQ TLAAFAWDLF TAWQQAGAPA KDNWAFLALS LFGDESTARD
LTTQILAWPQ GKSARAVSGL NILTLMNNDM ALIQLHHISQ RAKSRPLRDN AAEFLQVVAE
NRGLSQEELA DRLVPTLGLD DPQALSFDFG PRQFTVRFDE NLNPVIFDQQ NVRQKSVPRL
RADDDQLKAP EALARLKGLK KDATQVSKNL LPRLEAALRT TRRWSLADFH SLFVNHPFTR
LVTQRLIWGV YPANEPRRLL NAFRVAAEGG FCNAQDEPID LPADALIGIA HPLEMTAEMR
SEFAQLFADY EIMPPFRQLA RRTVLLTPDE STSNSLTRWE GKSATVGQLM GMRYKGWESG
YEDAFVYDLG EYRLVLKFSP GFNHYNVDSK ALMSFRSLRV YRDNKSVTFA ELDVFNLSEA
LSAPDVIFH