Gene ECH74115_B0072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_B0072 
Symbol 
ID6966456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011350 
Strand
Start bp35267 
End bp37240 
Gene Length1974 bp 
Protein Length657 aa 
Translation table11 
GC content52% 
IMG OID643383973 
Producthypothetical protein 
Protein accessionYP_002268452 
Protein GI209395548 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.385572 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGATTG CCGGGCTGCG TGAAGAGCCC GCACAGGACT GGTTATCTTT GCATAAACGG 
CTGGCCAGTG ATGGTCTGTA TATCACGATG CAGGAAGGGG AACTGGTGGT GAAGGATGGC
TGGGACAGAG CGCGGGAAGG TGTTGCACTC AGTTCGTTCG GACCGTTATG GACAGCTGAA
AAACTGGGGC GGAAACTGGG GGAATATCAG CCTGTACCGA CAGATATTTT CAGTCAGGTG
GGGACACCCG GTCGTTATGA TCCGGAAGCC ATAAACGTTG ATATCCGGCC GGAAAAAGTG
GCAGAAACGG AGAGTCTGAA ACAGTACGCC TGCCGTCATT TTGCCGAACG CTTACCGGAA
ATGGCGAGAA ATGGTGAGCT GGAAAGTTGC CTGGATGTGC ACAGAACTCT GGCAACGGCT
GGATTATGGA TGGGGATCCA GCACGGGCAT CTGGTACTGC ATGATGGATT TGATAAACAA
CAGACACCAG TACGTGCAGA CAGTGTGTGG CCGTTAATGA CGCTGGATTA TATGCAGGAT
CTGGATGGTG GATGGCAGCC TGTACCGAAG GATATTTTTA CTCAGGTTAT ACCCGGAGAA
CGGTTCCGGG GACGTAATCT TGGTACTCAG GCTGTCAGTG ATTACGAATG GTATCGTATG
CGGATGGGGA CAGGTCCACA GGGAGCAATA AAGCGCGAAC TGTTTTCTGA TAAGGAAAGT
CTGTGGGGGT ACACAACTGT TCAGTGTGAG TCGTTGATTG AAGATATGAT CGCGGGCGGG
AATTTCAGCT GGCAGGCGTG CCATGAGATG TTTGCGCGTA AAGGGCTTAT GTTGCAGAAA
CAGCATCATG GTCTTGTGAT TGTGGATGCG TTTAATCATG AGCTCACACC AGTGAAGGCC
AGCAGTATCC ACCCGGATTT GACGTTATCC AGGGCTGAGC CGCAGGCAGG GCCATTTGAG
ATTGCAGGTG CAGATATATT TGAACGGGTG AAGCCTGAAT GTCGCTACAA TCCGGAACTT
GCTGCCAGCG ATGAAGTGGA GCCCGGCTTC CGGCGCGATC CTGTACTGCG TCGTGAACGC
CGTGAAGCCC GTGCAGCTGC GCGGGAAGAT TTGCGCGCGC GTTATCTTGC ATGGAAGGAG
CACTGGCGTA AACCCGACTT ACGTTACGGA GAACGGTTAC GGGAAATTCA TGCTGCCTGC
CGTCGGCGCA AGGCGTATAT CCGTGTGCAG TTCCGGGATC CACAATTGCG TAAGCTGCAT
TACCATATTG CGGAAGTGCA GCGTATGCAG GCGCTGATCA GGCTGAAGGA GAGTGTGAAG
GAAGAGCGGT TGTCACTGAT TGAGGAGGGT AAGTGGTATC CTTTGTCCTA CCGGCAGTGG
GTGGAGCAAC AGGCGGTACA GGGAGACAGA GCTGCATTAT CACAGTTACG TGGCTGGGAT
TATCGTGATC GTCGCAAAGA CAAGCGACGA ACAACGAATG CCGACCGCTG CGTGATTCTC
TGTGAACCGG GAGGAACTCC ACTTTATGAA GACACTGGCG TTCTCGAAGC CCGTCTGCAG
AAAGATGGCA GTGTGCGTTT TCGTGATCGG AGGAATGGCG AGTTAGTCTG TGTGGATTAT
GGGGACCGTG TGGTGTTCTA CCATCATCAG GACAGAAATG AACTGGTGGA TAAGCTGAAT
CTGATTGCTC CTGTATTGTT TGATCGTGAG CCAGGAATGG GTTTTGAACC GGAAGGCTCA
TATCAACAGT TTAATGATGT ATTTGCGGAA ATGGTGGCCT GGCATAATGC AGCTGGAATA
ACCGGAAATG GACACTTTGT AATCAGCCGC CCGGATGTGG ATTTACATCG TCAGCGTAGT
GAACAGTACT ACCACGAATA TATCAGGCAA CAGAAAAGCA TATCCGGGGG GCATGGCGCA
TCTTATGCTC CAGTTCAGGA TAATGAGTGG ACGCCACCAT CACCTGGAAT GTAG
 
Protein sequence
MSIAGLREEP AQDWLSLHKR LASDGLYITM QEGELVVKDG WDRAREGVAL SSFGPLWTAE 
KLGRKLGEYQ PVPTDIFSQV GTPGRYDPEA INVDIRPEKV AETESLKQYA CRHFAERLPE
MARNGELESC LDVHRTLATA GLWMGIQHGH LVLHDGFDKQ QTPVRADSVW PLMTLDYMQD
LDGGWQPVPK DIFTQVIPGE RFRGRNLGTQ AVSDYEWYRM RMGTGPQGAI KRELFSDKES
LWGYTTVQCE SLIEDMIAGG NFSWQACHEM FARKGLMLQK QHHGLVIVDA FNHELTPVKA
SSIHPDLTLS RAEPQAGPFE IAGADIFERV KPECRYNPEL AASDEVEPGF RRDPVLRRER
REARAAARED LRARYLAWKE HWRKPDLRYG ERLREIHAAC RRRKAYIRVQ FRDPQLRKLH
YHIAEVQRMQ ALIRLKESVK EERLSLIEEG KWYPLSYRQW VEQQAVQGDR AALSQLRGWD
YRDRRKDKRR TTNADRCVIL CEPGGTPLYE DTGVLEARLQ KDGSVRFRDR RNGELVCVDY
GDRVVFYHHQ DRNELVDKLN LIAPVLFDRE PGMGFEPEGS YQQFNDVFAE MVAWHNAAGI
TGNGHFVISR PDVDLHRQRS EQYYHEYIRQ QKSISGGHGA SYAPVQDNEW TPPSPGM