Gene ECH74115_0842 details

Gene Information

Locus tag: ECH74115_0842
Symbol: tolA
ID: 6972034
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 864119
End bp: 865393
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 52%
IMG OID: 643384867
Product: cell envelope integrity inner membrane protein TolA
Protein accession: YP_002269373
Protein GI: 209400066
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3064] Membrane protein involved in colicin uptake
TIGRFAM ID: [TIGR01352] TonB family C-terminal domain
            [TIGR02794] TolA protein
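
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only; the values are copied from the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 864119
end_bp = 865393
gene_length_bp = 1275
protein_length_aa = 424

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print(span_bp, total_codons, coding_codons)  # 1275 425 424
```

Under these assumptions the span equals the listed gene length, the length is a multiple of three, and the number of coding codons equals the listed protein length.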


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000389137
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCAAAGG CAACCGAACA AAACGACAAG CTCAAGCGGG CGATAATTAT TTCAGCAGTG 
CTGCATGTCA TCTTATTTGC GGCGCTGATC TGGAGTTCGT TCGATGAGAA TATAGAAGCT
TCAGCTGGAG GCGGCGGTGG TTCGTCCATC GACGCTGTCA TGGTTGATTC AGGTGCGGTA
GTTGAGCAGT ACAAACGCAT GCAAAGCCAG GAATCAAGCG CGAAGCGTTC TGATGAGCAG
CGCAAGATGA AGGAACAGCA GGCTGCTGAA GAACTGCGTG AGAAACAAGC GGCTGAACAG
GAACGCCTGA AGCAACTTGA GAAAGAGCGG TTAGCTGCTC AGGAACAGAA AAAGCAGGCT
GAAGAAGCCG CAAAACAGGC CGAGTTAAAG CAGAAGCAAG CGGAAGAGGC GGCAGCGAAA
GCGGCGGCAG ATGCTAAAGC GAAGGCCGAA GCGGATGATA AAGCTGCGGA AGAAGCAGCG
AAGAAAGCGG CTGCAGACGC GAAGAAAAAA GCAGAAGCAG AAGCCGCCAA AGCCGCAGCC
GAAGCGCAGA AAAAAGCCGA GGCAGCAGCT GCGGCGCTGA AGAAGAAAGC GGAAGCGGCA
GAAGCAGCTG CAGCTGAAGC AAGAAAGAAA GCGGCAGCAG AGAAAGCTGC AGCCGACAAA
AAAGCAGCAG AGAAAGCTGC AGCCGACAAA AAAGCAGCAG AAAAAGCGGC TGCTGAAAAG
GCAGCAGCAG AGAAAGCTGC AGCCGACAAA AAAGCAGCAG AAAAAGCGGC TGCTGAAAAG
GCAGCAGCTG ATAAGAAAGC AGCGGCAGAA AAAGCCGCCG CAGACAAAAA AGCGGCAGCT
GCAAAAGCAG CAGCTGAAAA AGCCGCTGCA GCAAAAGCTG CCGCGGAGGC AGATGATATT
TTCGGTGAGC TAAGCTCTGG TAAGAATGCA CCGAAAACGG GGGGAGGGGC GAAAGGGAAC
AATGCTTCGC CTGCCGGGAG TGGTAATACT AAAAACAATG GCGCATCAGG GGCCGATATC
AATAACTATG CCGGGCAGAT TAAATCTGCT ATCGAAAGTA AGTTCTATGA CGCATCGTCC
TATGCAGGCA AAACCTGTAC GCTGCGCATA AAACTGGCAC CCGATGGTAT GTTACTGGAT
ATCAAACCTG AAGGTGGCGA TCCCGCACTT TGTCAGGCTG CGTTGGCAGC AGCTAAACTT
GCGAAGATCC CGAAACCACC AAGCCAGGCA GTATATGAAG TGTTCAAAAA CGCGCCATTG
GACTTCAAAC CGTAA
 
Protein sequence
MSKATEQNDK LKRAIIISAV LHVILFAALI WSSFDENIEA SAGGGGGSSI DAVMVDSGAV 
VEQYKRMQSQ ESSAKRSDEQ RKMKEQQAAE ELREKQAAEQ ERLKQLEKER LAAQEQKKQA
EEAAKQAELK QKQAEEAAAK AAADAKAKAE ADDKAAEEAA KKAAADAKKK AEAEAAKAAA
EAQKKAEAAA AALKKKAEAA EAAAAEARKK AAAEKAAADK KAAEKAAADK KAAEKAAAEK
AAAEKAAADK KAAEKAAAEK AAADKKAAAE KAAADKKAAA AKAAAEKAAA AKAAAEADDI
FGELSSGKNA PKTGGGAKGN NASPAGSGNT KNNGASGADI NNYAGQIKSA IESKFYDASS
YAGKTCTLRI KLAPDGMLLD IKPEGGDPAL CQAALAAAKL AKIPKPPSQA VYEVFKNAPL
DFKP
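
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes translation table 11 assigns the same amino acids to codons as the standard genetic code, and that the first codon of the gene (here GTG) is read as the initiator methionine. The function `translate` and its `starts_at_initiator` parameter are hypothetical helpers written for this illustration, not part of any library. Only the first and last lines of the gene sequence are used.

```python
# Codon table in the usual TCAG ordering (amino acid assignments of the
# standard code, which translation table 11 is assumed to share).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str, starts_at_initiator: bool = True) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon.

    Whitespace between the 10-base blocks is ignored. If
    starts_at_initiator is True, the first codon is emitted as M
    (a simplification: no check that it is a valid start codon).
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and starts_at_initiator:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence in this record.
first_line = "GTGTCAAAGG CAACCGAACA AAACGACAAG CTCAAGCGGG CGATAATTAT TTCAGCAGTG"
last_line = "GACTTCAAAC CGTAA"

print(translate(first_line))                            # MSKATEQNDKLKRAIIISAV
print(translate(last_line, starts_at_initiator=False))  # DFKP
```

The two outputs correspond to the first 20 residues and the last 4 residues of the protein sequence listed above.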