Gene ECH74115_0843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0843 
SymboltolB 
ID6971245 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp865526 
End bp866818 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content53% 
IMG OID643384868 
Producttranslocation protein TolB 
Protein accessionYP_002269374 
Protein GI209399861 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0823] Periplasmic component of the Tol biopolymer transport system 
TIGRFAM ID[TIGR02800] tol-pal system beta propeller repeat protein TolB 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000968933 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCAGG CATTACGAGT AGCATTTGGT TTTCTCATAC TGTGGGCATC AGTTCTGCAT 
GCTGAAGTCC GCATTGTGAT CGACAGCGGT GTAGATTCCG GTCGTCCTAT TGGAGTTGTT
CCTTTCCAGT GGGCGGGCCC TGGTGCGGCA CCTGAAGATA TTGGCGGCAT CGTTGCTGCC
GACTTGCGTA ACAGCGGTAA ATTTAATCCG TTAGATCGCG CTCGTCTGCC ACAGCAGCCG
GGTAGTGCGC AGGAAGTACA ACCAGCTGCA TGGTCCGCGC TGGGTATTGA CGCTGTAGTT
GTCGGTCAGG TCACTCCGAA TCCGGATGGT TCTTACAATG TTGCTTATCA ACTTGTTGAC
ACTGGCGGCG CACCGGGTAC TGTACTTGCT CAGAACTCGT ACAAAGTGAA CAAGCAGTGG
CTGCGTTATG CTGGTCATAC CGCCAGTGAT GAAGTGTTTG AAAAACTGAC CGGTATTAAA
GGTGCGTTCC GTACCCGTAT TGCCTACGTT GTTCAGACCA ACGGCGGTCA GTTCCCGTAT
GAACTGCGCG TATCTGACTA TGACGGTTAC AACCAGTTTG TCGTTCACCG TTCACCACAG
CCGCTGATGT CCCCGGCGTG GTCACCAGAC GGTTCTAAAC TGGCTTATGT GACCTTCGAA
AGCGGTCGTT CCGCGCTGGT TATACAAACG CTGGCAAACG GCGCTGTACG TCAGGTGGCT
TCATTCCCGC GTCACAACGG TGCGCCTGCA TTCTCGCCAG ACGGCAGCAA ACTGGCATTC
GCCTTGTCGA AAACCGGTAG CCTGAACCTG TACGTAATGG ATTTGGCTTC TGGTCAGATC
CGCCAGGTGA CTGATGGTCG CAGTAACAAT ACCGAACCGA CCTGGTTCCC GGACAGCCAG
AACCTGGCAT TTACTTCTGA CCAGGCCGGT CGTCCACAGG TTTATAAAGT GAATATCAAC
GGCGGTGCGC CACAACGTAT TACCTGGGAA GGTTCGCAGA ACCAGGATGC GGATGTCAGC
AGCGACGGTA AATTTATGGT AATGGTCAGC TCCAATGGTG GGCAGCAGCA CATTGCCAAA
CAAGATCTGG CAACGGGAGG CGTACAAGTT CTGTCGTCCA CGTTCCTGGA TGAAACGCCA
AGTCTGGCAC CTAACGGCAC TATGGTAATC TACAGCTCTT CTCAGGGGAT GGGATCCGTG
CTGAATTTGG TTTCTACAGA TGGGCGTTTC AAAGCGCGTC TTCCGGCAAC TGATGGACAG
GTCAAATTCC CTGCCTGGTC GCCGTATCTG TGA
 
Protein sequence
MKQALRVAFG FLILWASVLH AEVRIVIDSG VDSGRPIGVV PFQWAGPGAA PEDIGGIVAA 
DLRNSGKFNP LDRARLPQQP GSAQEVQPAA WSALGIDAVV VGQVTPNPDG SYNVAYQLVD
TGGAPGTVLA QNSYKVNKQW LRYAGHTASD EVFEKLTGIK GAFRTRIAYV VQTNGGQFPY
ELRVSDYDGY NQFVVHRSPQ PLMSPAWSPD GSKLAYVTFE SGRSALVIQT LANGAVRQVA
SFPRHNGAPA FSPDGSKLAF ALSKTGSLNL YVMDLASGQI RQVTDGRSNN TEPTWFPDSQ
NLAFTSDQAG RPQVYKVNIN GGAPQRITWE GSQNQDADVS SDGKFMVMVS SNGGQQHIAK
QDLATGGVQV LSSTFLDETP SLAPNGTMVI YSSSQGMGSV LNLVSTDGRF KARLPATDGQ
VKFPAWSPYL