Gene EcSMS35_0763 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0763 
SymboltolB 
ID6143437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp770335 
End bp771630 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content53% 
IMG OID641615652 
Producttranslocation protein TolB 
Protein accessionYP_001742851 
Protein GI170683440 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0823] Periplasmic component of the Tol biopolymer transport system 
TIGRFAM ID[TIGR02800] tol-pal system beta propeller repeat protein TolB 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000112892 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAAGC AGGCATTACG AGTAGCATTT GGTTTTCTCA TACTGTGGGC ATCAGTTCTG 
CATGCTGAAG TCCGCATTGT GATCGACAGC GGTGTAGATT CCGGTCGTCC TATTGGTGTT
GTTCCTTTCC AGTGGGCGGG GCCTGGTGCG GCACCTGAAG ATATTGGCGG CATCGTTGCT
GCTGACTTGC GTAACAGCGG TAAATTTAAT CCGTTAGATC GCGCTCGTCT GCCACAGCAG
CCGGGTAGTG CGCAGGAAGT ACAACCAGCT GCATGGTCCG CACTGGGCAT TGACGCTGTA
GTTGTCGGTC AGGTCACTCC GAATCCGGAC GGCTCTTACA ATGTTGCTTA TCAACTTGTT
GACACTGGCG GCGCACCGGG TACTGTACTT GCTCAGAACT CGTACAAAGT GAACAAGCAG
TGGCTGCGTT ATGCTGGTCA TACCGCCAGT GATGAAGTGT TTGAAAAACT GACCGGCATT
AAAGGTGCGT TCCGTACCCG TATTGCCTAC GTTGTTCAGA CCAACGGCGG TCAGTTCCCG
TATGAACTGC GCGTATCTGA CTATGACGGT TACAACCAGT TTGTCGTTCA CCGTTCACCG
CAGCCGCTGA TGTCTCCGGC GTGGTCACCA GACGGTTCTA AACTGGCTTA TGTGACCTTC
GAAAGCGGTC GTTCCGCGCT GGTTATTCAG ACGCTGGCAA ATGGCGCTGT ACGTCAGGTG
GCTTCATTCC CGCGTCACAA CGGTGCACCT GCATTCTCGC CAGACGGCAG CAAACTGGCA
TTCGCCTTGT CGAAAACCGG TAGTCTGAAC CTGTACGTAA TGGATTTGGC TTCTGGTCAG
ATCCGCCAGG TGACTGATGG TCGCAGTAAC AATACCGAAC CGACCTGGTT CCCGGATAGC
CAGAACCTGG CATTTACTTC TGACCAGGCC GGTCGTCCAC AGGTTTATAA AGTGAATATC
AACGGCGGTG CGCCACAACG TATTACCTGG GAAGGTTCGC AGAACCAGGA TGCGGATGTC
AGCAGCGACG GTAAATTTAT GGTAATGGTC AGCTCCAATG GTGGGCAGCA GCACATTGCC
AAACAAGATC TGGCAACGGG AGGCGTACAA GTTCTGTCGT CCACGTTCCT GGATGAAACG
CCAAGTCTGG CACCTAACGG CACTATGGTA ATCTACAGCT CTTCTCAGGG GATGGGATCC
GTGCTGAATT TGGTTTCTAC AGATGGGCGT TTCAAAGCGC GTCTTCCGGC AACTGATGGA
CAGGTCAAAT TCCCTGCCTG GTCGCCGTAT CTGTGA
 
Protein sequence
MMKQALRVAF GFLILWASVL HAEVRIVIDS GVDSGRPIGV VPFQWAGPGA APEDIGGIVA 
ADLRNSGKFN PLDRARLPQQ PGSAQEVQPA AWSALGIDAV VVGQVTPNPD GSYNVAYQLV
DTGGAPGTVL AQNSYKVNKQ WLRYAGHTAS DEVFEKLTGI KGAFRTRIAY VVQTNGGQFP
YELRVSDYDG YNQFVVHRSP QPLMSPAWSP DGSKLAYVTF ESGRSALVIQ TLANGAVRQV
ASFPRHNGAP AFSPDGSKLA FALSKTGSLN LYVMDLASGQ IRQVTDGRSN NTEPTWFPDS
QNLAFTSDQA GRPQVYKVNI NGGAPQRITW EGSQNQDADV SSDGKFMVMV SSNGGQQHIA
KQDLATGGVQ VLSSTFLDET PSLAPNGTMV IYSSSQGMGS VLNLVSTDGR FKARLPATDG
QVKFPAWSPY L