Gene SbBS512_E0661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E0661 
SymboltolA 
ID6269840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp631197 
End bp632384 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content52% 
IMG OID641724857 
Productcell envelope integrity inner membrane protein TolA 
Protein accessionYP_001879396 
Protein GI187732995 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3064] Membrane protein involved in colicin uptake 
TIGRFAM ID[TIGR01352] TonB family C-terminal domain
[TIGR02794] TolA protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000000498099 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCAAAGG CAACCGAACA AAACGACAAG CTCAAGCGGG CGATAATTAT TTCAGCAGTG 
CTGCATGTCA TCTTATTTGC GGCGCTGATC TGGAGTTCGT TCGATGAGAA TATAGAAGCT
TCAGCCGGAG GCGGCGGTGG TTCGTCCATC GACGCTGTCA TGGTTGATTC AGGTGCGGTA
GTTGAGCAGT ACAAACGCAT GCAAAGCCAG GAATCAAGCG CGAAGCGTTC TGATGAACAG
CGCAAGATGA AGGAACAGCA GGCTGCTGAA GAACTGCGTG AGAAACAAGC GGCTGAACAG
GAACGCCTGA AGCAACTTGA GAAAGAGCGG TTAGCGGCTC AGGAGCAGAA AAAGCAGGCT
GAAGAAGCTG CAAAACAGGC CGAGTTAAAG CAGAAGCAAG CTGAAGAGGC GGCAGCGAAA
GCGGCGGCAG ATGCTAAAGC GAAGGCCGAA GCAGATGCTA AAGCGAAGGC CGAAGCAGAT
GCTAAAGCTG CGGAAGAAGC AGCGAAGAAA GCGGCTGCAG ACGCAAAGAA AAAAGCAGAA
GCAGAAGCCG CCAAAGCCGC AGCGCAGAAA AAAGCCGAGG CAGCCGCTGC GGCACTGAAG
AAGAAAGCGG AAGCGGCAGA AGCAGCTGCA GCTGAAGCAA GAAAGAAAGC GGCAACTGAA
GCTGCTGAAA AAGCCAAAGC AGAAGCTGAG AAGAAAGCGG CAGCAGAGAA AGCTGCAGCC
GACAAAAAAG CAGCAGAAAA AGCGGCTGCT GAAAAGGCAG CAGCTGATAA GAAAGCAGCG
GCAGAAAAAG CCGCCGCAGA GGCAGATGAT ATTTTCGGTG AGCTAAGCTC TGGTAAGAAT
GCACCGAAAA CGGGGGGAGG GGCGAAAGGG AACAATGCTT CGCCTGCCGG GAGTGGTAAT
ACTAAAAACA ATGGCGCATC AGGGGCCGAT ATCAATAACT ATGCCGGGCA GATTAAATCT
GCTATCGAAA GTAAGTTCTA TGACGCATCG TCCTATGCAG GCAAAACCTG TACGCTGCGC
ATAAAACTGG CACCCGATGG TATGTTACTG GATATCAAAC CTGAAGGTGG CGATCCCGCA
CTTTGTCAGG CTGCGTTGGC AGCAGCTAAA CTTGCGAAGA TCCCGAAACC ACCAAGCCAG
GCAGTATATG AAGTGTTCAA AAACGCGCCA TTGGACTTCA AACCGTAA
 
Protein sequence
MSKATEQNDK LKRAIIISAV LHVILFAALI WSSFDENIEA SAGGGGGSSI DAVMVDSGAV 
VEQYKRMQSQ ESSAKRSDEQ RKMKEQQAAE ELREKQAAEQ ERLKQLEKER LAAQEQKKQA
EEAAKQAELK QKQAEEAAAK AAADAKAKAE ADAKAKAEAD AKAAEEAAKK AAADAKKKAE
AEAAKAAAQK KAEAAAAALK KKAEAAEAAA AEARKKAATE AAEKAKAEAE KKAAAEKAAA
DKKAAEKAAA EKAAADKKAA AEKAAAEADD IFGELSSGKN APKTGGGAKG NNASPAGSGN
TKNNGASGAD INNYAGQIKS AIESKFYDAS SYAGKTCTLR IKLAPDGMLL DIKPEGGDPA
LCQAALAAAK LAKIPKPPSQ AVYEVFKNAP LDFKP