Gene SbBS512_E1677 details

Gene Information

Locus tag: SbBS512_E1677
Symbol: sotB
ID: 6271473
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 1526656
End bp: 1527846
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 52%
IMG OID: 641725762
Product: sugar efflux transporter
Protein accession: YP_001880260
Protein GI: 187732169
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00880] Multidrug resistance protein
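
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, though it matches the reported length) and that the coding sequence ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS of the given length, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(1526656, 1527846)
print(length)                  # 1191
print(protein_length(length))  # 396
```

Both values agree with the Gene Length and Protein Length fields in the record.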


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAACAA ACACTGTTTC CCGCAAAGTG GCGTGGCTAC GGGTCGTTAC GCTGGCAGTC 
GCCGCCTTCA TCTTCAACAC CACCGAATTT GTCCCTGTTG GCCTGCTCTC TGACATTGCG
CAAAGTTTTC ACATGCAAAC CGCTCAGGTC GGCATCATGT TGACCATTTA CGCATGGGTA
GTAGCGCTAA TGTCATTGCC TTTTATGTTA ATGACCAGTC AGGTTGAACG GCGCAAATTA
CTGATCTGCC TGTTTGTGGT GTTTATTGCC AGCCACGTAC TGTCGTTTTT ATCGTGGAGC
TTTACCGTTC TGGTGATCAG TCGCATTGGT GTGGCTTTTG CACATGCGAT TTTCTGGTCG
ATTACGGCGT CTCTGGCGAT CCGTATGGCT CCGGCCGGGA AGCGGGCACA GGCCTTGAGT
TTAATTGCCA CCGGTACAGC ACTGGCGATG GTCTTAGGTT TACCTCTCGG GCGCATTGTG
GGGCAGTATT TCGGTTGGCG AATGACCTTC TTCGCGATTG GTATTGGGGC GCTTATCACC
CTTTTGTGCC TGATTAAGTT ACTTCCCTTA CTGCCCAGTG AGCATTCCGG TTCATTGAAA
AGCCTCCCGC TATTGTTCCG CCGCCCGGCA TTGATGAGCA TTTATTTGTT AACTGTGGTG
GTTGTCACCG CCCATTACAC GGCATACAGC TATATCGAGC CTTTTGTGCA AAACATTGCG
GGATTCAGCG CCAACTTTGC CACGGCATTA CTGTTATTAC TCGGTGGTGC GGGCATTATT
GGCAGCGTGA TTTTCGGTAA ACTGGGTAAT CAGTATGCGT CTGCGTTGGT GAGTACGGCG
ATTGCGCTGT TGCTGGTGTG TCTGGCACTG CTGCTACCTG CGGCGAACAG TGAAATACAC
CTCGGGGTGC TGAGTATTTT CTGGGGGATC GCGATGATGC TCATCGGGCT TGGTATGCAG
GTTAAAGTGC TGGCGCTGGC ACCAGATGCT ACCGACGTCG CGATGGCGCT ATTCTCCGGC
ATATTTAATA TTGGAATCGG GGCGGGTGCG TTGGTAGGTA ATCAGGTGAG TTTGCACTGG
TCAATGTCGA TGATTGGTTA TGTGGGCGCG GTGCCTGCTT TTGCCGCGTT AATATGGTCA
ATCATTATAT TTCGCCGCTG GCCAGTGACA CTCGAAGAAC AGACGCAATA G
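
The gene sequence is printed in blocks of 10 bases separated by spaces, so it has to be cleaned before any computation. The sketch below shows one way to strip the layout and compute GC content as a percentage. The helper names are hypothetical, and the example input is only the first line of the sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGACAACAA ACACTGTTTC CCGCAAAGTG GCGTGGCTAC GGGTCGTTAC GCTGGCAGTC"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_content(first_line), 1))
```

Passing the complete sequence to `gc_content` gives a value that can be compared with the GC content field of the record.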
 
Protein sequence
MTTNTVSRKV AWLRVVTLAV AAFIFNTTEF VPVGLLSDIA QSFHMQTAQV GIMLTIYAWV 
VALMSLPFML MTSQVERRKL LICLFVVFIA SHVLSFLSWS FTVLVISRIG VAFAHAIFWS
ITASLAIRMA PAGKRAQALS LIATGTALAM VLGLPLGRIV GQYFGWRMTF FAIGIGALIT
LLCLIKLLPL LPSEHSGSLK SLPLLFRRPA LMSIYLLTVV VVTAHYTAYS YIEPFVQNIA
GFSANFATAL LLLLGGAGII GSVIFGKLGN QYASALVSTA IALLLVCLAL LLPAANSEIH
LGVLSIFWGI AMMLIGLGMQ VKVLALAPDA TDVAMALFSG IFNIGIGAGA LVGNQVSLHW
SMSMIGYVGA VPAFAALIWS IIIFRRWPVT LEEQTQ
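
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below builds the codon table from the compact NCBI layout (bases in the order T, C, A, G). Translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons. That start-codon handling is not implemented here, which does not matter for this gene because it starts with ATG. The function name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each codon (TTT, TTC, TTA, ...) to its amino acid letter.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGACAACAA ACACTGTTTC CCGCAAAGTG"))  # MTTNTVSRKV
```

The output matches the first ten residues of the protein sequence above.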