Gene SbBS512_E4237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4237 
Symbol 
ID6268421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3960963 
End bp3962711 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content52% 
IMG OID641728056 
ProductPTS system, alpha-glucoside-specific IIBC component 
Protein accessionYP_001882477 
Protein GI187730493 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
[COG1264] Phosphotransferase system IIB components 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR00852] PTS system, maltose and glucose-specific subfamily, IIC component
[TIGR02005] PTS system, alpha-glucoside-specific IIBC component 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTCGATG CTGTGTACCA GTGCTCACAG ATGTCTACTT TTTCGCGAAA ACGTAGATCT 
CTACCGCCCA ACGAAAAGCA TGAAAGCGAT CACGAATCCC ATTGGGTCGT CATGTTCTGC
CGATCGCATC TTCCTATCCT CGCTCCAGGC CTGCCGCATA ACCAATCAGG CTTCCTACTT
ACAGAATTGA GAAAAGAGGA TGTGGAAATG CTCAGTCAAA TTCAACGCTT TGGCGGCGCG
ATGTTCACGC CAGTGCTGCT GTTTCCCTTC GCCGGGATTG TGGTGGGTCT TGCCATCTTG
CTGCAAAACC CGATGTTTGT CGGGGAATCA CTGACCGATC CGAACAGTTT ATTCGCGCAA
ATCGTACACA TTATTGAAGA GGGCGGTTGG ACGGTATTCC GTAATATGCC GCTGATTTTT
GCTGTCGGTT TACCCATTGG CCTTGCTAAG CAAGCGCAGG GGCGTGCTTG TCTGGCGGTG
ATGGTGAGTT TCCTGACCTG GAACTATTTC ATCAACGCGA TGGGAATGAC CTGGGGAAGC
TACTTCGGCG TCGATTTCAC TCAGGACGCG GTGGCAGGTA GCGGTCTGAC AATGATGGCC
GGGATTAAAA CCCTCGATAC CAGCATTATC GGCGCAATTA TCATTTCCGG CATTGTGACG
GCGCTGCATA ACCGTCTGTT CGATAAAAAA CTGCCGGTTT TTCTCGGCAT TTTCCAGGGG
ACGTCTTATG TGGTGATTAT CGCCTTCCTG GTGATGATCC CCTGTGCCTG GCTGACGTTG
CTCGGCTGGC CAAAAGTACA AATGGGGATT GAATCTCTGC AAGCGTTCCT GCGTTCGGCG
GGTGCACTTG GGGTGTGGGT TTACACCTTC CTCGAACGTA TTCTGATCCC AACCGGTTTA
CACCACTTCA TCTACGGACA GTTTATCTTT GGTCCGGCAG CTGTTGAAGG CGGCATTCAG
ATGTACTGGG CGCAGCATCT GCAAGAGTTC AGTCTGAGCG CCGAGCCGCT GAAATCGTTG
TTCCCGGAAG GCGGTTTTGC CCTGCACGGT AACTCAAAAA TCTTTGGTGC CGTGGGCATT
TCTTTAGCGA TGTACTTCAC TGCCGCACCG GAAAATCGGG TAAAAGTGGC GGGCTTGCTG
ATTCCCGCAA CCTTAACCGC CATGCTGGCG GCCTCAATGT CGACCGTGAT GTATCTCTTT
GGTGTGGTGG GCAACATGGG CGGAGGTCTG ATTGACCAGG TTTTACCGCA AAACTGGATC
CCGATGTTCA GCAACCACGC GGATATGATG CTGACCCAAA TCGCCATTGG GTTGTGCTTT
ACCCTGCTGT ACTTCGTGGT TTTCCGCACA CTGATTCTGC AATTCAACAT GTGCACGCCG
GGACGTGAAG ATGCGGAAGT GAAACTCTAC TCAAAAGCCG AATACAAAGC CTCGCGAGGC
CAAACCACCG CTGCAGAGCC AAAAAAAGAG CTGGATCAGG CTGCCGGTAT CCTGCAAGCC
CTGGGCGGGG TCGGCAATAT CTCCAGCATT AACAATTGCG CGACGCGTTT ACGTATTGCA
CTGCATGACA TGTCACAAAC GCTGGATGAC GAAGTCTTTA AAAAGCTGGG AGCGCACGGC
GTCTTCCGTA GTGGCGATGC CATTCAGGTG ATCATTGGTC TGCATGTATC CCAGCTGCGT
GAACAGCTCG ATAGCTTAAT TAATTCTCAT CAATCAGCAG AAAATGTTGC CATTACGGAG
GCAGTATAA
 
Protein sequence
MFDAVYQCSQ MSTFSRKRRS LPPNEKHESD HESHWVVMFC RSHLPILAPG LPHNQSGFLL 
TELRKEDVEM LSQIQRFGGA MFTPVLLFPF AGIVVGLAIL LQNPMFVGES LTDPNSLFAQ
IVHIIEEGGW TVFRNMPLIF AVGLPIGLAK QAQGRACLAV MVSFLTWNYF INAMGMTWGS
YFGVDFTQDA VAGSGLTMMA GIKTLDTSII GAIIISGIVT ALHNRLFDKK LPVFLGIFQG
TSYVVIIAFL VMIPCAWLTL LGWPKVQMGI ESLQAFLRSA GALGVWVYTF LERILIPTGL
HHFIYGQFIF GPAAVEGGIQ MYWAQHLQEF SLSAEPLKSL FPEGGFALHG NSKIFGAVGI
SLAMYFTAAP ENRVKVAGLL IPATLTAMLA ASMSTVMYLF GVVGNMGGGL IDQVLPQNWI
PMFSNHADMM LTQIAIGLCF TLLYFVVFRT LILQFNMCTP GREDAEVKLY SKAEYKASRG
QTTAAEPKKE LDQAAGILQA LGGVGNISSI NNCATRLRIA LHDMSQTLDD EVFKKLGAHG
VFRSGDAIQV IIGLHVSQLR EQLDSLINSH QSAENVAITE AV