Gene SbBS512_E1649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1649 
SymbolmdoD 
ID6268685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1506047 
End bp1507702 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content51% 
IMG OID641725738 
Productglucan biosynthesis protein D 
Protein accessionYP_001880236 
Protein GI187730116 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3131] Periplasmic glucans biosynthesis protein 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCGTA GACGATTTAT TAAAGGTTCA ATGGCTATGG CCGCCGTGTG CGGTACCAGC 
GGCATTGCTT CTCTTTTTTC TCAGGCGGCA TTCGCGGCAG ATTCTGATAT TGCCGACGGG
CAAACCCAGC GTTTTGACTT CTCCATTCTA CAGTCAATGG CGCACGACTT AGCGCAAACA
GCGTGGCGTG GTGCGCCTCG TCCGTTACCT GACACGCTGG CGACAATGAC GCCGCAGGCT
TATAACAGTA TTCAATACGA CGCCGAAAAA TCGCTCTGGC ATAACGTTGA GAACCGTCAA
CTGGACGCTC AGTTCTTCCA TATGGGAATG GGATTCCGTC GCCGCGTTCG TATGTTTTCT
GTAGATCCAG CAACACATCT GGCGCGTGAA ATTCACTTTC GCCCGGAGTT GTTCAAATAC
AACGATGCGG GTGTTGATAC AAAACAATTA GAAGGGCAAA GCGATCTCGG CTTTGCCGGT
TTTCGCGTGT TTAAAGCCCC CGAACTGGCG CGCCGTGATG TAGTATCATT TCTCGGCGCG
AGTTATTTCC GCGCCGTTGA TGATACATAT CAATACGGTT TGTCGGCCCG CGGCCTGGCG
ATCGACACTT ACACCGACAG TAAAGAAGAG TTCCCCGACT TTACCGCCTT CTGGTTTGAT
ACGGTAAAAC CGGGGGCAAC TACCTTTACC GTTTATGCGT TGCTCGATAG CGCCAGCATT
ACTGGTGCCT ATAAGTTCAC TATCCATTGT GAGAAAAGTC AGGTGATTAT GGATGTGGAA
AATCACCTGT ATGCGCGCAA AGACATTAAA CAGCTGGGCA TTGCGCCGAT GACCAGTATG
TTCAGCTGCG GTACTAATGA ACGTCGGATG TGCGATACAA TTCATCCGCA AATTCATGAC
TCTGATCGTC TGTCCATGTG GCGGGGCAAC GGCGAGTGGA TTTGCCGTCC GCTGAATAAT
CCGCAAAAAT TGCAGTTCAA TGCTTACACC GACAACAACC CGAAAGGGTT TGGTTTATTG
CAACTGGATC GTGACTTCTC CCATTATCAG GACATTATGG GCTGGTATAA CAAACGCCCA
AGTCTGTGGC TGGAACCGCG TAACAAGTGG GGTAAGGGCA CCATCGGCCT GATGGAAATC
CCAACAACGG GCGAAACGCT GGATAACATT GTCTGTTTCT GGCAGCCAGA AAAAGCTGTA
AAAGCAGGTG ATGAGTTTGC ATTCCAGTAT CGTCTGTACT GGAGTGCGCA ACCGCCTGTT
CATTGCCCAT TAGCGCGCGT TATGGCGACG CGTACCGGCA TGGGCGGTTT CCCGGAAGGT
TGGGCGCCAG GTGAACACTA TCCCGAAAAA TGGGCGCGTC GTTTTGCCGT CGATTTCGTT
GGTGGTGATC TGAAAGCTGC CGCGCCAAAA GGCATTGAGC CGGTGATTAC GCTTTCCAGT
GGGGAAGCGA AGCAAATCGA AATTCTCTAT ATTGAACCCA TCGATGGTTA TCGTATTCAG
TTTGACTGGT ATCCGACTTC GGACTCCACT GATCCGGTCG ATATGCGGAT GTATCTACGT
TGTCAGGGGG ACGCTATCAG TGAAACATGG CTGTATCAGT ATTTCCCGCC AGCGTCGGAT
AAACGTCAGT ATGTTGACGA CCGCGTGATG AGTTAA
 
Protein sequence
MDRRRFIKGS MAMAAVCGTS GIASLFSQAA FAADSDIADG QTQRFDFSIL QSMAHDLAQT 
AWRGAPRPLP DTLATMTPQA YNSIQYDAEK SLWHNVENRQ LDAQFFHMGM GFRRRVRMFS
VDPATHLARE IHFRPELFKY NDAGVDTKQL EGQSDLGFAG FRVFKAPELA RRDVVSFLGA
SYFRAVDDTY QYGLSARGLA IDTYTDSKEE FPDFTAFWFD TVKPGATTFT VYALLDSASI
TGAYKFTIHC EKSQVIMDVE NHLYARKDIK QLGIAPMTSM FSCGTNERRM CDTIHPQIHD
SDRLSMWRGN GEWICRPLNN PQKLQFNAYT DNNPKGFGLL QLDRDFSHYQ DIMGWYNKRP
SLWLEPRNKW GKGTIGLMEI PTTGETLDNI VCFWQPEKAV KAGDEFAFQY RLYWSAQPPV
HCPLARVMAT RTGMGGFPEG WAPGEHYPEK WARRFAVDFV GGDLKAAAPK GIEPVITLSS
GEAKQIEILY IEPIDGYRIQ FDWYPTSDST DPVDMRMYLR CQGDAISETW LYQYFPPASD
KRQYVDDRVM S