Gene EcHS_A1506 details

Gene Information

Locus tag: EcHS_A1506
Symbol: mdoD
ID: 5595300
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1517285
End bp: 1518940
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 51%
IMG OID: 640920663
Product: glucan biosynthesis protein D
Protein accession: YP_001458219
Protein GI: 157160901
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3131] Periplasmic glucans biosynthesis protein
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
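
The coordinate and length fields in this record are related by simple arithmetic. The sketch below shows that relationship, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the database schema.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1517285
end_bp = 1518940

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS is a whole number of codons, the last being the stop codon.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1  # the stop codon encodes no amino acid

print(gene_length, remainder, protein_length)  # 1656 0 551
```

Both results agree with the Gene Length (1656 bp) and Protein Length (551 aa) fields.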


Plasmid Coverage information

Num covering plasmid clones: 57
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATCGTA GACGATTTAT TAAAGGTTCA ATGGCTATGG CCGCCGTGTG CGGTACCAGC 
GGCATTGCTT CTCTTTTTTC TCAGGCGGCA TTCGCGGCAG ATTCTGATAT TGCCGACGGG
CAAACCCAGC GTTTTGACTT CTCCATTCTA CAGTCAATGG CGCACGACTT AGCGCAAACA
GCGTGGCATG GTGCGCCTCG TCCGTTACCT GACACGCTGG CGACAATGAC GCCGCAGGCT
TATAACAGTA TTCAATACGA CGCCGAAAAA TCGCTCTGGC ATAACGTTGA GAACCGTCAA
CTGGACGCTC AGTTCTTCCA TATGGGAATG GGATTCCGTC GCCGCGTTCG TATGTTTTCT
GTAGATCCAG CAACACATCT GGCGCGTGAA ATTCACTTTC GCCCGGAGTT GTTCAAATAC
AACGATGCAG GTGTTGATAC CAAACAATTA GAAGGGCAAA GCGATCTCGG CTTTGCCGGT
TTTCGCGTGT TTAAAGCCCC CGAACTGGCG CGCCGTGATG TAGTATCATT TCTCGGCGCG
AGTTATTTCC GCGCCGTTGA TGATACATAT CAATACGGTT TGTCGGCCCG CGGCCTGGCG
ATCGACACTT ACACCGACAG TAAAGAAGAG TTCCCCGACT TTACCGCCTT CTGGTTTGAT
ACGGTAAAAC CGGGGGCAAC TACCTTTACC GTTTATGCGT TGCTCGATAG CGCCAGCATT
ACTGGTGCCT ATAAGTTCAC TATCCATTGT GAGAAAAGTC AGGTGATTAT GGATGTGGAA
AATCACCTGT ATGCGCGCAA AGACATTAAA CAGCTGGGCA TTGCGCCGAT GACCAGTATG
TTCAGCTGCG GTACTAATGA ACGTCGGATG TGCGATACAA TTCATCCGCA AATTCATGAC
TCTGATCGTC TGTCCATGTG GCGGGGCAAC GGCGAGTGGA TTTGCCGTCC GCTGAATAAT
CCGCAAAAAT TGCAGTTCAA TGCTTACACC GACAACAACC CGAAAGGGTT TGGTTTATTG
CAACTGGATC GTGACTTCTC CCATTATCAG GACATTATGG GCTGGTATAA CAAACGCCCA
AGTCTGTGGG TGGAACCGCG TAACAAGTGG GGTAAGGGCA CCATCGGCCT GATGGAAATC
CCAACAACGG GCGAAACGCT GGATAACATT GTCTGCTTCT GGCAGCCAGA AAAAGCTGTA
AAAGCAGGTG ATGAGTTTGC ATTCCAGTAT CGTCTGTACT GGAGTGCGCA ACCGCCTGTT
CATTGCCCAT TAGCGCGCGT TATGGCGACG CGTACCGGCA TGGGCGGTTT CCCGGAAGGT
TGGGCGCCAG GTGAACACTA TCCCGAAAAA TGGGCGCGTC GTTTTGCCGT CGATTTCGTT
GGTGGTGATC TGAAAGCTGC CGCGCCAAAA GGCATTGAGC CGGTGATTAC GCTGTCCAGT
GGGGAAGCGA AGCAAATCGA AATTCTCTAT ATTGAACCCA TTGATGGTTA TCGTATTCAG
TTTGACTGGT ATCCGACGTC GGACTCCACT GATCCGGTCG ATATGCGGAT GTATCTGCGT
TGTCAGGGGG ACGCTATCAG TGAAACATGG CTGTATCAGT ATTTCCCGCC AGCGCCGGAT
AAACGTCAGT ATGTTGACGA CCGCGTGATG AGTTAA
 
Protein sequence
MDRRRFIKGS MAMAAVCGTS GIASLFSQAA FAADSDIADG QTQRFDFSIL QSMAHDLAQT 
AWHGAPRPLP DTLATMTPQA YNSIQYDAEK SLWHNVENRQ LDAQFFHMGM GFRRRVRMFS
VDPATHLARE IHFRPELFKY NDAGVDTKQL EGQSDLGFAG FRVFKAPELA RRDVVSFLGA
SYFRAVDDTY QYGLSARGLA IDTYTDSKEE FPDFTAFWFD TVKPGATTFT VYALLDSASI
TGAYKFTIHC EKSQVIMDVE NHLYARKDIK QLGIAPMTSM FSCGTNERRM CDTIHPQIHD
SDRLSMWRGN GEWICRPLNN PQKLQFNAYT DNNPKGFGLL QLDRDFSHYQ DIMGWYNKRP
SLWVEPRNKW GKGTIGLMEI PTTGETLDNI VCFWQPEKAV KAGDEFAFQY RLYWSAQPPV
HCPLARVMAT RTGMGGFPEG WAPGEHYPEK WARRFAVDFV GGDLKAAAPK GIEPVITLSS
GEAKQIEILY IEPIDGYRIQ FDWYPTSDST DPVDMRMYLR CQGDAISETW LYQYFPPAPD
KRQYVDDRVM S
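
The protein sequence follows from the gene sequence by codon translation, and the GC content is the share of G and C bases. The sketch below is a minimal, dependency-free illustration of both, not the database's own pipeline. It assumes the sequence is given 5' to 3' on the coding strand and uses the standard amino-acid assignments, which translation table 11 shares with table 1. Table 11 also allows alternative start codons, which this sketch ignores because this gene begins with ATG. The function names `translate` and `gc_percent` are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used in the listing and upper-case the bases."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Return the percentage of bases that are G or C."""
    seq = clean(dna)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base groups of the gene sequence listed above.
print(translate("ATGGATCGTA GACGATTTAT TAAAGGTTCA"))  # MDRRRFIKGS
```

Passing the full gene sequence to `translate` should reproduce the protein listing, and `gc_percent` on the same input can be compared with the GC content field.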