Gene EcE24377A_1603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_1603 
SymbolmdoD 
ID5587242 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp1596828 
End bp1598483 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content51% 
IMG OID640925292 
Productglucan biosynthesis protein D 
Protein accessionYP_001462697 
Protein GI157157100 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3131] Periplasmic glucans biosynthesis protein 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCGTA GACGATTTAT TAAAGGTTCA ATGGCTATGG CCGCCGTGTG CGGTACCAGC 
GGCATTGCTT CTCTTTTTTC TCAGGCGGCA TTCGCGGCAG ATTCTGATAT TGCCGACGGG
CAAACCCAGC GTTTTGACTT CTCCATTCTA CAGTCAATGG CGCACGACTT AGCGCAAACA
GCGTGGCGTG GTGCGCCTCG TCCGTTACCT GACACGCTGG CGACAATGAC GCCGCAGGCT
TATAACAGTA TTCAATACGA CGCCGAAAAA TCGCTCTGGC ATAACGTTGA GAACCGTCAA
CTGGACGCTC AGTTCTTCCA TATGGGAATG GGATTCCGTC GCCGCGTTCG TATGTTTTCT
GTAGATCCAG CAACACATCT GGCGCGTGAA ATTCACTTTC GCCCGGAGTT GTTCAAATAC
AACGATGCAG GTGTTGATAC CAAACAATTA GAAGGGCAAA GCGATCTCGG CTTTGCCGGT
TTTCGCGTGT TTAAAGCCCC CGAACTGGCG CGCCGTGATG TAGTATCATT CCTCGGAGCG
AGTTATTTCC GCGCCGTTGA TGACACATAT CAATACGGTT TGTCGGCCCG CGGCCTGGCG
ATTGACACTT ACACCGACAG TAAAGAAGAG TTCCCCGACT TTACCGCCTT CTGGTTTGAT
ACGGTAAAAC CGGGGGCAAC TACCTTTACC GTTTATGCGT TGCTCGATAG CGCCAGCATT
ACTGGTGCCT ATAAGTTCAC TATCCATTGT GAGAAAAGTC AGGTGATTAT GGATGTGGAA
AATCACCTGT ATGCGCGCAA AGACATTAAA CAGCTGGGCA TTGCGCCGAT GACCAGTATG
TTCAGCTGCG GTACTAATGA ACGTCGGATG TGCGACACCA TTCATCCGCA AATTCATGAC
TCTGATCGTC TGTCCATGTG GCGGGGCAAC GGCGAGTGGA TTTGCCGTCC GCTGAATAAT
CCGCAAAAAT TGCAGTTCAA TGCTTACACC GACAACAACC CGAAAGGGTT TGGTTTATTG
CAACTGGATC GTGATTTCTC CCATTATCAG GACATTATGG GCTGGTATAA CAAACGCCCA
AGTCTGTGGG TGGAACCGCG TAACAAGTGG GGTAAGGGCA CCATCGGCCT GATGGAAATC
CCAACAACGG GCGAAACGCT GGATAACATT GTCTGCTTCT GGCAGCCAGA AAAAGCTGTA
AAGGCGGGTG ATGAGTTTGC ATTCCAGTAT CGTCTGTACT GGAGTGCGCA ACCGCCTGTT
CATTGCCCAT TAGCGCGCGT TATGGCGACG CGTACCGGCA TGGGCGGTTT CCCGGAAGGT
TGGGCACCAG GTGAACACTA TCCCGAAAAA TGGGCGCGTC GTTTTGCCGT CGATTTCGTT
GGTGGTGATC TGAAAGCTGC CGCACCAAAA GGCATTGAGC CGGTGATTAC GCTTTCCAGT
GGGGAAGCGA AGCAAATCGA AATTCTCTAT ATTGAACCCA TTGATGGTTA TCGTATTCAG
TTTGACTGGT ATCCGACTTC GGACTCCACT GATCCGGTCG ATATGCGGAT GTATCTGCGT
TGTCAGGGCG ACGCTATCAG TGAAACATGG CTGTATCAGT ATTTCCCGCC AGCGCCGGAT
AAACGTCAGT ATGTTGACGA CCGCGTGATG AGTTAA
 
Protein sequence
MDRRRFIKGS MAMAAVCGTS GIASLFSQAA FAADSDIADG QTQRFDFSIL QSMAHDLAQT 
AWRGAPRPLP DTLATMTPQA YNSIQYDAEK SLWHNVENRQ LDAQFFHMGM GFRRRVRMFS
VDPATHLARE IHFRPELFKY NDAGVDTKQL EGQSDLGFAG FRVFKAPELA RRDVVSFLGA
SYFRAVDDTY QYGLSARGLA IDTYTDSKEE FPDFTAFWFD TVKPGATTFT VYALLDSASI
TGAYKFTIHC EKSQVIMDVE NHLYARKDIK QLGIAPMTSM FSCGTNERRM CDTIHPQIHD
SDRLSMWRGN GEWICRPLNN PQKLQFNAYT DNNPKGFGLL QLDRDFSHYQ DIMGWYNKRP
SLWVEPRNKW GKGTIGLMEI PTTGETLDNI VCFWQPEKAV KAGDEFAFQY RLYWSAQPPV
HCPLARVMAT RTGMGGFPEG WAPGEHYPEK WARRFAVDFV GGDLKAAAPK GIEPVITLSS
GEAKQIEILY IEPIDGYRIQ FDWYPTSDST DPVDMRMYLR CQGDAISETW LYQYFPPAPD
KRQYVDDRVM S