Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_2597 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | - |
Start bp | 2768869 |
End bp | 2770422 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | |
Product | periplasmic glucan biosynthesis protein MdoG |
Protein accession | ACX40233 |
Protein GI | 260449811 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000000403052 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACATA AACTACAAAT GATGAAAATG CGTTGGTTGA GTGCTGCAGT AATGTTAACC CTGTATACAT CTTCAAGCTG GGCTTTCAGT ATTGATGATG TCGCAAAGCA AGCTCAATCT TTAGCCGGGA AAGGCTACGA GACGCCCAAA AGCAACTTGC CCTCCGTTTT CCGCGATATG AAATACGCGG ACTATCAGCA GATCCAGTTT AATCATGACA AAGCGTACTG GAACAATCTG AAGACCCCAT TCAAACTCGA GTTCTACCAT CAGGGTATGT ACTTCGATAC CCCGGTCAAA ATAAATGAAG TGACTGCCAC CGCAGTCAAA CGAATCAAAT ACAGCCCGGA TTATTTCACT TTCGGCGATG TTCAGCATGA CAAAGATACG GTAAAAGACC TTGGCTTTGC CGGTTTTAAA GTGCTTTACC CGATCAACAG CAAAGATAAA AACGATGAAA TCGTCAGCAT GCTCGGGGCC AGCTATTTCC GCGTGATTGG TGCAGGTCAG GTTTATGGCC TTTCTGCCCG CGGCCTGGCA ATTGATACCG CCTTGCCATC GGGTGAAGAA TTTCCGCGCT TCAAAGAGTT CTGGATCGAG CGTCCAAAAC CGACTGATAA ACGTTTAACC ATCTATGCAT TGCTTGACTC GCCGCGTGCG ACAGGTGCTT ACAAATTCGT GGTTATGCCA GGGCGTGACA CGGTTGTGGA TGTGCAGTCG AAAATCTATC TGCGCGATAA AGTCGGCAAA CTGGGGGTTG CACCGTTAAC CAGTATGTTC CTGTTTGGGC CGAACCAACC GTCGCCTGCA AATAACTATC GTCCGGAGTT GCACGACTCT AACGGTCTCT CTATCCATGC CGGTAATGGC GAATGGATCT GGCGTCCGTT GAATAACCCG AAACATTTAG CGGTCAGCAG CTTCTCCATG GAAAACCCGC AAGGCTTTGG TCTGTTGCAG CGCGGTCGTG ATTTCTCCCG CTTTGAAGAT CTCGATGATC GTTACGATCT CCGTCCAAGC GCATGGGTGA CTCCGAAAGG GGAGTGGGGT AAAGGCAGCG TTGAGCTGGT GGAAATTCCA ACCAACGATG AAACCAACGA TAACATCGTC GCTTACTGGA CGCCGGATCA GCTGCCGGAG CCGGGTAAAG AGATGAACTT TAAATACACC ATCACCTTCA GCCGTGATGA AGACAAACTG CATGCGCCAG ATAACGCATG GGTGCAACAA ACGCGTCGTT CAACGGGGGA TGTGAAGCAG TCGAACCTGA TTCGCCAGCC TGACGGTACT ATCGCCTTTG TGGTCGATTT TACCGGCGCA GAGATGAAAA AACTGCCAGA GGATACCCCG GTCACAGCGC AAACCAGCAT TGGTGATAAT GGTGAGATAG TTGAAAGCAC GGTGCGCTAT AACCCGGTTA CCAAAGGCTG GCGTCTGGTG ATGCGTGTGA AAGTGAAAGA TGCCAAGAAA ACCACTGAAA TGCGTGCTGC GCTGGTGAAT GCCGATCAGA CGTTGAGTGA AACCTGGAGC TACCAGTTAC CTGCCAATGA ATAA
|
Protein sequence | MKHKLQMMKM RWLSAAVMLT LYTSSSWAFS IDDVAKQAQS LAGKGYETPK SNLPSVFRDM KYADYQQIQF NHDKAYWNNL KTPFKLEFYH QGMYFDTPVK INEVTATAVK RIKYSPDYFT FGDVQHDKDT VKDLGFAGFK VLYPINSKDK NDEIVSMLGA SYFRVIGAGQ VYGLSARGLA IDTALPSGEE FPRFKEFWIE RPKPTDKRLT IYALLDSPRA TGAYKFVVMP GRDTVVDVQS KIYLRDKVGK LGVAPLTSMF LFGPNQPSPA NNYRPELHDS NGLSIHAGNG EWIWRPLNNP KHLAVSSFSM ENPQGFGLLQ RGRDFSRFED LDDRYDLRPS AWVTPKGEWG KGSVELVEIP TNDETNDNIV AYWTPDQLPE PGKEMNFKYT ITFSRDEDKL HAPDNAWVQQ TRRSTGDVKQ SNLIRQPDGT IAFVVDFTGA EMKKLPEDTP VTAQTSIGDN GEIVESTVRY NPVTKGWRLV MRVKVKDAKK TTEMRAALVN ADQTLSETWS YQLPANE
|
| |