Gene Information |
Locus tag | ECH74115_1428 |
Symbol | mdoG |
ID | 6971733 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1412044 |
End bp | 1413597 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643385402 |
Product | glucan biosynthesis protein G |
Protein accession | YP_002269896 |
Protein GI | 209398428 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3131] Periplasmic glucans biosynthesis protein |
TIGRFAM ID | |
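The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the listed values are consistent with it) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not contribute an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1412044, 1413597)
print(length)                     # 1554, matching "Gene Length"
print(protein_length_aa(length))  # 517, matching "Protein Length"
```

Because the record lists the gene as unspliced and on the + strand, no intron or reverse-complement handling is needed for this check.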
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0037891 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.223049 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACATA AACTACAAAT GATGAAAATG CGTTGGTTGA GTGCTGCAGT AATGTTAACC CTGTATACAT CTTCAAGCTG GGCTTTCAGT ATTGATGATG TCGCAAAGCA AGCTCAATCC TTAGCCGGGA AAGGCTATGA GGCGCCCAAA AGCAACTTGC CCTCCGTTTT CCGCGATATG AAATACGCGG ACTATCAGCA GATCCAGTTT AATCATGACA AAGCGTACTG GAACAATCTG AAGACCCCAT TCAAACTCGA GTTCTACCAT CAGGGTATGT ACTTCGATAC CCCGGTCAAA ATAAATGAAG TGACTGCCAC CGCAGTCAAA CGAATCAAAT ACAGCCCGGA TTATTTCACT TTCGGCGATG TTCAGCATGA CAAAGACACG GTAAAAGACC TTGGTTTTGC CGGTTTTAAA GTGCTTTACC CGATCAACAG CAAAGATAAA AACGATGAAA TCGTCAGCAT GCTCGGGGCC AGCTATTTCC GCGTGATTGG TGCAGGTCAG GTTTATGGCC TTTCTGCCCG CGGTCTGGCA ATTGATACCG CCTTGCCATC GGGTGAAGAA TTTCCACGCT TCAAAGAGTT CTGGATCGAG CGTCCAAAAC CAACTGATAA ACGTTTAACC ATTTATGCAT TGCTTGACTC GCCGCGCGCG ACAGGTGCTT ACAAATTCGT AGTTATGCCA GGGCGTGACA CGGTTGTGGA TGTGCAGTCG AAAATCTATC TGCGCGATAA AGTCGGCAAA CTGGGGGTTG CACCGTTAAC CAGTATGTTC CTGTTTGGGC CGAACCAACC GTCGCCAGCA AATAACTATC GTCCGGAGTT GCACGACTCT AACGGTCTCT CTATCCATGC CGGTAATGGC GAATGGATCT GGCGTCCGTT GAATAACCCG AAACATTTAG CGGTCAGCAG CTTCTCCATG GAAAACCCGC AAGGCTTTGG TCTGTTGCAG CGCGGTCGTG ATTTCTCCCG CTTTGAAGAT CTCGATGATC GTTACGATCT CCGTCCAAGC GCATGGGTGA CTCCGAAAGG GGAGTGGGGT AAAGGCAGCG TTGAGCTGGT GGAAATTCCA ACCAACGATG AAACTAACGA TAACATCGTC GCTTACTGGA CGCCGGATCA GCTGCCGGAG CCGGGTAAAG AGATGAACTT TAAATACACC ATCACCTTCA GCCGTGATGA AGACAAACTG CATGCGCCAG ATAACGCATG GGTGCAACAA ACGCGTCGTT CAACGGGGGA TGTGAAGCAG TCGAACCTGA TTCGCCAGCC TGACGGTACT ATCGCCTTTG TGGTCGATTT TACCGGCGCA GAGATGAAAA AACTGCCAGA GGATACCCCG GTCACAGCGC AAACCAGCAT TGGTGATAAT GGTGAGATAG TTGAAAGCAC GGTGCGCTAT AACCCGGTCA CCAAAGGCTG GCGTCTGGTG ATGCGTGTGA AAGTGAAAGA TGCCAAGAAA ACCACTGAAA TGCGCGCTGC GCTGGTGAAT GCCGATCAGA CGTTGAGTGA AACCTGGAGC TACCAGTTAC CTGCCAATGA ATAA
|
Protein sequence | MKHKLQMMKM RWLSAAVMLT LYTSSSWAFS IDDVAKQAQS LAGKGYEAPK SNLPSVFRDM KYADYQQIQF NHDKAYWNNL KTPFKLEFYH QGMYFDTPVK INEVTATAVK RIKYSPDYFT FGDVQHDKDT VKDLGFAGFK VLYPINSKDK NDEIVSMLGA SYFRVIGAGQ VYGLSARGLA IDTALPSGEE FPRFKEFWIE RPKPTDKRLT IYALLDSPRA TGAYKFVVMP GRDTVVDVQS KIYLRDKVGK LGVAPLTSMF LFGPNQPSPA NNYRPELHDS NGLSIHAGNG EWIWRPLNNP KHLAVSSFSM ENPQGFGLLQ RGRDFSRFED LDDRYDLRPS AWVTPKGEWG KGSVELVEIP TNDETNDNIV AYWTPDQLPE PGKEMNFKYT ITFSRDEDKL HAPDNAWVQQ TRRSTGDVKQ SNLIRQPDGT IAFVVDFTGA EMKKLPEDTP VTAQTSIGDN GEIVESTVRY NPVTKGWRLV MRVKVKDAKK TTEMRAALVN ADQTLSETWS YQLPANE
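The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The sketch below is illustrative only: it uses plain Python rather than a bioinformatics library, the helper names are hypothetical, and it relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). Whitespace is stripped first because the record prints the sequence in blocks of ten bases.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
print(translate("ATGAAACATA AACTACAAAT GATGAAAATG"))  # MKHKLQMMKM
```

Passing the full gene sequence to `translate` and `gc_fraction` should allow a comparison with the "Protein sequence" and "GC content" fields; the output for the full sequence is not shown here.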