Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2081 |
Symbol | mdoG |
ID | 6143444 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2096427 |
End bp | 2097980 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641616957 |
Product | glucan biosynthesis protein G |
Protein accession | YP_001744133 |
Protein GI | 170682373 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3131] Periplasmic glucans biosynthesis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.242328 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACATA AACTACAAAT GATGAAAATG CGTTGGTTGA GTGCTGCAGT AATGTTAACC CTGTATACAT CTTCAAGCTG GGCTTTCAGT ATTGATGATG TCGCAAAGCA AGCTCAATCC TTAGCCGGGA AAGGCTATGA GGCGCCCAAA AGCAACTTGC CCTCCGTTTT CCGCGACATG AAATACGCGG ACTATCAGCA GATCCAGTTT AATCATGACA AAGCGTACTG GAACAATCTG AAGACCCCAT TCAAACTCGA GTTCTACCAT CAGGGTATGT ACTTCGATAC CCCGGTCAAA ATAAATGAAG TGACTGCCAC CGCAGTCAAA CGAATCAAAT ACAGCCCGGA TTATTTCACT TTCGGCGATG TTCAGCATGA CAAAGACACG GTAAAAGACC TTGGTTTTGC CGGTTTTAAA GTGCTTTACC CGATCAACAG CAAAGATAAA AACGATGAAA TCGTCAGCAT GCTCGGGGCC AGCTATTTCC GCGTGATTGG TGCAGGTCAG GTTTATGGCC TTTCTGCCCG CGGCCTGGCA ATTGATACCG CCTTGCCATC GGGTGAAGAA TTTCCACGCT TCAAAGAGTT CTGGATCGAG CGTCCAAAAC CGACTGATAA ACGTTTAACC ATTTATGCAT TGCTTGACTC GCCGCGCGCG ACAGGTGCTT ACAAATTCGT GGTTATGCCA GGGCGTGACA CGGTTGTGGA TGTGCAGTCG AAAATCTATC TGCGCGATAA AGTCGGCAAA CTGGGGGTTG CACCGTTAAC CAGTATGTTC CTGTTTGGGC CGAACCAACC GTCGCCTGCA AATAACTATC GTCCGGAGTT GCACGACTCT AACGGCCTGT CTATCCATGC TGGTAATGGC GAATGGATCT GGCGTCCGTT GAATAACCCG AAACATTTAG CGGTCAGCAG CTTCTCGATG GAAAACCCGC AAGGCTTCGG TCTGTTGCAG CGCGGTCGTG ATTTCTCCCG CTTTGAAGAT CTCGATGATC GTTACGATCT TCGTCCGAGC GCATGGGTGA CTCCGAAAGG GGAGTGGGGC AAAGGCAGCG TTGAGCTGGT GGAAATTCCA ACCAACGATG AAACCAACGA TAACATCGTC GCTTACTGGA CGCCGGATCA GCTGCCGGAG CCGGGTAAAG AGATGAACTT TAAATACACC ATCACCTTCA GCCGTGATGA AGACAAACTG CATGCACCAG ATAACGCATG GGTGCAACAA ACGCGTCGTT CAACGGGGGA TGTGAAGCAG TCGAACCTGA TTCGCCAGCC TGACGGTACT ATCGCCTTTG TGGTCGATTT TACCGGCGCA GAGATGAAAA AACTGCCAGA GGATACCCCG GTCACAGCGC AAACCAGCAT TGGTGATAAT GGTGAGATAG TTGAAAGCAC GGTGCGCTAT AACCCGGTTA CCAAAGGCTG GCGTCTGGTG ATGCGTGTGA AAGTGAAAGA TGCCAAGAAA ACCACTGAAA TGCGTGCTGC GCTGGTGAAT GCCGATCAGA CGTTGAGTGA AACCTGGAGC TACCAGTTAC CTGCCAATGA ATAA
|
Protein sequence | MKHKLQMMKM RWLSAAVMLT LYTSSSWAFS IDDVAKQAQS LAGKGYEAPK SNLPSVFRDM KYADYQQIQF NHDKAYWNNL KTPFKLEFYH QGMYFDTPVK INEVTATAVK RIKYSPDYFT FGDVQHDKDT VKDLGFAGFK VLYPINSKDK NDEIVSMLGA SYFRVIGAGQ VYGLSARGLA IDTALPSGEE FPRFKEFWIE RPKPTDKRLT IYALLDSPRA TGAYKFVVMP GRDTVVDVQS KIYLRDKVGK LGVAPLTSMF LFGPNQPSPA NNYRPELHDS NGLSIHAGNG EWIWRPLNNP KHLAVSSFSM ENPQGFGLLQ RGRDFSRFED LDDRYDLRPS AWVTPKGEWG KGSVELVEIP TNDETNDNIV AYWTPDQLPE PGKEMNFKYT ITFSRDEDKL HAPDNAWVQQ TRRSTGDVKQ SNLIRQPDGT IAFVVDFTGA EMKKLPEDTP VTAQTSIGDN GEIVESTVRY NPVTKGWRLV MRVKVKDAKK TTEMRAALVN ADQTLSETWS YQLPANE
|
| |