Gene Information |
Locus tag | Mchl_3081 |
Symbol | mdoG |
ID | 7118359 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | - |
Start bp | 3261454 |
End bp | 3263130 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643525832 |
Product | glucan biosynthesis protein G |
Protein accession | YP_002421847 |
Protein GI | 218531031 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3131] Periplasmic glucans biosynthesis protein |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
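The start and end coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3261454
end_bp = 3263130

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed gene length (1677 bp) and protein length (558 aa), and the remainder of zero indicates a whole number of codons.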
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0188456 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCAGGAC CGTCGGACGC GGACAACGAG GCGCCCGAAG AAATCGACAT TTCGCGGCGC GCCATCGTCC GTGGCGCTTT TCGAGGTGCT CTTGCCGGTA CGGCCGGCTT GAGCATTCCT GGCCTCTTCG ACCTGAATGC CGCCAGAACG ATGGCTCAAA CGGTGCCAAA CGCCTCACCC CCAGCGCCCC TTGATGCGCG TCCGGTCACC TTCGAAGCGG TGGCGGCACG GGCCGAGCGG CTGGCGGCGC AGGTTTACGC GCCGCAGAAC AATCCCCTAC CGCCGGAACT GGCCGCGCTC GACTACGACG CCTTCCAGGC GATCCGCTTC CGATCCGAGG CGACGGTCCC GCTCGGGCCC CGCTTCTCGA TGCAGCCGTT CCACCGCGGC AACCTGCACG CCAAGCGGGT GGAGATCTTT CTGCAATCGC CCGAGGGCGT GCGCCCCTTC GGCTACGATC CTGACTTGTT CGATCTCGGA CCGGCCTTGC AGGGCCGCCG CTATCCCGCC TCGCTCGGCT ATGCCGGTTT TCGCATCGCC CACGGGTTCG ACGCGAGCCA GCCGCAGGCC CACGAGGAAT TCCTGGTCTT CCTCGGTGCC TCCTACTTCC GCATCCGCGG TCGGGGGCAG ATCTACGGCC TCTCGGCCCG CGGCATCGCC GTGAATACGG GATTGCCGCA GGGCGAGGAG TTTCCCGATT TCACCAGCTT CTGGATCGAG GTACCGCAGG GCGACGTGGC GTCGATCACG ATTTTCGCGC TGCTCGACGG TCCGAGCCTG ACGGGCGCCT ACCGCTTCGC CGTCACGCCG GGCGATCCAT CCCATGTCGC GGTCGAAGCC GCGCTGTTTC CGCGCCGTTC GATCGCGGCA CTCGGCCTCG CGCCGCTCAC CAGCATGTTC CTGTTCGGCG AGAACGGGCC CGGCGTACGC CGCGCCGAGC CGTTCGACGA TTTCCGCCCG CAGGTGCACG ATTCGGACGG GCTGGTGGTG CAGACACCCG GCGACCGTTT GTGGCGGCCC CTCGTCAACG GCCGGCCCGC GCCGCAGATC TCGTCGTTCC ACGCCGCCCC GCTCGAAGGC TTCGGGCTGC TGCAGCGGGA GCGGCGCTTT GCCGCCTATC TCGACGTGCA GGCGCAGCAC GAGGACCGGC CGGGCCTTTG GGTGACGCCG CAGGGCGGCT TCGGCGCGGG CGCGGTCCGC CTGTTCGAGA TCCCCTCGCG CACGGAGGCG ACCGACAACA TCGTCGCCGC CTTCGTGCCG GAGGCGCCGG TCGAGGCCGG CAAGACGCTG CGGCTCGCCT ATTCCCTCGT CACGGTCGGT GCCGAGCAGG CGCCGGCTCT CGCGCCGCCG CTCGCCCGTG TGGTCTCGAC TCGCGTCGGT TCGGCCGAGC GTCTGAGGCC GACCGATCCA CCCTCGCCGC AGCGGCGCCT CTACGCCATC GACTTCGAGG GGCCGGGCCT GCCCGACGAC CCGAAGGCGG CCATCGACGT GGCGCTCTCG GCGAGTGCAG GCGCGCTGGT CGAGCCCTAT GCCGAGCGGG TGCCGCAGAC CAGCGGCTGG CGTCTCTATG CCGAGTTCCG GCCGCCCGAT CCGTGGCCTG CGGGGGACGT GGTGCTGCGC GCCCGCCTCT CGCACGCAGG ACGCGCCATC ACCGAGACAT GGGACGGCGT CGCCTGA
Protein sequence | MAGPSDADNE APEEIDISRR AIVRGAFRGA LAGTAGLSIP GLFDLNAART MAQTVPNASP PAPLDARPVT FEAVAARAER LAAQVYAPQN NPLPPELAAL DYDAFQAIRF RSEATVPLGP RFSMQPFHRG NLHAKRVEIF LQSPEGVRPF GYDPDLFDLG PALQGRRYPA SLGYAGFRIA HGFDASQPQA HEEFLVFLGA SYFRIRGRGQ IYGLSARGIA VNTGLPQGEE FPDFTSFWIE VPQGDVASIT IFALLDGPSL TGAYRFAVTP GDPSHVAVEA ALFPRRSIAA LGLAPLTSMF LFGENGPGVR RAEPFDDFRP QVHDSDGLVV QTPGDRLWRP LVNGRPAPQI SSFHAAPLEG FGLLQRERRF AAYLDVQAQH EDRPGLWVTP QGGFGAGAVR LFEIPSRTEA TDNIVAAFVP EAPVEAGKTL RLAYSLVTVG AEQAPALAPP LARVVSTRVG SAERLRPTDP PSPQRRLYAI DFEGPGLPDD PKAAIDVALS ASAGALVEPY AERVPQTSGW RLYAEFRPPD PWPAGDVVLR ARLSHAGRAI TETWDGVA
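The gene sequence is shown in coding orientation (it begins with ATG and ends with TGA), so it can be translated directly to compare against the protein sequence. The following is a minimal sketch, not the database's own method: the helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the codon table is built from the standard NCBI amino-acid ordering, which translation table 11 shares for every codon (table 11 differs only in which start codons it permits).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (first base varies slowest).
# Translation table 11 uses the same codon assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces that separate the 10-base blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding-strand sequence, stopping at the first stop codon.

    Caveat: an alternative start codon (e.g. GTG or TTG) is not converted
    to M here; this record starts with ATG, so that case does not arise.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGGCAGGAC CGTCGGACGC GGACAACGAG"
print(translate(first_blocks))  # MAGPSDADNE
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the listed protein sequence and GC content.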