Gene Information |
Locus tag | Mext_2046 |
Symbol | mdoG |
ID | 5834775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2282359 |
End bp | 2283978 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641367844 |
Product | glucan biosynthesis protein G |
Protein accession | YP_001639513 |
Protein GI | 163851470 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3131] Periplasmic glucans biosynthesis protein |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.324147 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.558823 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCGCG CCAAAGGCCC TCACGCCCCC AACGGCCATG CCGAGCAGGC CGTGCCGACA GGCGCGCCCT CGCGCCGGGG CGTGATGTCG GGGCTGGCCG CCGCGGGCCT CGCCGCGGCT CTGCCAGCTT CCGCTCAAGA CGATGCGAGG AGCCGTCCCT TCGAGCCCGG CATGGTCGAG CGCCAGGCGC AGGCGCTCGC GGCGCAGCCC TTCGACGGCC GTTTCCCCCC GCTGCCGGCG CCGCTCGCGG GCCTCGATTA CGACGCCTAC CGCGACATCC GCTTCCGGAA GGACCATGCG TTGCTGGGCG AGGCCGGCGC GCCGTTCCGG CTTCAACTGT TCCACCGTGG CTTTCTCTAT CCGCGCCCGG TCTCCGTGAG CCTCGTGCGC GATGGGATCA GCACGCCGAT CCCGTACGAT CCGGCCCTGT TCGATTTCGG CCGGACGCGG ATCGACGGGA CGCTGCCGAA CGACCTGAAC TTTGCCGGCA TCCGCATCCA CGCGCCGCTC AACCGGCCCG ACCGCCTCGA CGAACTGATC GTCTTCGCCG GCGCCAGCTA TTTCCGCTTC CTCGGCCAGG ACCAGCTCTA TGGTCTCTCC GCCCGCGCCC TGGCGATCGG TTCGGACGGC GAGAAGGAGG AATTTCCGTT CTTCCGCGCG TTCTACATCG AGGTGCCGTC GGCGGATGCG AATGCGCTGA CGATCCACGC CCTGCTCGAC AGCCCGTCCG TGGCCGGCGC CTACCGCTTC ACGGTCGAGC CGGGCCGCAC CACGGGGGTG CGGGTGAGCG CGACGCTCTA TCCGCGGCAG GATCTGGCCT CCGTCGGCAT CGCGCCGCTG ACCTCGATGT TCTTCATCAG CGAGACCGAT CGCGGCCACA GCGACGATTA CCGGCCGGAA TTGCACGATT CGGACGGGCT CCAGCTCGCC ACCGGCTCCG GCGAGTGGCT GTGGCGACCG CTCGACAACC CGCAAAGCCG GCGGATCTCG ACCTTCCTCG ACCGCGACCC GAAGGGCTTC GGGCTGATGC AGCGCGACCG CGACTTCGGC AGCTACCAGG ATCTCGAGGC CGGCTACGAG CGCCGCCCCG GCTACTTCGT CGAGCCGGAG GGCGCCTGGG GCGAGGGCAG CGTCGTGCTG ATGGAACTGC CGACCGATAA CGAGACCGCC GACAACGTCG TCGCCTTCTG GCGCCCGAAA CAGCCTTATC CGGCGGGACG GCCGGCGCGG CTCGCCTATA CGATTCGGGC GCTCGCGGCC GAGGATCTTC ACCCGAACGG CAAGGTGATG AACACCTTCA TCGCCGAGCC CGCCGCGAGC GGCGCCGCGC GCCGGGCCGC GGATGCAGCC GCCCTGCGCA ACCGGCGCTT CCTGATCGAT TTCGGTGATG GGGAATTGGA GAAGCGTCTC GGCGATCCGG TGCCGCCCGA AGTCGTCGCC AGCGCCAGCA ACGGACGGAT CACCGCGACC TCGATCGTGC CGAATCCGCA TGTCGGCGGT TTCCGCGTCG CCCTCGACGT GCAGCTCGAC GGGCCGGGCG CGACCGAATT GCGGGCCTAT CTGAAGAAGG ACGATCAGGC CCTGACCGAG ACATGGTCCT ATCCCTGGAG CGTCGCTTGA
|
Protein sequence | MIRAKGPHAP NGHAEQAVPT GAPSRRGVMS GLAAAGLAAA LPASAQDDAR SRPFEPGMVE RQAQALAAQP FDGRFPPLPA PLAGLDYDAY RDIRFRKDHA LLGEAGAPFR LQLFHRGFLY PRPVSVSLVR DGISTPIPYD PALFDFGRTR IDGTLPNDLN FAGIRIHAPL NRPDRLDELI VFAGASYFRF LGQDQLYGLS ARALAIGSDG EKEEFPFFRA FYIEVPSADA NALTIHALLD SPSVAGAYRF TVEPGRTTGV RVSATLYPRQ DLASVGIAPL TSMFFISETD RGHSDDYRPE LHDSDGLQLA TGSGEWLWRP LDNPQSRRIS TFLDRDPKGF GLMQRDRDFG SYQDLEAGYE RRPGYFVEPE GAWGEGSVVL MELPTDNETA DNVVAFWRPK QPYPAGRPAR LAYTIRALAA EDLHPNGKVM NTFIAEPAAS GAARRAADAA ALRNRRFLID FGDGELEKRL GDPVPPEVVA SASNGRITAT SIVPNPHVGG FRVALDVQLD GPGATELRAY LKKDDQALTE TWSYPWSVA
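
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are inclusive, 1-based coordinates and that the gene length includes the terminal stop codon (the gene sequence above ends in TGA). The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for Mext_2046.
length_bp = gene_length(2282359, 2283978)
print(length_bp, protein_length(length_bp))  # prints: 1620 539
```

Both results agree with the Gene Length (1620 bp) and Protein Length (539 aa) fields listed in the record.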