Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mchl_2321 |
Symbol | mdoG |
ID | 7116899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | + |
Start bp | 2441911 |
End bp | 2443530 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643525070 |
Product | glucan biosynthesis protein G |
Protein accession | YP_002421094 |
Protein GI | 218530278 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3131] Periplasmic glucans biosynthesis protein |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.405289 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.314726 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCGCG CCAAAGGCCC TCACGCCCCC AACGGCCATG CCGAGCAGGC CGCGCCGACA GGCGCGCCCT CGCGCCGGGG CGTGATGTCG GGGCTGGCTG CCGCCGGCCT CGCCGCGGCC CTGCCGGCTT CCGCTGAAGA CGATGCGAAG AGCCGTCCCT TCGAGCCCGG CATGGTCGAG CGTCAGGCGC AGGCTCTCGC GGCGCAGCCC TTCGACGGCC GTTTCCCGCC GCTGCCGGCG CCGCTCGCAG GCCTCGATTA CGACGCCTAC CGCGACATCC GCTTCCGGAA GGACCATGCG TTTCTCGGCG AGGCCGGCGG GCCGTTCCGG CTTCAGCTGT TCCACCGTGG CTTTCTCTAT CCGCGCCCGG TCTCCGTGAG CCTCGTGCGC GATGGGATCA GCACGCCGAT CCCGTACGAT CCGGCCCTGT TCGATTTCGG CCGGACGCGG ATCGACGGGA CGCTGCCGAG CGACCTGAAC TTTGCCGGCA TCCGCATCCA CGCGCCACTC AACCGGCCCG ACCGCCTCGA CGAACTGATC GTCTTCGCTG GCGCCAGCTA TTTCCGCTTC CTCGGCCAGG ACCAGCTCTA CGGTCTCTCC GCCCGCGCCC TCGCGATCGG TTCGGATGGC GATAAGGAGG AGTTTCCCTT CTTCCGCGCG TTCTACATCG AGGTGCCGTC GGCGGATGCG AAGGCCCTGA CGATCCACGC CCTGCTCGAC AGTCCCTCCG TGGCCGGTGC CTACCGCTTC ACGGTCGAGC CGGGCCGCAC CACCGGGGTG CGGGTGAACG CGACGCTCTA TCCGCGGCAG GATCTGGCCT CCGTCGGCAT CGCGCCGCTG ACCTCCATGT TCTTCATCAG CGAGACCGAT CGCGGCCACA GCGACGATTA CCGGCCGGAA CTGCACGATT CGGACGGGCT CCAGCTCGCC ACCGGCTCCG GCGAGTGGCT GTGGCGGCCG CTCGACAACC CGCAAAGCCG GCGGATCTCG ACCTTTCTCG ACCGCGACCC GAAGGGCTTC GGGCTGATGC AGCGCGACCG CGACTTTGGC AGCTACCAGG ATCTCGAAGC CGGCTACGAG CGCCGGCCCG GCTACTTCGT CGAGCCGGAG GGAGCCTGGG GCGAGGGCAG CGTCGTGCTG ATGGAGCTCC CGACCGACAA CGAAACCGCC GACAACGTCG TCGCCTTCTG GCGCCCGAAA CAGCCTTATC CGGCGGGACG GCCGGCGCGG CTCGCCTATA CGATCCGGGC GCTTGCGGCC GAGGATCTCC ACCCGAACGG CAAGGTGATG AACACCTTCA TCGCCGAGCC CGCCGCGAGC GGCGCCGCGC GCCGGGCCGC GGACGCAGCC GCCCTGCGCA ACCGACGCTT CCTGATCGAT TTCGGTGATG GGGAACTGGA CAAGCGTCTC GGCGATCCGG TGCCGCCCGA AGTCGTCGCC AGCGCCAGCA ACGGACGGAT CACCGCGACC TCGATCGTGC CGAATCCGCA TGTCGGCGGT TTCCGCGTCG CCCTCGACGT GCAGCTCGAC GGGCCGGGCG CGACCGAATT GCGGGCCTAT CTGAAGAAGG ACGATCAGGC CCTGACCGAG ACATGGTCCT ATCCCTGGAG CGTCGCGTGA
|
Protein sequence | MIRAKGPHAP NGHAEQAAPT GAPSRRGVMS GLAAAGLAAA LPASAEDDAK SRPFEPGMVE RQAQALAAQP FDGRFPPLPA PLAGLDYDAY RDIRFRKDHA FLGEAGGPFR LQLFHRGFLY PRPVSVSLVR DGISTPIPYD PALFDFGRTR IDGTLPSDLN FAGIRIHAPL NRPDRLDELI VFAGASYFRF LGQDQLYGLS ARALAIGSDG DKEEFPFFRA FYIEVPSADA KALTIHALLD SPSVAGAYRF TVEPGRTTGV RVNATLYPRQ DLASVGIAPL TSMFFISETD RGHSDDYRPE LHDSDGLQLA TGSGEWLWRP LDNPQSRRIS TFLDRDPKGF GLMQRDRDFG SYQDLEAGYE RRPGYFVEPE GAWGEGSVVL MELPTDNETA DNVVAFWRPK QPYPAGRPAR LAYTIRALAA EDLHPNGKVM NTFIAEPAAS GAARRAADAA ALRNRRFLID FGDGELDKRL GDPVPPEVVA SASNGRITAT SIVPNPHVGG FRVALDVQLD GPGATELRAY LKKDDQALTE TWSYPWSVA
|
| |