Gene Mext_2046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2046 
SymbolmdoG 
ID5834775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2282359 
End bp2283978 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content69% 
IMG OID641367844 
Productglucan biosynthesis protein G 
Protein accessionYP_001639513 
Protein GI163851470 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3131] Periplasmic glucans biosynthesis protein 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.324147 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.558823 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCGCG CCAAAGGCCC TCACGCCCCC AACGGCCATG CCGAGCAGGC CGTGCCGACA 
GGCGCGCCCT CGCGCCGGGG CGTGATGTCG GGGCTGGCCG CCGCGGGCCT CGCCGCGGCT
CTGCCAGCTT CCGCTCAAGA CGATGCGAGG AGCCGTCCCT TCGAGCCCGG CATGGTCGAG
CGCCAGGCGC AGGCGCTCGC GGCGCAGCCC TTCGACGGCC GTTTCCCCCC GCTGCCGGCG
CCGCTCGCGG GCCTCGATTA CGACGCCTAC CGCGACATCC GCTTCCGGAA GGACCATGCG
TTGCTGGGCG AGGCCGGCGC GCCGTTCCGG CTTCAACTGT TCCACCGTGG CTTTCTCTAT
CCGCGCCCGG TCTCCGTGAG CCTCGTGCGC GATGGGATCA GCACGCCGAT CCCGTACGAT
CCGGCCCTGT TCGATTTCGG CCGGACGCGG ATCGACGGGA CGCTGCCGAA CGACCTGAAC
TTTGCCGGCA TCCGCATCCA CGCGCCGCTC AACCGGCCCG ACCGCCTCGA CGAACTGATC
GTCTTCGCCG GCGCCAGCTA TTTCCGCTTC CTCGGCCAGG ACCAGCTCTA TGGTCTCTCC
GCCCGCGCCC TGGCGATCGG TTCGGACGGC GAGAAGGAGG AATTTCCGTT CTTCCGCGCG
TTCTACATCG AGGTGCCGTC GGCGGATGCG AATGCGCTGA CGATCCACGC CCTGCTCGAC
AGCCCGTCCG TGGCCGGCGC CTACCGCTTC ACGGTCGAGC CGGGCCGCAC CACGGGGGTG
CGGGTGAGCG CGACGCTCTA TCCGCGGCAG GATCTGGCCT CCGTCGGCAT CGCGCCGCTG
ACCTCGATGT TCTTCATCAG CGAGACCGAT CGCGGCCACA GCGACGATTA CCGGCCGGAA
TTGCACGATT CGGACGGGCT CCAGCTCGCC ACCGGCTCCG GCGAGTGGCT GTGGCGACCG
CTCGACAACC CGCAAAGCCG GCGGATCTCG ACCTTCCTCG ACCGCGACCC GAAGGGCTTC
GGGCTGATGC AGCGCGACCG CGACTTCGGC AGCTACCAGG ATCTCGAGGC CGGCTACGAG
CGCCGCCCCG GCTACTTCGT CGAGCCGGAG GGCGCCTGGG GCGAGGGCAG CGTCGTGCTG
ATGGAACTGC CGACCGATAA CGAGACCGCC GACAACGTCG TCGCCTTCTG GCGCCCGAAA
CAGCCTTATC CGGCGGGACG GCCGGCGCGG CTCGCCTATA CGATTCGGGC GCTCGCGGCC
GAGGATCTTC ACCCGAACGG CAAGGTGATG AACACCTTCA TCGCCGAGCC CGCCGCGAGC
GGCGCCGCGC GCCGGGCCGC GGATGCAGCC GCCCTGCGCA ACCGGCGCTT CCTGATCGAT
TTCGGTGATG GGGAATTGGA GAAGCGTCTC GGCGATCCGG TGCCGCCCGA AGTCGTCGCC
AGCGCCAGCA ACGGACGGAT CACCGCGACC TCGATCGTGC CGAATCCGCA TGTCGGCGGT
TTCCGCGTCG CCCTCGACGT GCAGCTCGAC GGGCCGGGCG CGACCGAATT GCGGGCCTAT
CTGAAGAAGG ACGATCAGGC CCTGACCGAG ACATGGTCCT ATCCCTGGAG CGTCGCTTGA
 
Protein sequence
MIRAKGPHAP NGHAEQAVPT GAPSRRGVMS GLAAAGLAAA LPASAQDDAR SRPFEPGMVE 
RQAQALAAQP FDGRFPPLPA PLAGLDYDAY RDIRFRKDHA LLGEAGAPFR LQLFHRGFLY
PRPVSVSLVR DGISTPIPYD PALFDFGRTR IDGTLPNDLN FAGIRIHAPL NRPDRLDELI
VFAGASYFRF LGQDQLYGLS ARALAIGSDG EKEEFPFFRA FYIEVPSADA NALTIHALLD
SPSVAGAYRF TVEPGRTTGV RVSATLYPRQ DLASVGIAPL TSMFFISETD RGHSDDYRPE
LHDSDGLQLA TGSGEWLWRP LDNPQSRRIS TFLDRDPKGF GLMQRDRDFG SYQDLEAGYE
RRPGYFVEPE GAWGEGSVVL MELPTDNETA DNVVAFWRPK QPYPAGRPAR LAYTIRALAA
EDLHPNGKVM NTFIAEPAAS GAARRAADAA ALRNRRFLID FGDGELEKRL GDPVPPEVVA
SASNGRITAT SIVPNPHVGG FRVALDVQLD GPGATELRAY LKKDDQALTE TWSYPWSVA