Gene Mext_0200 details


Gene Information

Locus tag: Mext_0200
Symbol:
ID: 5833719
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 214784
End bp: 216838
Gene Length: 2055 bp
Protein Length: 684 aa
Translation table: 11
GC content: 71%
IMG OID: 641365985
Product: HAD family hydrolase
Protein accession: YP_001637697
Protein GI: 163849654
COG category: [M] Cell wall/membrane/envelope biogenesis
    [R] General function prediction only
COG ID: [COG0438] Glycosyltransferase
    [COG0561] Predicted hydrolases of the HAD superfamily
TIGRFAM ID: [TIGR01484] HAD-superfamily hydrolase, subfamily IIB
    [TIGR01485] sucrose-6F-phosphate phosphohydrolase
    [TIGR02471] sucrose phosphate synthase, sucrose phosphatase-like domain, bacterial
    [TIGR02472] sucrose-phosphate synthase, putative, glycosyltransferase domain
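
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the CDS is unspliced and ends in a stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(214784, 216838)
print(length, protein_length_aa(length))  # prints "2055 684"
```

Both values match the Gene Length and Protein Length fields of this record.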


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.70537
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGTTC TGCACATTGC TCTACAAGGC TGCCTGCGCG GCCGTGACGT CGTGTATGGC 
CTGACCTCGG ACACCGGCGG GCATATCCGA TACCTGCTCG ATCTCGTCGC CGCCTCGGCC
CAAGACTCGC GAGTCGCGCG GATCGTGATG GCGACCCGTC GGTTCGAGGG CCCACCCGGC
CCCGACTACG CCGTGCCCGA AGAGCGGATC TCCGACAAGG TCACGCTCGT GCGGCTCGCG
AGCGCGTCAC CGGGCTACCG CTCGAAGGAG GCGATGCACG GTGAGGTCGA GAGCTACGCC
GAGAATCTCA TCGCCTGGAT CGGCCGCCAG CCCCGCGCGC CCGACATCAT CCACGCGCAT
TACGCGGATG CCGCCGCGGT TGCCGAGATC GTCGAGGATC GGCTCGGCAT CCCCTTCGTG
TTCACGGCCC ATTCGCTCGG GCGAGTGAAG GCGGCCATGG TCGGCGACGG CGCCGCGAAC
GACCTCGAAC TGTCGCGCCG GATCGTCACC GAGGAGGCGG CCCTGGCGCG GGCGAGCCTC
GTCATCGCTT CGTCGCGCGA CGAGGCCGAG GTGCAATATG CCGGCTATGC CGCCTACGAT
CCTGGCCGCG CCCGTGTCCT GCCGCCGGGC AGCGATCTCG CCCGCTTCGC GCAGAGCCGC
CCGCATCCCC GGATCGACGC GGCGATCGAC CGGTTCCTGC ACGATCCCGG CAAGCCGGCC
GTGCTGGCGC TGGCCCGACC GGTGGCACGG AAGAATCTGG CGGCCCTGGT TCAGGCCTAT
GGCGAGAGCC CGGAGCTTCA GGCCTGCGCC AACCTCGTGA TCGTCGCCGG CACCCGTGAC
GACATCGACC GGCTCGACGG CGACATGGCG GCGACCATGC GCGACCTCCT CGTGCTCATC
GACCGTTACG ACCTCTACGG CCACGTCGCC TATCCGAAGA CGCACCGCCC GGAGGACGTG
CCGGCGATCT ACGCCTATGC GCGGGAGCGG GGCGGCGTCT TCGTCAACCC GGCCCTCAAC
GAGCCGTTCG GCCTGACGCT TCTGGAGGCG TCCGCCGCCG GCTTGCCGCT GGTGGCCACC
GACAGCGGCG GCCCCAACGA CATCGTCGAG ACCTGCGGCA ACGGGCTGCT CGTCGATCCG
CGCGCCCCCG CGGCGATCGC GGCCGCCTGC CTGCACATCC TCACGGATGC CCCCTTCCGC
GCCCGCTGCG TCGCCGGCGG TGCCCGCGCG GCGGCCGCCT ATGATTGGGA CCGGCACGCC
GCCCGCTATC TCGACCTGCT CGGCGCGCTG CTCGCGCGGA ACCCGCCCCT GCGGACCCCG
CGCCAACTCC TGATCTGCGA TATCGACAAC ACGCTCGTGG GATGTGAATC CGCCTTGGCG
ACGTTCCGGC GCTGGCGCAG CCGGCAGACG GGGCTGGCCT TCGGTGTGGC CACCGGCCGC
TCGTTTCACA GCGCGATGGC GGTGCTGGAG CAGCAGGCGA GCCCGCGACC GCAGGTGATG
ATCACCTCGG TCGGCTCGGA GATCTACCAT CTCGATGCCA ACGGCGTGAC CTACACGGCC
GACGCCGCGT GGCGCGAGGC GGTCTCGGAC GCCTGGGACC GGGGGGCGGT CGGCGCGGCT
TTGGGCCGAC TCGACGGGCT CGTCCCGCAG GGCCCGCTCG AGCAGCGCGC GCACAAGCTG
AGCTTCTTCG GCGACGAGGC GACGGCCCAT CGGGCGCGCG ATCGCCTCCT GCAGGCGGGT
CTCCCGGCGA ACGTGATCCA CAGCCACGGC CGCTACCTCG ATGTCCTGCC CGCGACGGCC
TCCAAGGGGA CGGCGGTCGA CCACGTCCGC GCGCTCTACG GGTTGCCCGA GCAGGCCGTG
TTCGTGGCCG GTGATTCCGG CAACGATGTC GAGATGCTGC GCGCTCGGAC GCAGGCGATC
ATCGTCGCGA ACTACTCCGA CGGGCTGGCC ACCAACGCCG CGCTCAAGCA CTCCTACGTC
GCCCGCACTT CGCATGCCCG CGGCATCATC GAGGGCGTTC TGCATTTCCG CCGGGCGCTG
GCCTATGCGT CTTAG
 
Protein sequence
MFVLHIALQG CLRGRDVVYG LTSDTGGHIR YLLDLVAASA QDSRVARIVM ATRRFEGPPG 
PDYAVPEERI SDKVTLVRLA SASPGYRSKE AMHGEVESYA ENLIAWIGRQ PRAPDIIHAH
YADAAAVAEI VEDRLGIPFV FTAHSLGRVK AAMVGDGAAN DLELSRRIVT EEAALARASL
VIASSRDEAE VQYAGYAAYD PGRARVLPPG SDLARFAQSR PHPRIDAAID RFLHDPGKPA
VLALARPVAR KNLAALVQAY GESPELQACA NLVIVAGTRD DIDRLDGDMA ATMRDLLVLI
DRYDLYGHVA YPKTHRPEDV PAIYAYARER GGVFVNPALN EPFGLTLLEA SAAGLPLVAT
DSGGPNDIVE TCGNGLLVDP RAPAAIAAAC LHILTDAPFR ARCVAGGARA AAAYDWDRHA
ARYLDLLGAL LARNPPLRTP RQLLICDIDN TLVGCESALA TFRRWRSRQT GLAFGVATGR
SFHSAMAVLE QQASPRPQVM ITSVGSEIYH LDANGVTYTA DAAWREAVSD AWDRGAVGAA
LGRLDGLVPQ GPLEQRAHKL SFFGDEATAH RARDRLLQAG LPANVIHSHG RYLDVLPATA
SKGTAVDHVR ALYGLPEQAV FVAGDSGNDV EMLRARTQAI IVANYSDGLA TNAALKHSYV
ARTSHARGII EGVLHFRRAL AYAS
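
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, not the database's own tooling, of how one might strip the grouping, compute a GC fraction, and translate the gene sequence. All function names are hypothetical. The codon table is the standard set of amino acid assignments, which translation table 11 shares; the sketch does not handle the alternative start codons that table 11 permits, which is harmless here because this gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGTTCGTTC TGCACATTGC TCTACAAGGC")
print(translate(first_blocks))  # prints "MFVLHIALQG"
```

The printed residues agree with the first block of the protein sequence in this record. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.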