Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_0200 |
Symbol | |
ID | 5833719 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 214784 |
End bp | 216838 |
Gene Length | 2055 bp |
Protein Length | 684 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641365985 |
Product | HAD family hydrolase |
Protein accession | YP_001637697 |
Protein GI | 163849654 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG0438] Glycosyltransferase [COG0561] Predicted hydrolases of the HAD superfamily |
TIGRFAM ID | [TIGR01484] HAD-superfamily hydrolase, subfamily IIB [TIGR01485] sucrose-6F-phosphate phosphohydrolase [TIGR02471] sucrose phosphate synthase, sucrose phosphatase-like domain, bacterial [TIGR02472] sucrose-phosphate synthase, putative, glycosyltransferase domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.70537 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGTTC TGCACATTGC TCTACAAGGC TGCCTGCGCG GCCGTGACGT CGTGTATGGC CTGACCTCGG ACACCGGCGG GCATATCCGA TACCTGCTCG ATCTCGTCGC CGCCTCGGCC CAAGACTCGC GAGTCGCGCG GATCGTGATG GCGACCCGTC GGTTCGAGGG CCCACCCGGC CCCGACTACG CCGTGCCCGA AGAGCGGATC TCCGACAAGG TCACGCTCGT GCGGCTCGCG AGCGCGTCAC CGGGCTACCG CTCGAAGGAG GCGATGCACG GTGAGGTCGA GAGCTACGCC GAGAATCTCA TCGCCTGGAT CGGCCGCCAG CCCCGCGCGC CCGACATCAT CCACGCGCAT TACGCGGATG CCGCCGCGGT TGCCGAGATC GTCGAGGATC GGCTCGGCAT CCCCTTCGTG TTCACGGCCC ATTCGCTCGG GCGAGTGAAG GCGGCCATGG TCGGCGACGG CGCCGCGAAC GACCTCGAAC TGTCGCGCCG GATCGTCACC GAGGAGGCGG CCCTGGCGCG GGCGAGCCTC GTCATCGCTT CGTCGCGCGA CGAGGCCGAG GTGCAATATG CCGGCTATGC CGCCTACGAT CCTGGCCGCG CCCGTGTCCT GCCGCCGGGC AGCGATCTCG CCCGCTTCGC GCAGAGCCGC CCGCATCCCC GGATCGACGC GGCGATCGAC CGGTTCCTGC ACGATCCCGG CAAGCCGGCC GTGCTGGCGC TGGCCCGACC GGTGGCACGG AAGAATCTGG CGGCCCTGGT TCAGGCCTAT GGCGAGAGCC CGGAGCTTCA GGCCTGCGCC AACCTCGTGA TCGTCGCCGG CACCCGTGAC GACATCGACC GGCTCGACGG CGACATGGCG GCGACCATGC GCGACCTCCT CGTGCTCATC GACCGTTACG ACCTCTACGG CCACGTCGCC TATCCGAAGA CGCACCGCCC GGAGGACGTG CCGGCGATCT ACGCCTATGC GCGGGAGCGG GGCGGCGTCT TCGTCAACCC GGCCCTCAAC GAGCCGTTCG GCCTGACGCT TCTGGAGGCG TCCGCCGCCG GCTTGCCGCT GGTGGCCACC GACAGCGGCG GCCCCAACGA CATCGTCGAG ACCTGCGGCA ACGGGCTGCT CGTCGATCCG CGCGCCCCCG CGGCGATCGC GGCCGCCTGC CTGCACATCC TCACGGATGC CCCCTTCCGC GCCCGCTGCG TCGCCGGCGG TGCCCGCGCG GCGGCCGCCT ATGATTGGGA CCGGCACGCC GCCCGCTATC TCGACCTGCT CGGCGCGCTG CTCGCGCGGA ACCCGCCCCT GCGGACCCCG CGCCAACTCC TGATCTGCGA TATCGACAAC ACGCTCGTGG GATGTGAATC CGCCTTGGCG ACGTTCCGGC GCTGGCGCAG CCGGCAGACG GGGCTGGCCT TCGGTGTGGC CACCGGCCGC TCGTTTCACA GCGCGATGGC GGTGCTGGAG CAGCAGGCGA GCCCGCGACC GCAGGTGATG ATCACCTCGG TCGGCTCGGA GATCTACCAT CTCGATGCCA ACGGCGTGAC CTACACGGCC GACGCCGCGT GGCGCGAGGC GGTCTCGGAC GCCTGGGACC GGGGGGCGGT CGGCGCGGCT TTGGGCCGAC TCGACGGGCT CGTCCCGCAG GGCCCGCTCG AGCAGCGCGC GCACAAGCTG AGCTTCTTCG GCGACGAGGC GACGGCCCAT CGGGCGCGCG ATCGCCTCCT GCAGGCGGGT CTCCCGGCGA ACGTGATCCA CAGCCACGGC CGCTACCTCG ATGTCCTGCC CGCGACGGCC TCCAAGGGGA CGGCGGTCGA CCACGTCCGC GCGCTCTACG GGTTGCCCGA GCAGGCCGTG TTCGTGGCCG GTGATTCCGG CAACGATGTC GAGATGCTGC GCGCTCGGAC GCAGGCGATC ATCGTCGCGA ACTACTCCGA CGGGCTGGCC ACCAACGCCG CGCTCAAGCA CTCCTACGTC GCCCGCACTT CGCATGCCCG CGGCATCATC GAGGGCGTTC TGCATTTCCG CCGGGCGCTG GCCTATGCGT CTTAG
|
Protein sequence | MFVLHIALQG CLRGRDVVYG LTSDTGGHIR YLLDLVAASA QDSRVARIVM ATRRFEGPPG PDYAVPEERI SDKVTLVRLA SASPGYRSKE AMHGEVESYA ENLIAWIGRQ PRAPDIIHAH YADAAAVAEI VEDRLGIPFV FTAHSLGRVK AAMVGDGAAN DLELSRRIVT EEAALARASL VIASSRDEAE VQYAGYAAYD PGRARVLPPG SDLARFAQSR PHPRIDAAID RFLHDPGKPA VLALARPVAR KNLAALVQAY GESPELQACA NLVIVAGTRD DIDRLDGDMA ATMRDLLVLI DRYDLYGHVA YPKTHRPEDV PAIYAYARER GGVFVNPALN EPFGLTLLEA SAAGLPLVAT DSGGPNDIVE TCGNGLLVDP RAPAAIAAAC LHILTDAPFR ARCVAGGARA AAAYDWDRHA ARYLDLLGAL LARNPPLRTP RQLLICDIDN TLVGCESALA TFRRWRSRQT GLAFGVATGR SFHSAMAVLE QQASPRPQVM ITSVGSEIYH LDANGVTYTA DAAWREAVSD AWDRGAVGAA LGRLDGLVPQ GPLEQRAHKL SFFGDEATAH RARDRLLQAG LPANVIHSHG RYLDVLPATA SKGTAVDHVR ALYGLPEQAV FVAGDSGNDV EMLRARTQAI IVANYSDGLA TNAALKHSYV ARTSHARGII EGVLHFRRAL AYAS
|
| |