Gene Mext_0040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_0040 
Symbol 
ID5835560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp46285 
End bp48720 
Gene Length2436 bp 
Protein Length811 aa 
Translation table11 
GC content60% 
IMG OID641365824 
Productputative phosphoketolase 
Protein accessionYP_001637539 
Protein GI163849496 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3957] Phosphoketolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.383665 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCGA TCCTCAAAGC TCGGCGGCAG CCCAAGCGCA CGGCGCGGAC ATCCGAGCTG 
GCCCTGATCG ATGCCTACTG GCGGGCGGCA AACTACTTGT CGGTCGGTCA GATCTACCTC
TACGACAACC CGCTACTGGT GGAACGACTG ACCAAGGAGC ACATCAAGCC GCGTCTACTC
GGCCACTGGG GAACCACTCC GGGTTTGAAT TTCATCTATG TTCATCTGAA TCGTCTTATT
AAAAAGCATG ATCTTGATGT TATCTATATT ACAGGGCCGG GGCATGGCGG TCCTGCCTTG
ATCGCCAACG CATACCTCGA GGGGACTTAC AGCGAAGTCT ATCCGAACAT CTCCGCCGAT
GCTGAGGGCA TGAAGCGCCT CTTCAAGCAG TTCTCCTTCC CAGGCGGCAT CCCGAGCCAT
GTGGCTCCTG AAACACCGGG CTCGATGCAC GAAGGCGGAG AGCTGGGCTA TTCCCTCTCG
CACGCCTACG GCGCCGCGTT CGACAACCCC GACCTCATCG TCGCTTGCGT CGTCGGCGAT
GGCGAGGCCG AGACCGGGCC TCTTGCCACA AGCTGGCATT CGAACAAATT TCTCAACCCC
GTGAGCGATG GGACGGTTCT GCCGATCCTG CACCTCAACG GGTACAAGAT TGCCAACCCG
ACTGTACTGG CCAGAATTAG CCACGCGGAG CTTGAACATC TCTTCCGTGG GTACGGGTAC
ACCCCCTACT TCGTGGAAGG ACATGATCCG GCCGAGATGC ACCAGCGCAT GGCCTCCACC
ATGGATGCTG TCCTGCGGGA CATTCGCCGG ATCAAGTCGG ACGCGCGCGA CAAGGGTTTC
ACGGGCCGGC CGTTCTGGCC GATGATCGTT CTTCGGACGC CAAAAGGATG GACATGTCCG
AAGGAAATCG ATGGACGGCG CACAGAGGAT TACTGGCGCT CGCACCAAGT GCCAATGGGC
GAGATGCACG ACAATCCCGC CCATGTGCGC ATGCTCGAAG ACTGGATGCA ATCTTATCGG
CCCGCCGAGC TCTTCGACGA AGGCGGCCGA CTTCGCTCGG AACTTGCCGA GCTTGCCCCG
ACGGGCGACC GCCGCATGAG CGCCAATCCG CATACCAATG GAGGCACTCT GCTCCGCGAC
CTGCGGCTCC CGGATTTTCA CGACTATGCA ATACCGGTGA CCGCCCCCGG TGCCGCCGTC
GCCGAGTCCA CGCGCGTGAT GGGACGCTTC CTCCGCGACG TCATGGACCT GAACGCAGAA
GCGCGAAACT TCAGGCTGTT CAGTCCGGAC GAGAATAACT CAAATCGCTG GCAGGACGCG
CTCGAGGTGA CCAACCGCGC CTGGGTGGCC GAGACGTATC CCTGGGATGA TCACCTCGCG
CATGACGGCC GCGTGATGGA GATGCTGAGC GAGCATCAAT GTCAGGGCTG GCTCGAAGGC
TATCTGCTGA CGGGTCGGCA CGGCTTCTTC TCGTGCTACG AGGCCTTCAT CCACATCATC
GACTCGATGT TCAATCAGCA CGCCAAGTGG CTGAAGGTCT GCAACCATAT TCCGTGGCGG
CGACCCATTG GGTCTTTGAA CTACCTTCTC TCCAGCCACG TCTGGCGTCA GGATCACAAC
GGGTTCAGTC ATCAGGATCC AGGCTTCATC GACCATGTCG TGAACAAGAA AGCCGAGGTC
GTTCGTGTCT ACTTACCACC GGATGCGAAT TGTTTGCTTT CCGTAACCGA TCACTGCTTG
CGAAGCCGCA ACTACGTCAA CGTGATCGTC GCGGGTAAAC AGCCAGCACC CCAGTGGCTC
ACGATGGATC AGGCGGTCAA GCACTGCACC GCCGGGCTTG GGATCTGGGA ATGGGCGAGC
AACGACCGCG GCAGCGAGCC GGACGTCGTG ATGGCGTGCT GCGGGGATGT GCCGACCCTT
GAAACGCTCG CGGCCGTCGA CCTCCTCCGC TGCCATGCGC CGGATCTCAA GGTGCGCGTC
ATCAACGTCG TGAACCTGAT GAAGCTGCAG CCCGACACGG AGCATCCACA CGGCCTGTCA
GATCAGGATT TCGATGCCCT GTTCACGACG GACAAACCGG TCGTCTTCGC CTTTCACGGG
TATCCTTGGC TCATTCACCG GCTGGTTTAC CGACGTCACG GACACAGCAA CTTCCATGTG
CGTGGCTACA AGGAGGAAGG CACGACGAGC ACGCCGTTCG ACATGTGCGT GATGAACGAC
ATGGATCGGT TCCATCTCGT CAGCGATGTC ATCGACCGGG TGCCGGGCCT GGCCGCTCGG
GCGGCCTACG CCAAGCAAGC GATCCGGGAC AAGCTAATCG ACCATCGCGC GTACATTCAT
CGGCACGGCG ACGACATGCC GGAAGTATCC GGCTGGTCCT GGAGCCCGAT GGCGACGACG
CGCGGTCTCG GCTCGACGGA GAGTGACAAT GTGTGA
 
Protein sequence
MDAILKARRQ PKRTARTSEL ALIDAYWRAA NYLSVGQIYL YDNPLLVERL TKEHIKPRLL 
GHWGTTPGLN FIYVHLNRLI KKHDLDVIYI TGPGHGGPAL IANAYLEGTY SEVYPNISAD
AEGMKRLFKQ FSFPGGIPSH VAPETPGSMH EGGELGYSLS HAYGAAFDNP DLIVACVVGD
GEAETGPLAT SWHSNKFLNP VSDGTVLPIL HLNGYKIANP TVLARISHAE LEHLFRGYGY
TPYFVEGHDP AEMHQRMAST MDAVLRDIRR IKSDARDKGF TGRPFWPMIV LRTPKGWTCP
KEIDGRRTED YWRSHQVPMG EMHDNPAHVR MLEDWMQSYR PAELFDEGGR LRSELAELAP
TGDRRMSANP HTNGGTLLRD LRLPDFHDYA IPVTAPGAAV AESTRVMGRF LRDVMDLNAE
ARNFRLFSPD ENNSNRWQDA LEVTNRAWVA ETYPWDDHLA HDGRVMEMLS EHQCQGWLEG
YLLTGRHGFF SCYEAFIHII DSMFNQHAKW LKVCNHIPWR RPIGSLNYLL SSHVWRQDHN
GFSHQDPGFI DHVVNKKAEV VRVYLPPDAN CLLSVTDHCL RSRNYVNVIV AGKQPAPQWL
TMDQAVKHCT AGLGIWEWAS NDRGSEPDVV MACCGDVPTL ETLAAVDLLR CHAPDLKVRV
INVVNLMKLQ PDTEHPHGLS DQDFDALFTT DKPVVFAFHG YPWLIHRLVY RRHGHSNFHV
RGYKEEGTTS TPFDMCVMND MDRFHLVSDV IDRVPGLAAR AAYAKQAIRD KLIDHRAYIH
RHGDDMPEVS GWSWSPMATT RGLGSTESDN V