Gene Mext_2007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2007 
Symbol 
ID5832847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2238166 
End bp2239995 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content71% 
IMG OID641367808 
Productphosphogluconate dehydratase 
Protein accessionYP_001639477 
Protein GI163851434 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR01196] 6-phosphogluconate dehydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.691716 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGAGC TTCATCCCGA GGTCGCCGCG GTCACCGAGC GCGTCATCGC GCGCTCCCGC 
GCCAGCCGCC GCGCCTATCT CGACCTGATC GAACGCGAGC GCGAGGCGGG CGTGCATCGC
CCGATGCTCG CCTGCGGCAA CCTCGCCCAC GGTTTCGCGG CGTCCGGCGA GGACAAGCCG
GCGATCATCG CCGGCCGCGC CATGAATATC GGCATCGTCA CCGCCTACAA CGACATGCTC
TCGGCGCATC AGCCCTACGG GCGCTATCCC GAGCAGATCA AGCTGTTCGC CCGCGAAGTG
GGTGCGACGG CTCAGGTCGC TGGCGGCACG CCCGCAATGT GCGACGGCGT CACCCAAGGC
CAGCGCGGCA TGGAGCTGTC GCTGTTCTCC CGCGACACCA TCGCCCTCTC CACCGCGGTC
GCGCTCAGCC ACGGCATGTT CGAGGGCGCG GCTCTGCTGG GCATCTGCGA CAAGATCGTG
CCCGGTCTCA TCATCGGCGC GCTCCGCTTC GGCCACCTGC CGATGATCCT GGTCCCGGCC
GGCCCCATGC CCTCGGGTCT CGCCAACAAG GAGAAGCAGC GCATCCGGCA GCTCTATGCC
GAGGGCAAGG TCGGCCGCGC GGAACTGCTG GAGAGCGAGT CCGCCTCCTA TCACGGGGCC
GGCACCTGCA CCTTCTACGG CACCGCCAAT TCCAACCAGA TGATGATGGA CGTGATGGGC
CTGCACATGC CCGGCGCCTC CTTCATCAAT CCCGGCACGC GGCTGCGGCA GGCGGTGACG
CGTTCGGCGA TCCACCGGCT CACCGAGATC GGCTGGAACG GCAACGATTA CCGCCCCCTG
GGGCGCTGCA TCGACGAGCG GGCCATCGTC AACGCCATCG TCGGCCTGCT CGCCACCGGC
GGCTCGACCA ACCATGCGAT CCACATCCCG GCCATGGCGC GCGCCGCCGG CATCGTCGTC
GATTGGGAGG ATTTCGACCG GCTCTCCGGC GTGGTGCCGC TGATCGCGCG GGTCTACCCG
AACGGCGCGG GCGACGTGAA CCATTTCCAC GCGGCCGGCG GCATGTCCTA CGTCATCGCC
TCGCTGATCG ATGCCGGGCT CCTGCACGAC GATCTCCTGA CGGTGGCCGG CACGCGCCTG
CGCGACCACG CCCGCGACCC GAAGCTTCTG GGCGAGGGCA ACGACCTCAC CTTCGAGGAC
GCGCCGGCCG AGCCGCTGGA CGAGGCCATG CTGCGTCCCC CCTCCCGCCC GTTCCAGCCG
GATGGCGGGA TGCGGCTGGT GAAGGGCAAT CTCGGCCGCG CCACCTTCAA GACCAGCGCG
GTCGATCCCG AGCGCCGCAC CATCGAAGCT CCGGCCCGCG TCTTCTCCGA TCAGGACGAA
GTCATCGCCG CCTTCAAGGC GGGCGAGTTG GAGCGCGACG TCGTCGTGGT CGTGCGCTTC
CAGGGCCCGC GCGCCAACGG GATGCCGGAG TTGCATAAGC TGACCCCGCC GCTGGGCGTG
CTGCAGGACC GCGGCCACAA GGTCGCGCTC GTCACCGACG GGCGGATGTC GGGGGCCTCC
GGCAAGGTGC CGGCGGCGAT CCATGTCAGC CCCGAGGCGG TCGGCGGCGG CCCGATCAGC
CGCATCCGCG ACGGCGACAT CGTCCGCCTT TCGGCCGAGC AGGGATTGCT CGAAGTGCTG
GTCGATTCCG CCGAGTGGGA CAGCCGCGAG GACGCGGTGC GGCCGCCGGA CGGCCTCGGC
ACCGGCCGCG AGCTGTTCGC CTTCATGCGC CAGGGCGCCG ACGATGCCGA GCGCGGCGGC
TCGGCGATGC TGGCGGCGGC GGGCCTTTAG
 
Protein sequence
MPELHPEVAA VTERVIARSR ASRRAYLDLI EREREAGVHR PMLACGNLAH GFAASGEDKP 
AIIAGRAMNI GIVTAYNDML SAHQPYGRYP EQIKLFAREV GATAQVAGGT PAMCDGVTQG
QRGMELSLFS RDTIALSTAV ALSHGMFEGA ALLGICDKIV PGLIIGALRF GHLPMILVPA
GPMPSGLANK EKQRIRQLYA EGKVGRAELL ESESASYHGA GTCTFYGTAN SNQMMMDVMG
LHMPGASFIN PGTRLRQAVT RSAIHRLTEI GWNGNDYRPL GRCIDERAIV NAIVGLLATG
GSTNHAIHIP AMARAAGIVV DWEDFDRLSG VVPLIARVYP NGAGDVNHFH AAGGMSYVIA
SLIDAGLLHD DLLTVAGTRL RDHARDPKLL GEGNDLTFED APAEPLDEAM LRPPSRPFQP
DGGMRLVKGN LGRATFKTSA VDPERRTIEA PARVFSDQDE VIAAFKAGEL ERDVVVVVRF
QGPRANGMPE LHKLTPPLGV LQDRGHKVAL VTDGRMSGAS GKVPAAIHVS PEAVGGGPIS
RIRDGDIVRL SAEQGLLEVL VDSAEWDSRE DAVRPPDGLG TGRELFAFMR QGADDAERGG
SAMLAAAGL