Gene Mext_1954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1954 
Symbol 
ID5833791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2186317 
End bp2188047 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content70% 
IMG OID641367755 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001639424 
Protein GI163851381 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.160377 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCGC GTCAGACCGA CAAGTCGAAG CTGCCGAGCC GGCACGTGAC GGAGGGGCCC 
GAGCGGGCGC CCCACCGCTC GTACCTCTAC GCCATGGGCC TGACGACCGA GCAGATCCAC
CAGCCGCTGG TCGGCGTCGC CTCGTGCTGG AACGAGGCCG CGCCCTGCAA CATCTCGCTG
ATGCGCCAAG CCCAGGCCGT GAAGAAGGGT GTCGCCGCCG CCAAGGGCAC TCCGCGCGAG
TTCTGCACCA TCACCGTCAC CGACGGCATC GCCATGGGCC ATGGCGGTAT GCGCGCCTCG
CTGCCTTCCC GCGAGGTCAT CGCCGATTCG GTCGAGCTGA CGATCCGCGG CCATTCCTAC
GACGCCCTCG TGGGGCTGGC CGGCTGCGAC AAGTCCCTGC CCGGCATGAT GATGGCCATG
GTGCGCCTCA ACGTGCCCTC GATCTTCATC TATGGCGGCT CGATCCTGCC GGGCTCGTTC
CGCGGCCGGC CGGTTACGGT GCAGGATCTG TTCGAGGCGG TCGGCAAGGT CGCCGTCGGC
GACATGAGCC TCGACGACCT CGACGAGCTG GAGCGGGTTG CCTGCCCCTC GGCCGGCGCC
TGCGGCGCGC AGTTCACCGC CAACACCATG GCCACCGTCT CCGAGGCGAT CGGCCTCGCG
CTGCCCTACT CGGCCGGCGC GCCTGCCCCT TACGAGATCC GCGACCAATT CTGCGCCGCC
GCCGGCGAGA AGGTGATGGA GCTGATCGCC AAGAACATCC GCCCGCGCGA CATCGTCACC
CGCAAGGCGC TGGAGAACGC CGCCGCGACG GTCGCGGCCT CGGGCGGCTC GACCAACGCG
GCCCTGCACC TGCCGGCGAT CGCGCATGAA TGCGGCATCG AGTTCACCCT GTTCGACGTC
GCCGAGATCT TCCGCAAGAC CCCCTACATC GCCGACTTGA AGCCCGGCGG GCGCTATGTG
GCCAAGGACA TGTTCGAGGT CGGCGGCATC CCGCTGCTGA TGAAGACGCT GCTCGACCAC
GGCTACCTGC ACGGCGACTG CCTCACCGTC ACCGGCCGCA CCATCGCCGA GAACCTCGCC
AAGGTCGCCT GGAACCCGGA TCAGGACGTG GTGCGCCCGG CCGACAAGCC CATCACCGTC
ACCGGCGGCG TGGTGGGCCT GCGCGGCAAT CTCGCCCCCG AGGGCGCGAT CGTGAAGGTC
GCCGGCATGC CGCCCGAGGC CCAGGTCTTC ACCGGCCCGG CCCGCGTCTT CGACGGCGAG
GAAGCCTGTT TCGAGGCGGT GCAGAACCGC ACCTACAAGC CCGGCGACGT TCTGGTCATC
CGTTACGAGG GCCCGAAGGG AGGCCCCGGC ATGCGCGAGA TGCTCTCGAC CACCGCCGCC
CTCTACGGCC AGGGCATGGG CGACAAGGTG GCCCTCATCA CCGACGGGCG CTTCTCCGGC
GCGACCCGCG GCTTCTGCGT CGGCCATGTC GGCCCCGAGG CTGCCATCGG CGGGCCGATC
GGCCTGCTGC GCGACGGCGA CATCATCACC CTCGATGCGA TCAAGGGCAC GCTCGACGTG
GCGCTCTCCG ACGAGGAACT GGCCCAGCGT CGCAGCGAAT GGACGCCGCG GGGCAATGCC
GCGACCTCCG GCTACCTCTG GAAATACGCG CAGTCCGTCG GGCCTGCAGT GAACGGGGCC
GTGACGCATC CGGGCGGCGC GGGGGAGACG AACGTCTATG CCGACATCTA G
 
Protein sequence
MDARQTDKSK LPSRHVTEGP ERAPHRSYLY AMGLTTEQIH QPLVGVASCW NEAAPCNISL 
MRQAQAVKKG VAAAKGTPRE FCTITVTDGI AMGHGGMRAS LPSREVIADS VELTIRGHSY
DALVGLAGCD KSLPGMMMAM VRLNVPSIFI YGGSILPGSF RGRPVTVQDL FEAVGKVAVG
DMSLDDLDEL ERVACPSAGA CGAQFTANTM ATVSEAIGLA LPYSAGAPAP YEIRDQFCAA
AGEKVMELIA KNIRPRDIVT RKALENAAAT VAASGGSTNA ALHLPAIAHE CGIEFTLFDV
AEIFRKTPYI ADLKPGGRYV AKDMFEVGGI PLLMKTLLDH GYLHGDCLTV TGRTIAENLA
KVAWNPDQDV VRPADKPITV TGGVVGLRGN LAPEGAIVKV AGMPPEAQVF TGPARVFDGE
EACFEAVQNR TYKPGDVLVI RYEGPKGGPG MREMLSTTAA LYGQGMGDKV ALITDGRFSG
ATRGFCVGHV GPEAAIGGPI GLLRDGDIIT LDAIKGTLDV ALSDEELAQR RSEWTPRGNA
ATSGYLWKYA QSVGPAVNGA VTHPGGAGET NVYADI