Gene Mpe_A3651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3651 
Symbol 
ID4786117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3860638 
End bp3862341 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content73% 
IMG OID640092233 
Productdihydroxyacid dehydratase 
Protein accessionYP_001022839 
Protein GI124268835 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.583324 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCGTT CCTTCCGCAG CAATTTCGAG CCCGGTTCCA CGCGCTGGGC GGTGCGTCAT 
GCGCAGTGGA GCGCGATGGG CATCGCCGAG GCCGATTTCG ACAAGCCCAA GATCGCCGTC
GTCAACACCT CCAGCGGCCT GTCGGTGTGC TTCCAGCACC TCGACGGCAT CGCGAAGCGG
GTGGCCGAGG CGATCCGCGC CGCCGGCGGG CTGCCGCTGG AGATCCGCAC CGCGGCGCCG
TCCGACTTCG TCACCAGCGC GGGCCGCCAG GGCCGCTACC TGATGCCCAC GCGCGACCTG
ATCGTCAACG ACATCGAGGT GCAGGTCGAG GGCGCCGAGC TCGACGGCAT GCTGCTGCTG
TCGAGCTGCG ACAAGACCAC GCCGGCCCAC CTGATGGCCG CCGGCCGCCT GGACGTGCCC
AGCCTGGTGC TGGCCTGCGG CTACCAGCTC GGCCGCCAGT GCGGCGAGCA CCACGTCGAC
ATCGAGGAGG TCTACAAGGC GGTCGGCACC GTCAAGGCCG GCCAGATGGA CCTCGACACG
CTGCAGGGCA TGTGCCGTGT GGCGATCGAT GGCCCCGGCG TGTGTGCCGG TCTTGCCACT
GCCAACTCGA TGCACTGCCT GGCCGAGGCG CTGGGCATGG CGCTGCCCGG CAACGCGCCG
GTGCGCGCCG ACGGCCCGCG GCTGCATGCG CTGGCGGCGC AGGCCGGAGC ACGCATCGTC
GAGATGGTGC AGCAGGACCT GCGGCCGCGA GACATCCTCA CGCCCGGCGC CTTCCGCAAC
GCGGTGCGCT TCGCGGTGGC GACCGGCTGC TCGGTCAACG TGATGCGCCA CCTGATCGCC
ATCGCGATCG AGTCCGAGTG CCCGGTCGAT GTGATCGCGG AGTTCGAGCG CGCCGCCGAC
GAGGTGCCGA TGCTCACGCG CATCCGGCCC AACGGGCCGG ACCGCATCGA GACCTTCGAG
GCCGCGGGCG GCGTGCGCGG CGTGCTCCGG CAGCTGGCGC CGCAGCTCGA CCGCGAGGTG
CTCACCGCCG ACGGCCGGCG CCTGGGCGCG TTGATCGACG AGACCCCGGC GCCCGACGTC
GCCTGCATTC GGCCGCTGGC CGATCCGTTC GCGCGCGAGC CGGGGCTGAT GATCATCCGC
GGCTCGCTCG CGCCCGACGG CGCGCTGGTC AAGCTGGCGG CGGTGCCGGC GGCGATCCGC
CGCTTCCGCG GCGAGGCCCG GGTGTTCGAG GACGAGGCGC TGGCGATCGA GGGCCTGAAG
ACCGGCGCGG TGCGCCCGGG TCAGGTGATC GTGCTGCGCA TGCTGGGCCC GAAGGGCGGG
CCGGGCACGG TGTTCGCGGC CAGTTTCATG GCCGCGCTGG TTGGCGCCGG GCTGGGCAGC
GAGGTGGCGG TGGTCACCGA TGGCGAACTG TCGGGCCTGA ACAGCGGCAT CACCATCGGC
CAGGTGATGC CGGAGGCGGC CGAGGGCGGC CCGCTGGCGG CGGTGCGCGA CGGCGACGCG
ATCGAGATCG ACCTGACGGC ACGCCGCATC GAACTGCAGG TGCCGGCCGA GGAGGTGGCA
CGCCGGCTCG AAGGCTTCGT GCCGCCGGAG CCGGGCGGCC GCTTCGGCTG GCTGCACCTG
TACGGCACAC TGGTGCAGCC GCTGTCCAGG GGCGCGGTGC TCGGTGTGCG GCCGGTGCTG
ACGCCGGCGG ACCCGCAGCG CTGA
 
Protein sequence
MARSFRSNFE PGSTRWAVRH AQWSAMGIAE ADFDKPKIAV VNTSSGLSVC FQHLDGIAKR 
VAEAIRAAGG LPLEIRTAAP SDFVTSAGRQ GRYLMPTRDL IVNDIEVQVE GAELDGMLLL
SSCDKTTPAH LMAAGRLDVP SLVLACGYQL GRQCGEHHVD IEEVYKAVGT VKAGQMDLDT
LQGMCRVAID GPGVCAGLAT ANSMHCLAEA LGMALPGNAP VRADGPRLHA LAAQAGARIV
EMVQQDLRPR DILTPGAFRN AVRFAVATGC SVNVMRHLIA IAIESECPVD VIAEFERAAD
EVPMLTRIRP NGPDRIETFE AAGGVRGVLR QLAPQLDREV LTADGRRLGA LIDETPAPDV
ACIRPLADPF AREPGLMIIR GSLAPDGALV KLAAVPAAIR RFRGEARVFE DEALAIEGLK
TGAVRPGQVI VLRMLGPKGG PGTVFAASFM AALVGAGLGS EVAVVTDGEL SGLNSGITIG
QVMPEAAEGG PLAAVRDGDA IEIDLTARRI ELQVPAEEVA RRLEGFVPPE PGGRFGWLHL
YGTLVQPLSR GAVLGVRPVL TPADPQR