Gene Mpe_A1387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1387 
Symbol 
ID4783980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1496461 
End bp1498143 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content68% 
IMG OID640089953 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001020584 
Protein GI124266580 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.63064 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.987031 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATCA ACCGCCGTTC GAAGAACATC ACCGAAGGCG TGGCCCGCGC ACCCAACCGC 
TCGATGTACT ACGCGATGGG CTACCAGGAG GCCGACTTCA AGAAGCCGAT GATCGGCGTG
GCGAACGGCC ACTCGACCAT CACGCCCTGC AACTCGGGCC TGCAGAAGCT GGCCGACGCC
GCGGTCGAGG GCATCGAGGC GGCCGGCGGC AATGCACAGA TCTTCGGCAC CCCCACCATC
AGCGACGGCA TGGCGATGGG CACCGAGGGC ATGAAGTACT CGCTGGTCTC GCGCGAGGTC
ATCGCCGACT GCGTGGAAAC CTGCGTCGGC GGCCAGTGGA TGGACGGCGT GCTGGTGGTC
GGCGGCTGCG ACAAGAACAT GCCGGGCGGC ATGATGGGCA TGCTGCGCGC CAACGTGCCC
GCGATCTACG TCTACGGCGG CACCATCCTG CCGGGAAAGT ACAAGGGCCA GGATCTCAAC
ATCGTCAGCG TGTTCGAGGC CGTCGGCCAG TTCACCGCGG GCAACATGAG CGAGGAAGAC
TTCTGCCAGA TCGAGCGACG CGCGATCCCG GGCAGTGGCT CCTGCGGGGG CATGTACACC
GCCAACACCA TGAGTTCGGC CTTCGAGGCC CTGGGCATGA GCCTGCCGTT CGCCTCCACG
ATGGCCAATG TCGAGGACCC GATCGTCGCG CACACCAAGG AAGCGGCGCG CGTGCTGGTC
GAGGCAGTGA AGGCCGACCT CAAGCCGCGT GACATCGTCA CACGCAAGAG CATCGAGAAC
GCGGTCGCGG TGATCATGGC CACCGGCGGC TCGACCAATG CGGTGCTGCA CTTCCTGGCC
ATCGCGCACG CCGCCGGCGT CGAGTGGACG ATCGACGACT TCGAGCGCGT GCGCCGCAAG
GTGCCGGTGC TGTGCGACCT CAAACCCAGC GGCAGGTACC TGGCGATCGA CCTGCACCGC
GCCGGCGGCA TCCCGCAGGT GATGAAGACG CTGCTCGCCG CCGGGCTGAT CCACGGCGAC
TGCATCACCA TCACCGGAAG GACCGTGGCC GAGAACCTGG CCGACATCCC CGATGCGCCG
CGCGCCGACC AGGACGTGAT CCGCCCGATC ACGAAGCCGA TGTACGAGCA AGGCCACCTG
GCCATCCTGA AGGGCAACCT GTCGCCTGAG GGCGCCGTGG CCAAGATCAC CGGCCTGAAG
AATCCCAGCA TCACTGGCCC GGCGCGCGTG TTCGACGACG AGCAGTCGGC GCTGGCCGCC
ATCATGGCCA AGCAGATCCA GGCCGGCGAC GTGATGGTGC TGCGCTACCT GGGCCCGATG
GGGGGCCCGG GCATGCCCGA GATGCTGGCG CCGACCGGTG CGCTGATCGG CCAAGGGCTG
GGCGAATCGG TGGGGCTCAT CACCGACGGC CGCTTCTCCG GCGGCACCTG GGGCATGGTG
GTCGGCCACG TGGCACCTGA GGCCGCGGCC GGCGGCACGA TCGCGCTGGT GCAGGAAGGC
GACTCGATCA CCATCGATGC GCACACGCTG GTGCTCAACC TCAACGTGAG CGAGGCCGAG
ATCGCAAAGC GTCGCGCCGC CTGGAAGGCA CCGGCGCCGC GCTACACACG CGGCGTGCTG
GCCAAGTTCG CGAAGAACGC GTCAAGCGCC AGCAGCGGCG CGGTATTGGA CCGCTTCGAG
TAG
 
Protein sequence
MSINRRSKNI TEGVARAPNR SMYYAMGYQE ADFKKPMIGV ANGHSTITPC NSGLQKLADA 
AVEGIEAAGG NAQIFGTPTI SDGMAMGTEG MKYSLVSREV IADCVETCVG GQWMDGVLVV
GGCDKNMPGG MMGMLRANVP AIYVYGGTIL PGKYKGQDLN IVSVFEAVGQ FTAGNMSEED
FCQIERRAIP GSGSCGGMYT ANTMSSAFEA LGMSLPFAST MANVEDPIVA HTKEAARVLV
EAVKADLKPR DIVTRKSIEN AVAVIMATGG STNAVLHFLA IAHAAGVEWT IDDFERVRRK
VPVLCDLKPS GRYLAIDLHR AGGIPQVMKT LLAAGLIHGD CITITGRTVA ENLADIPDAP
RADQDVIRPI TKPMYEQGHL AILKGNLSPE GAVAKITGLK NPSITGPARV FDDEQSALAA
IMAKQIQAGD VMVLRYLGPM GGPGMPEMLA PTGALIGQGL GESVGLITDG RFSGGTWGMV
VGHVAPEAAA GGTIALVQEG DSITIDAHTL VLNLNVSEAE IAKRRAAWKA PAPRYTRGVL
AKFAKNASSA SSGAVLDRFE