Gene Mpe_A1996 details


Gene Information

Locus tag: Mpe_A1996
Symbol: ispG
ID: 4783783
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 2138219
End bp: 2139517
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 68%
IMG OID: 640090566
Product: 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase
Protein accession: YP_001021189
Protein GI: 124267185
COG category: [I] Lipid transport and metabolism
COG ID: [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis
TIGRFAM ID: [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase
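
The start, end, gene length, and protein length fields above agree with one another, which a short check can confirm. This is a minimal sketch: it assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2138219
end_bp = 2139517

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1299 432
```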


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.165352
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAGGC CAGTAGCGTC CACAGGGACC CGCAGTGACG CCGAAGCGAT CGACGTGGCG 
CAGCCTGCCG CGCGCCGTTC GCTGCAGGCG CGTGTCGTCT GGGGCAGCAA TGTCGTGACG
GTCGGTGGCG ACGCCCCGGT GCGCGTGCAG TCCATGACCA ACACCGACAC GGTGAATGCG
ATCGAGACTG CGATCCAGGT CAAGGAGCTG GCGTTGGCGG GCTCGGAGTT GGTGCGCATC
ACGGTCAACA CGCCGGAGGC GGCGCAGGCC GTGCCGCACG TGCGCGAGCA GCTAGACCGC
ATGGGCATCT CGGTGCCGCT GATCGGCGAT TTCCACTACA ACGGCCACCG CCTGCTGACC
GAGTTCCCGG ACTGCGCGGC CGCGCTGTCG AAGTACCGCA TCAATCCCGG CAACGTGGGC
AAGGGCGACA AGCGCGACCG GCAGTTCGCG ATGATGATCG AGGCCGCGAT GCGCCACGAC
AAGCCTGTGC GTATCGGCGT CAACTGGGGC AGCCTCGATC AGGAACTGCT GGCGGCTTTG
ATGGACGAGA ACGCCGCCCG CGCCCGGCCC TGGGACGCGA AGCAGGTGAT GTACCACGCG
CTGATCAGCT CGGCGCTGCA GTCGGCCGCA TATGCGCGCG AGCTGGGCAT GGACCCGTCC
CAGATCCTCA TCAGCTGCAA GGTCAGCGGC GTGCAGGACC TCGTGAGCGT CTACCGCGCA
CTGGCGCGAC GCTGCGATTA CCCGCTGCAC CTCGGGCTCA CCGAAGCCGG CATGGGCACC
AAGGGCACCG TGGCGTCGAC CGCGGCGCTG GCGATGCTGC TGCAGGACGG CATCGGCGAC
ACCATCCGCG TCAGCCTCAC GCCGCAGCCG GGCGAGGCCC GCACGCAGGA GGTGGTGGTG
GCGCTCGAGA TCCTGCAGTC GCTCGGCCTG CGTGCCTTCA ATCCCAGCGT CACCGCCTGC
CCGGGCTGCG GCCGCACCAC CAGCACCACC TTCCAGGAGC TGGCCAAGCA GATCGACGAC
TTCCTGCGGG CACAGATGCC GGTCTGGAAG GCGCGTTACC CGGGCGTGGA GAACATGAAG
GTGGCGGTGA TGGGCTGCAT CGTCAACGGG CCTGGCGAGA GCAAGCATGC CGATATCGGC
ATCAGCCTGC CCGGCACCGG CGAGGCGCCT GCCGCGCCGG TGTTCATCGA TGGCGAGAAG
GCGATGACCT TGCGCGGCGA GGGCATCGCG CGCGAGTTCC AGAACGTCGT CGAGCACTAC
ATCGAGCGCC GTTACGGCAG CATCACCGCC GCGCATTGA
 
Protein sequence
MNRPVASTGT RSDAEAIDVA QPAARRSLQA RVVWGSNVVT VGGDAPVRVQ SMTNTDTVNA 
IETAIQVKEL ALAGSELVRI TVNTPEAAQA VPHVREQLDR MGISVPLIGD FHYNGHRLLT
EFPDCAAALS KYRINPGNVG KGDKRDRQFA MMIEAAMRHD KPVRIGVNWG SLDQELLAAL
MDENAARARP WDAKQVMYHA LISSALQSAA YARELGMDPS QILISCKVSG VQDLVSVYRA
LARRCDYPLH LGLTEAGMGT KGTVASTAAL AMLLQDGIGD TIRVSLTPQP GEARTQEVVV
ALEILQSLGL RAFNPSVTAC PGCGRTTSTT FQELAKQIDD FLRAQMPVWK ARYPGVENMK
VAVMGCIVNG PGESKHADIG ISLPGTGEAP AAPVFIDGEK AMTLRGEGIA REFQNVVEHY
IERRYGSITA AH
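
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is only an illustration, not the method the database used: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, the codon table is built from the NCBI amino-acid string for translation table 11, and alternative start codons are not handled (this gene begins with ATG, so that does not matter here). The demo uses only the first and last rows of the gene sequence; passing the full sequence works the same way.

```python
from itertools import product

# Codon table in NCBI "TCAG" order. For translation table 11 the
# codon-to-amino-acid assignments match the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final row of the gene sequence, as displayed above.
first_bases = "ATGAACAGGC CAGTAGCGTC CACAGGGACC"
last_row = "ATCGAGCGCC GTTACGGCAG CATCACCGCC GCGCATTGA"

print(translate(first_bases))  # MNRPVASTGT
print(translate(last_row))     # IERRYGSITAAH
```

The two printed fragments match the start and end of the protein sequence listed above. The last row can be translated on its own because the 1260 bases before it are a multiple of three, so it starts in frame.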