Gene Mpe_A2947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2947 
Symbol 
ID4784369 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3131029 
End bp3132123 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content72% 
IMG OID640091518 
ProductGTP cyclohydrolase II / 3,4-dihydroxy-2-butanone 4-phosphate synthase 
Protein accessionYP_001022135 
Protein GI124268131 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase
[COG0807] GTP cyclohydrolase II 
TIGRFAM ID[TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.631014 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.509105 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCTTT CCCCTGTTCC CGAGCTGATC GCCGAGCTCG CCGCCGGCCG CATGGTCATC 
CTGGTCGACG AGGAAGATCG CGAGAACGAG GGCGACCTCG TGATGGCCGC CGAGCACGTC
ACGCCCGAAG CCATCAACTT CATGGCGCGC TACGGCCGCG GCCTGATCTG CCTCACGCTC
ACCCGCGAGC GCTGCGAACG GCTGCAACTG CCGCCGATGG CCGCGCGCAA CGGCACACAG
CACGGCACTG CCTTCACCGT CTCGATCGAG GCCGCCAGCG GCGTGACCAC CGGCATCTCC
GCGGCCGACC GCGCACGCAC CGTGCAGGCC GCGGTGGCGC GCGACGCGAA GGCCAGCGAC
CTGGTCCAGC CCGGCCACAT CTTCCCGCTG CAGGCGCAGG ACGGCGGCGT GCTGATGCGC
GCCGGGCACA CCGAGGCCGG CTGCGACCTC ACGCTGATGG CCGGCCTGAC GCCTGCGTCG
GTGATCTGCG AGATCATGAA GGACGACGGC ACGATGGCCC GGCTGCCCGA CCTGGTGCTG
TTCGCCAGGG AACACGGGCT CAAGATCGGC AGCATCGCCG ACCTGATCGA ATACCGCAGC
CGCAACGAAT CGCTCATCAC GCGCGTCGCC CAGCGCGTGC TGGTCACGCC GCAGGGCCCG
TTCGATTGCC AGGCCTTCCG CGACCGCTCG GGCGCCGTGC ACTTGGCGCT GAGCGTGGGG
CAATGGGGTG CCGACGACGA GGTGCTGGTG CGAGTGCACG AGCCGCTGTC GGTGCTGGAC
CTGCTCGACG CCGGCCGCTG CGGCCATTCG TGGCCGCTGC CACGGGCACT GGCCGCGCTG
CGGGCGGCCG AACGTGGCGT GGCGGTCCTG CTGAACTGCG GCGAGGCGGG CAACGGCCTG
CTGCAGCAGC TGACCGTGGG CCCCGACGCG CCGCCGACCC CGCGCGGCCA GATGGACCTG
CGCACCTACG GCGTGGGCGC GCAGATCCTG CGCGAACTCG GCATCGTCCG CATGAAGCTG
CTCGGCAGCC CGCGCCGCAT GCCCAGCATG GTGGGCTACG GACTCGAGGT CACCGGCTTC
GTCGCCGCCG AATGA
 
Protein sequence
MALSPVPELI AELAAGRMVI LVDEEDRENE GDLVMAAEHV TPEAINFMAR YGRGLICLTL 
TRERCERLQL PPMAARNGTQ HGTAFTVSIE AASGVTTGIS AADRARTVQA AVARDAKASD
LVQPGHIFPL QAQDGGVLMR AGHTEAGCDL TLMAGLTPAS VICEIMKDDG TMARLPDLVL
FAREHGLKIG SIADLIEYRS RNESLITRVA QRVLVTPQGP FDCQAFRDRS GAVHLALSVG
QWGADDEVLV RVHEPLSVLD LLDAGRCGHS WPLPRALAAL RAAERGVAVL LNCGEAGNGL
LQQLTVGPDA PPTPRGQMDL RTYGVGAQIL RELGIVRMKL LGSPRRMPSM VGYGLEVTGF
VAAE