Gene Mpe_A0346 details


Gene Information

Locus tag: Mpe_A0346
Symbol:
ID: 4786837
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 380872
End bp: 381984
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 68%
IMG OID: 640088901
Product: 4-hydroxyphenylpyruvate dioxygenase
Protein accession: YP_001019543
Protein GI: 124265539
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins
TIGRFAM ID: [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase
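
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed, but the record itself does not state them. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 380872
END_BP = 381984
GENE_LENGTH_BP = 1113
PROTEIN_LENGTH_AA = 370


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```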


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.152583
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAATTCA CCCCTTGGGA CAACCCGATG GGCACCGACG GCTTCGAGTT CATCGAATAC 
GCCGCGCCGG ACCCCGCCGC GATGGGCGCG CTGTTCGAGC GCATGGGCTT CCGCGCGATC
GCCAAGCACC GCCACAAGCA GGTCACGCTG TACCGGCAGG GCGAGATCAA CTTCATCGTC
AATGCGGAGC CCGACTCGTT CGCGCAGCGC TTCGCGCGCC TGCACGGCCC GAGCATCTGC
GCCATCGCGT TCCGCGTGCG GGACGCCAAG GCCGCCAGTG AGCGAGCGGT CGCGCTCGGC
GCGTGGGGCT ATGCCGGCCA CGCCGGTCCC GGCGAGCTGA ACATCCCTGC CATCAAGGGC
GTGGGCGACT CGCTGATCTA CCTGGTCGAC CGCTGGCGCG GCAAGAACGG CGCGAAGGAC
GGCGACATCG GCAACATCGG CTTCTTCGAC GTCGACTTCG AACCGCTGCC CGGCGCCACG
CTCACGCCCG TCGGCCACGG CCTCACCGTC GTCGACCACC TGACGCACAA CGTGCACCGC
GGCCGCATGG CCGAGTGGGC CGAGTTCTAT GCGCGGCTGT TCAACTTCCG CGAGATCCGC
TACTTCGACA TCGAGGGCCA GGTGACCGGC GTGAAGAGCA AGGCCATGAC CAGCCCCTGC
GGCAAGATCC GCATCCCGAT CAACGAAGAG GGCAACGAGA CCCCGGGGCA GATCCAGGAG
TACCTGGACC GCTACCACGG CGAGGGCATC CAGCACGTCG CGCTCGGCTC GGGCGACCTG
CACGCCACCG TCGACGCGCT GCGCGTCCAG GGCGTGAAGC TGCTCGACAC GCCCGACACC
TACTACGAGT TTGTCGACCG GCGCATCCCC GGTCACGGCG AGGACCTCGC GGCACTGCGT
TCGCGCGGGA TCCTGGTCGA TGGCAAGGCC GGCGAACTGC TGCTGCAGAT CTTCAGCGAG
AACCAGCTCG GGCCGATCTT CTTCGAGTTC ATCCAGCGCA AGGGTGACCA GGGCTTCGGC
GAGGGCAACT TCAAGGCCCT GTTCGAGAGC ATCGAGCTCG ACCAGATGCG CCGCGGCGTG
CTGGCCGGCG AGGCCGCGAC GCAGGGCGCC TGA
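
The GC content field can be recomputed from the gene sequence as printed above. This is a minimal sketch, assuming GC content means the fraction of G and C bases over the whole sequence. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example runs only on the first printed line rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence; paste the full sequence for the whole-gene value.
    first_line = "ATGCAATTCA CCCCTTGGGA CAACCCGATG GGCACCGACG GCTTCGAGTT CATCGAATAC"
    print(f"{gc_fraction(first_line):.1%}")
```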
 
Protein sequence
MQFTPWDNPM GTDGFEFIEY AAPDPAAMGA LFERMGFRAI AKHRHKQVTL YRQGEINFIV 
NAEPDSFAQR FARLHGPSIC AIAFRVRDAK AASERAVALG AWGYAGHAGP GELNIPAIKG
VGDSLIYLVD RWRGKNGAKD GDIGNIGFFD VDFEPLPGAT LTPVGHGLTV VDHLTHNVHR
GRMAEWAEFY ARLFNFREIR YFDIEGQVTG VKSKAMTSPC GKIRIPINEE GNETPGQIQE
YLDRYHGEGI QHVALGSGDL HATVDALRVQ GVKLLDTPDT YYEFVDRRIP GHGEDLAALR
SRGILVDGKA GELLLQIFSE NQLGPIFFEF IQRKGDQGFG EGNFKALFES IELDQMRRGV
LAGEAATQGA
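
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is not tied to any particular library. It builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA sequence read in frame from its first base."""
    seq = "".join(seq.split()).upper()  # drop the spacing used for display
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate("ATGCAATTCA CCCCTTGGGA CAACCCGATG"))  # prints MQFTPWDNPM
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence should reproduce the whole protein up to the stop codon.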