Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0346 |
Symbol | |
ID | 4786837 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 380872 |
End bp | 381984 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640088901 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_001019543 |
Protein GI | 124265539 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.152583 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATTCA CCCCTTGGGA CAACCCGATG GGCACCGACG GCTTCGAGTT CATCGAATAC GCCGCGCCGG ACCCCGCCGC GATGGGCGCG CTGTTCGAGC GCATGGGCTT CCGCGCGATC GCCAAGCACC GCCACAAGCA GGTCACGCTG TACCGGCAGG GCGAGATCAA CTTCATCGTC AATGCGGAGC CCGACTCGTT CGCGCAGCGC TTCGCGCGCC TGCACGGCCC GAGCATCTGC GCCATCGCGT TCCGCGTGCG GGACGCCAAG GCCGCCAGTG AGCGAGCGGT CGCGCTCGGC GCGTGGGGCT ATGCCGGCCA CGCCGGTCCC GGCGAGCTGA ACATCCCTGC CATCAAGGGC GTGGGCGACT CGCTGATCTA CCTGGTCGAC CGCTGGCGCG GCAAGAACGG CGCGAAGGAC GGCGACATCG GCAACATCGG CTTCTTCGAC GTCGACTTCG AACCGCTGCC CGGCGCCACG CTCACGCCCG TCGGCCACGG CCTCACCGTC GTCGACCACC TGACGCACAA CGTGCACCGC GGCCGCATGG CCGAGTGGGC CGAGTTCTAT GCGCGGCTGT TCAACTTCCG CGAGATCCGC TACTTCGACA TCGAGGGCCA GGTGACCGGC GTGAAGAGCA AGGCCATGAC CAGCCCCTGC GGCAAGATCC GCATCCCGAT CAACGAAGAG GGCAACGAGA CCCCGGGGCA GATCCAGGAG TACCTGGACC GCTACCACGG CGAGGGCATC CAGCACGTCG CGCTCGGCTC GGGCGACCTG CACGCCACCG TCGACGCGCT GCGCGTCCAG GGCGTGAAGC TGCTCGACAC GCCCGACACC TACTACGAGT TTGTCGACCG GCGCATCCCC GGTCACGGCG AGGACCTCGC GGCACTGCGT TCGCGCGGGA TCCTGGTCGA TGGCAAGGCC GGCGAACTGC TGCTGCAGAT CTTCAGCGAG AACCAGCTCG GGCCGATCTT CTTCGAGTTC ATCCAGCGCA AGGGTGACCA GGGCTTCGGC GAGGGCAACT TCAAGGCCCT GTTCGAGAGC ATCGAGCTCG ACCAGATGCG CCGCGGCGTG CTGGCCGGCG AGGCCGCGAC GCAGGGCGCC TGA
|
Protein sequence | MQFTPWDNPM GTDGFEFIEY AAPDPAAMGA LFERMGFRAI AKHRHKQVTL YRQGEINFIV NAEPDSFAQR FARLHGPSIC AIAFRVRDAK AASERAVALG AWGYAGHAGP GELNIPAIKG VGDSLIYLVD RWRGKNGAKD GDIGNIGFFD VDFEPLPGAT LTPVGHGLTV VDHLTHNVHR GRMAEWAEFY ARLFNFREIR YFDIEGQVTG VKSKAMTSPC GKIRIPINEE GNETPGQIQE YLDRYHGEGI QHVALGSGDL HATVDALRVQ GVKLLDTPDT YYEFVDRRIP GHGEDLAALR SRGILVDGKA GELLLQIFSE NQLGPIFFEF IQRKGDQGFG EGNFKALFES IELDQMRRGV LAGEAATQGA
|
| |