Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA10247_3528 |
Symbol | hppD |
ID | 4894777 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10247 |
Kingdom | Bacteria |
Replicon accession | NC_009080 |
Strand | - |
Start bp | 3472672 |
End bp | 3473769 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640152174 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_001083039 |
Protein GI | 126450956 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAATCC CCACCTGGGA CAATCCCGTC GGCACCGACG GCTTCGAATT CATCGAATAC ACCGCCCCCG ATCCGAAGGC GCTCGGCCAA CTGTTCGAGC GAATGGGCTT CACCGCGGTC GCCCGCCATC GCCACAAGGA CGTGACGCTG TACCGCCAGG GCGACATCAA CTTCATCATC AACGCGGAAC CCGATTCGTT CGCGCAACGC TTCGCGCGGC TGCACGGGCC GTCGATCTGC GCGATCGCAT TCCGCGTGCA GGACGCCGCG AAAGCGTACA GGCATGCGCT CGAGCTCGGC GCATGGGGCT TCGACAACAA GACGGGCCCG ATGGAGCTGA ACATCCCGGC GATCAAGGGC ATCGGCGATT CGCTGATCTA CTTCGTCGAC CGCTGGCGCG GCAAGAACGG CGCGAAGCCG GGCGCGATCG GCGATATCAG CATCTACGAC GTCGATTTCG AGCCGATTCC GGGCGCCGAT CCGAACCCGG CCGGCCACGG CCTCACGTAC ATCGATCACC TCACGCACAA CGTCCACCGC GGCCGCATGC AGGAATGGGC GGAGTTCTAC GAGCGCCTGT TCAACTTCCG CGAGGTTCGC TACTTCGACA TCGAAGGCAA GGTGACGGGC GTGAAATCGA AGGCGATGAC GTCGCCGTGC GGCAAGATCC GGATTCCGAT CAACGAGGAA GGCTCGGACA CGGCCGGCCA GATCCAGGAA TATCTGGACG CGTATCGCGG CGAAGGCATC CAGCACATCG CGCTCGGCGC GGCCGACATC TATCGGGCGG TCGACGGCCT GCGCGCGAAG GGCGTGACGC TGCTCGACAC GATCGACACG TACTACGAGC TCGTCGATCG CCGCGTGCCG AACCACGGCG AGCCGCTCGA CGAGCTCAGA AAGCGCAAGA TCCTGATCGA CGGCGCGCAC GACGATCTGC TGCTGCAGAT CTTCACCGAG AACCAGATCG GGCCGATCTT CTTCGAGATT ATTCAGCGCA AGGGTAATCA GGGTTTCGGC GAGGGCAACT TCAAGGCGCT GTTCGAATCG ATCGAACTCG ACCAGATCCG CCGCGGCGTC GTGCAGGACA AGGCGTAA
|
Protein sequence | MQIPTWDNPV GTDGFEFIEY TAPDPKALGQ LFERMGFTAV ARHRHKDVTL YRQGDINFII NAEPDSFAQR FARLHGPSIC AIAFRVQDAA KAYRHALELG AWGFDNKTGP MELNIPAIKG IGDSLIYFVD RWRGKNGAKP GAIGDISIYD VDFEPIPGAD PNPAGHGLTY IDHLTHNVHR GRMQEWAEFY ERLFNFREVR YFDIEGKVTG VKSKAMTSPC GKIRIPINEE GSDTAGQIQE YLDAYRGEGI QHIALGAADI YRAVDGLRAK GVTLLDTIDT YYELVDRRVP NHGEPLDELR KRKILIDGAH DDLLLQIFTE NQIGPIFFEI IQRKGNQGFG EGNFKALFES IELDQIRRGV VQDKA
|
| |