Gene Information |
Locus tag | BTH_I3106 |
Symbol | hppD |
ID | 3848787 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 3539703 |
End bp | 3540800 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637842772 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_443601 |
Protein GI | 83720009 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
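The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in Protein Length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(3539703, 3540800)
print(length_bp)                  # 1098
print(protein_length(length_bp))  # 365
```

Both results agree with the Gene Length (1098 bp) and Protein Length (365 aa) fields.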
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAATCC CCACCTGGGA CAATCCCGTC GGCACCGACG GCTTCGAATT CATCGAATAC ACCGCCCCCG ATCCGAAAGC GCTCGGCCAA CTGTTCGAGC GGATGGGCTT CACCGCGGTT GCGCGCCATC GCCATAAGGA CGTGACGCTG TACCGCCAGG GCGACATCAA CTTCATCATC AACGCGGAAC CGGATTCGTT CGCACAGCGC TTCGCGCGGC TGCACGGGCC GTCGATCTGC GCGATCGCGT TTCGCGTGCA GGATGCCGCG AAGGCGTACA AGCACGCGCT CGAACTCGGC GCGTGGGGCT TCGACAACAA GACGGGCCCG ATGGAGCTGA ACATCCCGGC GATCAAGGGC ATCGGCGATT CGCTGATCTA CTTCGTCGAC CGCTGGCGCG GCAAGAACGG CGCGAAGCCG GGCGCGATCG GCGACATCAG CATCTACGAC GTCGACTTCG AGCCGATCCC GGGCGTCGAT CCGAACCCGG TCGGCCACGG CCTCACGTAC ATCGACCATC TGACGCACAA CGTCCACCGC GGCCGCATGC AGGAATGGGC GGCGTTCTAC GAGCGCCTGT TCAACTTCCG CGAAGTCCGC TACTTCGACA TCGAAGGCAA GGTGACGGGC GTGAAGTCGA AGGCAATGAC GTCGCCGTGC GGCAAGATCC GGATTCCGAT CAACGAGGAA GGCTCGGACA CGGCCGGCCA GATCCAGGAA TACCTCGACG CTTATCGCGG CGAAGGCATC CAGCACATCG CGCTCGGCGC GGCCGACATC TATCGGGCGG TCGACGGACT GCGCGCGACG GGCGTGACGC TGCTCGACAC GATCGACACG TACTACGAGC TCGTCGACCG CCGCGTGCCG AACCACGGAG AGCCGCTCGA CGAGCTCAGG AAGCGCAAGA TCCTGATCGA CGGCGCGCGC GACGAACTGC TGCTGCAGAT CTTCACCGAG AACCAGATCG GGCCGATCTT CTTCGAGATC ATCCAGCGCA AGGGCAATCA GGGCTTCGGC GAAGGCAACT TCAAGGCGCT GTTCGAATCG ATCGAGCTCG ACCAGATCCG CCGCGGCGTC GTGCAGGACA AGGCTTAA
|
Protein sequence | MQIPTWDNPV GTDGFEFIEY TAPDPKALGQ LFERMGFTAV ARHRHKDVTL YRQGDINFII NAEPDSFAQR FARLHGPSIC AIAFRVQDAA KAYKHALELG AWGFDNKTGP MELNIPAIKG IGDSLIYFVD RWRGKNGAKP GAIGDISIYD VDFEPIPGVD PNPVGHGLTY IDHLTHNVHR GRMQEWAAFY ERLFNFREVR YFDIEGKVTG VKSKAMTSPC GKIRIPINEE GSDTAGQIQE YLDAYRGEGI QHIALGAADI YRAVDGLRAT GVTLLDTIDT YYELVDRRVP NHGEPLDELR KRKILIDGAR DELLLQIFTE NQIGPIFFEI IQRKGNQGFG EGNFKALFES IELDQIRRGV VQDKA
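The sequence fields can be related to the GC content and Translation table entries with a short script. This is a minimal sketch, not the method used to generate the record: the helper names are hypothetical, sequences are assumed to contain only A, C, G and T (plus the spacing used in this record), and the codon assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCAAATCC CCACCTGGGA CAATCCCGTC"))  # MQIPTWDNPV
```

Passing the full Gene sequence value to `gc_fraction` and `translate` allows the results to be compared with the GC content and Protein sequence fields.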