Gene BTH_I3106 details

Gene Information

Locus tag: BTH_I3106
Symbol: hppD
ID: 3848787
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia thailandensis E264
Kingdom: Bacteria
Replicon accession: NC_007651
Strand:
Start bp: 3539703
End bp: 3540800
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 64%
IMG OID: 637842772
Product: 4-hydroxyphenylpyruvate dioxygenase
Protein accession: YP_443601
Protein GI: 83720009
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins
TIGRFAM ID: [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase
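
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3539703
end_bp = 3540800
gene_length_bp = 1098
protein_length_aa = 365


def span_length(start, end):
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both comparisons should hold if the record is internally consistent.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions, the span from 3539703 to 3540800 covers 1098 bp, which corresponds to 366 codons: 365 residues plus one stop codon.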


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAATCC CCACCTGGGA CAATCCCGTC GGCACCGACG GCTTCGAATT CATCGAATAC 
ACCGCCCCCG ATCCGAAAGC GCTCGGCCAA CTGTTCGAGC GGATGGGCTT CACCGCGGTT
GCGCGCCATC GCCATAAGGA CGTGACGCTG TACCGCCAGG GCGACATCAA CTTCATCATC
AACGCGGAAC CGGATTCGTT CGCACAGCGC TTCGCGCGGC TGCACGGGCC GTCGATCTGC
GCGATCGCGT TTCGCGTGCA GGATGCCGCG AAGGCGTACA AGCACGCGCT CGAACTCGGC
GCGTGGGGCT TCGACAACAA GACGGGCCCG ATGGAGCTGA ACATCCCGGC GATCAAGGGC
ATCGGCGATT CGCTGATCTA CTTCGTCGAC CGCTGGCGCG GCAAGAACGG CGCGAAGCCG
GGCGCGATCG GCGACATCAG CATCTACGAC GTCGACTTCG AGCCGATCCC GGGCGTCGAT
CCGAACCCGG TCGGCCACGG CCTCACGTAC ATCGACCATC TGACGCACAA CGTCCACCGC
GGCCGCATGC AGGAATGGGC GGCGTTCTAC GAGCGCCTGT TCAACTTCCG CGAAGTCCGC
TACTTCGACA TCGAAGGCAA GGTGACGGGC GTGAAGTCGA AGGCAATGAC GTCGCCGTGC
GGCAAGATCC GGATTCCGAT CAACGAGGAA GGCTCGGACA CGGCCGGCCA GATCCAGGAA
TACCTCGACG CTTATCGCGG CGAAGGCATC CAGCACATCG CGCTCGGCGC GGCCGACATC
TATCGGGCGG TCGACGGACT GCGCGCGACG GGCGTGACGC TGCTCGACAC GATCGACACG
TACTACGAGC TCGTCGACCG CCGCGTGCCG AACCACGGAG AGCCGCTCGA CGAGCTCAGG
AAGCGCAAGA TCCTGATCGA CGGCGCGCGC GACGAACTGC TGCTGCAGAT CTTCACCGAG
AACCAGATCG GGCCGATCTT CTTCGAGATC ATCCAGCGCA AGGGCAATCA GGGCTTCGGC
GAAGGCAACT TCAAGGCGCT GTTCGAATCG ATCGAGCTCG ACCAGATCCG CCGCGGCGTC
GTGCAGGACA AGGCTTAA
 
Protein sequence
MQIPTWDNPV GTDGFEFIEY TAPDPKALGQ LFERMGFTAV ARHRHKDVTL YRQGDINFII 
NAEPDSFAQR FARLHGPSIC AIAFRVQDAA KAYKHALELG AWGFDNKTGP MELNIPAIKG
IGDSLIYFVD RWRGKNGAKP GAIGDISIYD VDFEPIPGVD PNPVGHGLTY IDHLTHNVHR
GRMQEWAAFY ERLFNFREVR YFDIEGKVTG VKSKAMTSPC GKIRIPINEE GSDTAGQIQE
YLDAYRGEGI QHIALGAADI YRAVDGLRAT GVTLLDTIDT YYELVDRRVP NHGEPLDELR
KRKILIDGAR DELLLQIFTE NQIGPIFFEI IQRKGNQGFG EGNFKALFES IELDQIRRGV
VQDKA