Gene BMASAVP1_A0003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMASAVP1_A0003 
SymbolhppD-1 
ID4679636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei SAVP1 
KingdomBacteria 
Replicon accessionNC_008785 
Strand
Start bp1499 
End bp2596 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content63% 
IMG OID639844281 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_991359 
Protein GI121600478 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.809 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAATCC CCACCTGGGA CAATCCCGTC GGCACCGACG GCTTCGAATT CATCGAATAC 
ACCGCCCCCG ATCCGAAGGC GCTCGGCCAA CTGTTCGAGC GAATGGGCTT CACCGCGGTC
GCCCGCCATC GCCACAAGGA CGTGACGCTG TACCGCCAGG GCGACATCAA CTTCATCATC
AACGCGGAAC CCGATTCGTT CGCGCAACGC TTCGCGCGGC TGCACGGGCC GTCGATCTGC
GCGATCGCAT TCCGCGTGCA GGACGCCGCG AAAGCGTACA GGCATGCGCT CGAGCTCGGC
GCATGGGGCT TCGACAACAA GACGGGCCCG ATGGAGCTGA ACATCCCGGC GATCAAGGGC
ATCGGCGATT CGCTGATCTA CTTCGTCGAC CGCTGGCGCG GCAAGAACGG CGCGAAGCCG
GGCGCGATCG GCGATATCAG CATCTACGAC GTCGATTTCG AGCCGATTCC GGGCGCCGAT
CCGAACCCGG CCGGCCACGG CCTCACGTAC ATCGATCACC TCACGCACAA CGTCCACCGC
GGCCGCATGC AGGAATGGGC GGAGTTCTAC GAGCGCCTGT TCAACTTCCG CGAGGTTCGC
TACTTCGACA TCGAAGGCAA GGTGACGGGC GTGAAATCGA AGGCGATGAC GTCGCCGTGC
GGCAAGATCC GGATTCCGAT CAACGAGGAA GGCTCGGACA CGGCCGGCCA GATCCAGGAA
TATCTGGACG CGTATCGCGG CGAAGGCATC CAGCACATCG CGCTCGGCGC GGCCGACATC
TATCGGGCGG TCGACGGCCT GCGCGCGAAG GGCGTGACGC TGCTCGACAC GATCGACACG
TACTACGAGC TCGTCGATCG CCGCGTGCCG AACCACGGCG AGCCGCTCGA CGAGCTCAGA
AAGCGCAAGA TCCTGATCGA CGGCGCGCAC GACGATCTGC TGCTGCAGAT CTTCACCGAG
AACCAGATCG GGCCGATCTT CTTCGAGATT ATTCAGCGCA AGGGTAATCA GGGTTTCGGC
GAGGGCAACT TCAAGGCGCT GTTCGAATCG ATCGAACTCG ACCAGATCCG CCGCGGCGTC
GTGCAGGACA AGGCGTAA
 
Protein sequence
MQIPTWDNPV GTDGFEFIEY TAPDPKALGQ LFERMGFTAV ARHRHKDVTL YRQGDINFII 
NAEPDSFAQR FARLHGPSIC AIAFRVQDAA KAYRHALELG AWGFDNKTGP MELNIPAIKG
IGDSLIYFVD RWRGKNGAKP GAIGDISIYD VDFEPIPGAD PNPAGHGLTY IDHLTHNVHR
GRMQEWAEFY ERLFNFREVR YFDIEGKVTG VKSKAMTSPC GKIRIPINEE GSDTAGQIQE
YLDAYRGEGI QHIALGAADI YRAVDGLRAK GVTLLDTIDT YYELVDRRVP NHGEPLDELR
KRKILIDGAH DDLLLQIFTE NQIGPIFFEI IQRKGNQGFG EGNFKALFES IELDQIRRGV
VQDKA