Gene Bcep18194_B0343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_B0343 
Symbol 
ID3752102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007511 
Strand
Start bp366233 
End bp368125 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content68% 
IMG OID637765188 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_371103 
Protein GI78061195 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG1082] Sugar phosphate isomerases/epimerases
[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.880013 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCGCT CGATCGCCAC CGTGTCACTG TCCGGCACGC TCGCCGAGAA ACTCGCCGCC 
ATCCAGGCCG CAGGCTTCGA CGGCGTCGAA ATCTTCGAAA ACGACCTCGT CTACTTCGAT
GGATCGCCGG CCGACGTCCG GCGCATGGCC GCGGATCTCG GCCTCGACAT CGTGCTGTTC
CAGCCGTTTC GCGACTTCGA GGGCGTGAGC GCGACCCAGC TCGCCCGCAA CCTCGACCGG
ATCAAGCGCA AGTTCGACGT GATGCATGCG CTCGGCACCG ACCGCATCCT GGTGTGCAGC
AACGTGTCGG CCGACACGAT CGCGGACGAC GGGCTGCTCG TCGACCAGCT CGGTACGCTC
GCTGAGGCCG CGCGGCAGGC CGGCGTGGTC GCCGCGTACG AAGCGCTCGC GTGGGGCCGT
GTCGTGAAGC GGTACGGCCA TGCGTGGCAA CTCGTGAATA CCGTGAACAG CCCGCATCTC
GGCCTGGCGC TCGACAGCTT CCACACGCTG TCGCTCGACG ATTCGCCGGA CGCAATCGCC
GACATCCCCG GCGACCGGAT CGCGTTCGTG CAGATCGCCG ATGCGCCGAA GCTCGCGATG
GACGTGCTCG AATGGAGCCG CCATTACCGC TCGTTCCCGG GCCAGGGCGA TTTCGACCTC
GCGCACTTCA CTGCGCGCGT GATCGAGTCC GGCTATACGG GGCCGCTGTC GCTCGAGATC
TTCAACGACG GCTTCCGCGC GGCGCCCACG GCGATCACGG CCGCCGACGG CCATCGCTCG
CTGCTCTATC TGGAGGAACT GACGCGCGAG CGGCTCGCAC AGGACGGCCA TGCGCCGGCC
GCCAGCCAGC CGCTGTTCGC GCCGCCGGCG CCGCCCGCGC ACGTCGGGTT CCAGTTCATC
GAATTCGCGG TCGACGCGCA GGCGGCGGCG ACCGTCGGCG AATGGCTCGG CCGGATGCGC
TTCCGGCTCG CGGGCCGCCA TCGGTCGAAG GACGTGACGC TGTTCCAGCA CGGCGCCGCG
TCGATCGTGC TGAACGCGGA GCGCGATTCG TTCGCCGATG CGTTCTTCCA GCAGCATGGG
CTGTCGCTGT GTGCATCGGC ATTCCGCGTC GACGACGCGA ACGTGGCGTT CGAGCGCGCA
GCAGGCTTCG GCTATGCGCC GTTCTCCGGC CGCGTCGGGC CGAACGAGCG CGTGTTGCCG
AGCGTGCAGG CGCCGGACGG CAGCCTCGAA TACTTCGTCG ACGAAGCGCC GAACGCGCCG
ACGCTGTACG AATCGGATTT CGTGCTGACC GACATCGACG GCCCGAGCGA AGTCGGGCCG
CTGACCGGTA TCGACCATGT GTGCCTCGCC CTGCCGGCCG ACGCGCTCGA TACGTGGATC
CTGTTCTTCA AGACGGCATT CGGCTTCGAG GCCGAGCGCA GCTGGCTCGT GCCCGACCCG
TACGGGCTAA TGCGCAGCCG CGCGGTGCGC AGCGCCGACG GCTCGGTGCG GATCGCGCTG
AACGCGTCGG TCGACCGGCA CACGGCGGTG GCCGAATCGC TCGACCGCTA CCACGGCACA
GGGCTGAACC ACGTCGCATT CCGCACGGAC GACATCGTGA AGACGATCGC CGCGTTCGCG
GCCGACGGCA TTCCGTTCCT GCGCATTCCG CCGAACTACT ACGACGATCT CGCCGCACGC
TACGCGTTGT CGGACGAACT GATCGACACG CTGAGCACGC ATCACCTGCT GTACGACCGC
GACGAGCACG GCGGCGAATT CCTGCATGCG TATACAGAAC TGGTCGACAA CCGCTTCTCG
CTCGAGATCG TCGAGCGGCG CGGCGGATAC GACGGCTATG GCGCGACAAA CGCGGCCGTG
CGGCTCGCCG CACAGGCGCA GCGCAGGAAA TAA
 
Protein sequence
MQRSIATVSL SGTLAEKLAA IQAAGFDGVE IFENDLVYFD GSPADVRRMA ADLGLDIVLF 
QPFRDFEGVS ATQLARNLDR IKRKFDVMHA LGTDRILVCS NVSADTIADD GLLVDQLGTL
AEAARQAGVV AAYEALAWGR VVKRYGHAWQ LVNTVNSPHL GLALDSFHTL SLDDSPDAIA
DIPGDRIAFV QIADAPKLAM DVLEWSRHYR SFPGQGDFDL AHFTARVIES GYTGPLSLEI
FNDGFRAAPT AITAADGHRS LLYLEELTRE RLAQDGHAPA ASQPLFAPPA PPAHVGFQFI
EFAVDAQAAA TVGEWLGRMR FRLAGRHRSK DVTLFQHGAA SIVLNAERDS FADAFFQQHG
LSLCASAFRV DDANVAFERA AGFGYAPFSG RVGPNERVLP SVQAPDGSLE YFVDEAPNAP
TLYESDFVLT DIDGPSEVGP LTGIDHVCLA LPADALDTWI LFFKTAFGFE AERSWLVPDP
YGLMRSRAVR SADGSVRIAL NASVDRHTAV AESLDRYHGT GLNHVAFRTD DIVKTIAAFA
ADGIPFLRIP PNYYDDLAAR YALSDELIDT LSTHHLLYDR DEHGGEFLHA YTELVDNRFS
LEIVERRGGY DGYGATNAAV RLAAQAQRRK