Gene Bcen2424_4395 details

Gene Information

Locus tag: Bcen2424_4395
Symbol:
ID: 4453160
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia cenocepacia HI2424
Kingdom: Bacteria
Replicon accession: NC_008543
Strand:
Start bp: 1339284
End bp: 1340411
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 68%
IMG OID: 639696451
Product: 4-hydroxyphenylpyruvate dioxygenase
Protein accession: YP_838022
Protein GI: 116692489
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins
TIGRFAM ID: [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase
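
The start, end, gene length and protein length fields can be cross-checked against each other. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1339284, 1340411)
print(length)                  # 1128
print(protein_length(length))  # 375
```

Under those assumptions both values agree with the Gene Length and Protein Length fields listed above.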


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.83775
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.881765
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGGCA ATTCCCACCC CTTGTCCAGC GATACGCCGC CCGTCGCCGA TCCGGCCGCC 
AATCCGCTCG GGATGGCCGG CCTCGAATTC GTCGAATTCG CGGCGCCCGT GCCGGACGCG
CTCGCGCAAC GCTTCGAGCA GCTTGGGTTC AAGGCGATCG CGCGGCACGT CAGCAAGAAC
GTCACGTTGT ACCGGCAGGG GCAGATGCAT TTCCTGATCA ACGCCGAGCC CGATTCGTTC
GCCGCGCGCT ATGCGGAGGA GTACGGGATG GGCGTCTGCG CGATCGGGCT GCGGGTGGCG
AATGCCCGGC GCGCGTTCGA ACGCGCGATC CAGCTCGGAG CATGGGCGTT CGAAGGCGAA
AAGGTCGGCG TCGGCGAGCT GAAGATCCCT GCCATCCAGG GCATCGGCGA TTCGCACCTG
TATTTCGTCG ACCGCTGGCG CGGGCGGGAC GGCCAGCGCG GCGGCGTCGG CGACATCTCC
ATCTTCGACA TCGATTTCCG GCCGATCGAC ATCGCGACCG CGCATACCGA TCTCGACTGC
GCGGGCGTCG GCCTGCAGCA GGTCGACCAC TTCACGCAAA CGGTCGGCGC GGGACGGATG
CAGGAGTGGC TGGATTTCTA CCACGACCTG CTGCATTTTC GCGAGATCCA CCAGATCGAC
GCGCACTGGC ATGTGTCGGA GGAATCGCGC GTAATGGTGT CGCCGTGCGG CGCGGTCCGG
ATTCCGGTCT ACGAGGAAGG CACGCGGCGT ACCGACCTGA TGCATGCGTA CCTGCCCGAC
CATCCGGGCG AGGGCGTGCA GCACGTCGCG CTGGCCACCG ACGACATCCT GTCGTGCGTC
GATGCCCTGC GCGCGAACGG CGTCGAGTTC ATCGAGCCGC CTGCACGCTA TTACGACGAC
GTCGATGCGC GGCTGCCCGC GCATGGTGTC GATCTCGACG CGCTGCGCCG CCGCGCGGTG
CTGGTCGACG GCGAGATCGG CAGCGACGGC GTGCCGAGGC TGTTCTTCCA GACCTTCGTC
AAACGCCGCC CCGGCGAGAT TTTCTTCGAG ATCGTGCAGC GCAAGGGGCA CCACGGGTTC
GGCGAGGGCA ACCTCGCGGC ACTCGCCCGC GCGCGCGACG CCGGGTGA
 
Protein sequence
MPGNSHPLSS DTPPVADPAA NPLGMAGLEF VEFAAPVPDA LAQRFEQLGF KAIARHVSKN 
VTLYRQGQMH FLINAEPDSF AARYAEEYGM GVCAIGLRVA NARRAFERAI QLGAWAFEGE
KVGVGELKIP AIQGIGDSHL YFVDRWRGRD GQRGGVGDIS IFDIDFRPID IATAHTDLDC
AGVGLQQVDH FTQTVGAGRM QEWLDFYHDL LHFREIHQID AHWHVSEESR VMVSPCGAVR
IPVYEEGTRR TDLMHAYLPD HPGEGVQHVA LATDDILSCV DALRANGVEF IEPPARYYDD
VDARLPAHGV DLDALRRRAV LVDGEIGSDG VPRLFFQTFV KRRPGEIFFE IVQRKGHHGF
GEGNLAALAR ARDAG
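
The sequences in this record are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch in Python of how the blocks might be joined and a GC fraction computed; `clean_blocks` and `gc_fraction` are hypothetical helper names, not part of any database tooling, and only the first and last printed lines of the gene sequence are used here for brevity.

```python
def clean_blocks(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence as printed in this record.
first_line = "ATGCCCGGCA ATTCCCACCC CTTGTCCAGC GATACGCCGC CCGTCGCCGA TCCGGCCGCC"
last_line = "GGCGAGGGCA ACCTCGCGGC ACTCGCCCGC GCGCGCGACG CCGGGTGA"

seq = clean_blocks(first_line)
print(len(seq))                      # 60
print(seq[:3])                       # ATG
print(clean_blocks(last_line)[-3:])  # TGA
```

Passing the full gene sequence through `clean_blocks` and then `gc_fraction` gives a value that can be compared with the GC content field in the Gene Information section.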