Gene BTH_II1222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_II1222 
Symbol 
ID3845363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007650 
Strand
Start bp1445675 
End bp1446820 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content69% 
IMG OID637838524 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_439418 
Protein GI83716682 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCAGCT CAGCCAGTCC CGCGTCCAGC GATCCCGCGT TTGCCGGCCC GCCGGGCGAC 
AACCCGCTCG GCATGGCGGG GCTCGAATTC GTCGAATTCG CGTCGCGCGA GCCTGACGCG
CTCGCGCGGC GCTTCGAGCA GCTCGGTTTC AAGGCGATCG CGCGGCACGT CAGCAAGGCG
GTCACGCTCT ACCGGCAAGG GCCGATGAAC TTTCTCGTGA ACGCGCAGCC CGATTCGTTC
GCCGCGCGCT ACGCGGACGA ATATGGCACG GGCGTGTGTG CGATCGGCAT TCGCGTCGAC
GACGCGCAGC GCGCGTTCGA GCGCGCGATC GAGCTCGGCG CGTGGGCGTT CGAGGGCGAG
CGGATCGGCG TCGGCGAATT GACGATTCCG GCGATCCAGG GGATCGGCGC GTCGCACATC
CATTTCGTCG ACCGCTGGCG CGGGCGCGGC GGATTGCGCG GCGGGGTCGG CGACATCTCG
ATCTTCGACG TCGATTTCCG CCCGATCGAC GTCGCCACGG CGCAGGCGGA CCTCGACTAC
TTCGGCGCCG GCCTGCGGCG CGTCGATCAC CTGACGCAGA CGGTCGGCCG TGGCCGGATG
CAGGAGTGGC TCGATTTCTA TCGCGATCTG CTGCACTTCC GCGAGATCCA TGAACTCAAC
GCGAACTGGC ACGTGTCGGA GGAGGCGCGC GTGATGGTGT CGCCGTGCGG CGACGTGCGG
ATTCCGGTGT ACGAGGAGGG CACGAGGCGC ACCGAGCTGA TGCACGAGTA TCTGCCCGAC
CATCCGGGCG AGGGGGTGCA GCACATCGCG CTCGCGACCG ACGACATTCT CGCGTGCGCG
GACGCGCTCG CGGCGAACGG CGTCGAGTTC GTCGAGCCGC CCGCGCGCTA CTACGACGAG
ATCGAGGCGC GATTGCCCGG CTGCCGGATC GACGTCGATG CGCTGCGCGC GCGCCGCATT
CTCGTCGACG GCGAGATCGG CGACGACGGC GTGCCGAGGC TGTTTCTCCA GACGTTCGTC
AAGCGCCGGC CCGGCGAGAT CTTCTTCGAG ATCGTCGAGC GGCGCGGGCA TCACGGCTTC
GGCGAGGGCA ATCTGCGCGC GCTCGCGCAC GCGAGGAATG CGGCGCGCGG CGCGCTCAGG
CAGTGA
 
Protein sequence
MSSSASPASS DPAFAGPPGD NPLGMAGLEF VEFASREPDA LARRFEQLGF KAIARHVSKA 
VTLYRQGPMN FLVNAQPDSF AARYADEYGT GVCAIGIRVD DAQRAFERAI ELGAWAFEGE
RIGVGELTIP AIQGIGASHI HFVDRWRGRG GLRGGVGDIS IFDVDFRPID VATAQADLDY
FGAGLRRVDH LTQTVGRGRM QEWLDFYRDL LHFREIHELN ANWHVSEEAR VMVSPCGDVR
IPVYEEGTRR TELMHEYLPD HPGEGVQHIA LATDDILACA DALAANGVEF VEPPARYYDE
IEARLPGCRI DVDALRARRI LVDGEIGDDG VPRLFLQTFV KRRPGEIFFE IVERRGHHGF
GEGNLRALAH ARNAARGALR Q