Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II1222 |
Symbol | |
ID | 3845363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | + |
Start bp | 1445675 |
End bp | 1446820 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637838524 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_439418 |
Protein GI | 83716682 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAGCT CAGCCAGTCC CGCGTCCAGC GATCCCGCGT TTGCCGGCCC GCCGGGCGAC AACCCGCTCG GCATGGCGGG GCTCGAATTC GTCGAATTCG CGTCGCGCGA GCCTGACGCG CTCGCGCGGC GCTTCGAGCA GCTCGGTTTC AAGGCGATCG CGCGGCACGT CAGCAAGGCG GTCACGCTCT ACCGGCAAGG GCCGATGAAC TTTCTCGTGA ACGCGCAGCC CGATTCGTTC GCCGCGCGCT ACGCGGACGA ATATGGCACG GGCGTGTGTG CGATCGGCAT TCGCGTCGAC GACGCGCAGC GCGCGTTCGA GCGCGCGATC GAGCTCGGCG CGTGGGCGTT CGAGGGCGAG CGGATCGGCG TCGGCGAATT GACGATTCCG GCGATCCAGG GGATCGGCGC GTCGCACATC CATTTCGTCG ACCGCTGGCG CGGGCGCGGC GGATTGCGCG GCGGGGTCGG CGACATCTCG ATCTTCGACG TCGATTTCCG CCCGATCGAC GTCGCCACGG CGCAGGCGGA CCTCGACTAC TTCGGCGCCG GCCTGCGGCG CGTCGATCAC CTGACGCAGA CGGTCGGCCG TGGCCGGATG CAGGAGTGGC TCGATTTCTA TCGCGATCTG CTGCACTTCC GCGAGATCCA TGAACTCAAC GCGAACTGGC ACGTGTCGGA GGAGGCGCGC GTGATGGTGT CGCCGTGCGG CGACGTGCGG ATTCCGGTGT ACGAGGAGGG CACGAGGCGC ACCGAGCTGA TGCACGAGTA TCTGCCCGAC CATCCGGGCG AGGGGGTGCA GCACATCGCG CTCGCGACCG ACGACATTCT CGCGTGCGCG GACGCGCTCG CGGCGAACGG CGTCGAGTTC GTCGAGCCGC CCGCGCGCTA CTACGACGAG ATCGAGGCGC GATTGCCCGG CTGCCGGATC GACGTCGATG CGCTGCGCGC GCGCCGCATT CTCGTCGACG GCGAGATCGG CGACGACGGC GTGCCGAGGC TGTTTCTCCA GACGTTCGTC AAGCGCCGGC CCGGCGAGAT CTTCTTCGAG ATCGTCGAGC GGCGCGGGCA TCACGGCTTC GGCGAGGGCA ATCTGCGCGC GCTCGCGCAC GCGAGGAATG CGGCGCGCGG CGCGCTCAGG CAGTGA
|
Protein sequence | MSSSASPASS DPAFAGPPGD NPLGMAGLEF VEFASREPDA LARRFEQLGF KAIARHVSKA VTLYRQGPMN FLVNAQPDSF AARYADEYGT GVCAIGIRVD DAQRAFERAI ELGAWAFEGE RIGVGELTIP AIQGIGASHI HFVDRWRGRG GLRGGVGDIS IFDVDFRPID VATAQADLDY FGAGLRRVDH LTQTVGRGRM QEWLDFYRDL LHFREIHELN ANWHVSEEAR VMVSPCGDVR IPVYEEGTRR TELMHEYLPD HPGEGVQHIA LATDDILACA DALAANGVEF VEPPARYYDE IEARLPGCRI DVDALRARRI LVDGEIGDDG VPRLFLQTFV KRRPGEIFFE IVERRGHHGF GEGNLRALAH ARNAARGALR Q
|
| |