Gene Information |
Locus tag | Bphy_2866 |
Symbol | |
ID | 6244349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phymatum STM815 |
Kingdom | Bacteria |
Replicon accession | NC_010622 |
Strand | + |
Start bp | 3216746 |
End bp | 3217843 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642594671 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_001859084 |
Protein GI | 186477614 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
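The two length fields above follow from the coordinates. A minimal sketch of the arithmetic, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon (both are assumptions; the variable names are illustrative only):

```python
# Values copied from the Gene Information table.
start_bp = 3216746
end_bp = 3217843

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1098 365
```

Under these assumptions the result agrees with the reported Gene Length (1098 bp) and Protein Length (365 aa).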
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0000187167 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGGTTT CAACCTGGGA GAATCCCGTC GGCACCGACG GCTTCGAATT CATCGAATAC ACTGCGCCGG ACCCCGTGGC GCTCGGCAAG TTGTTCGAGC AGATGGGCTT CACGGCGATC GCGAAGCATC GGCACAAGGA CGTGACGTTG TACCGCCAGG GCGACATCAA CTTCATCGTG AACGCCGAGC CGGACTCGTT CGCGCAACGT TTTGCCCGCC TGCACGGCCC GTCGATCTGC GCGATCGCGT TCCGCGTGCA GGACGCGGCG AAGGCCTACA GGCGAGCACT CGACCTCGGC GCTTGGGGAT TCGATAACAA GACGGGCCCG ATGGAACTGA ACATTCCCGC CATCAAGGGC ATCGGCGATT CGCTGATCTA TTTCGTCGAC CGCTGGCGCG GCAAGAACGG CGCGGCGCCG AACAGCATCG GCAACATCAG CATCTATGAT GTCGATTTCG AGCCAATCCC GGGCGCGAAT GCTAATCCGA CCGGGCACGG CCTCACCTAT ATCGATCACC TCACGCACAA CGTCCATCGC GGCCGTATGC ACGAGTGGGC CGAGTTCTAC GAGCGCCTGT TCAATTTCCG CGAAGTGCGC TACTTCGATA TCGAAGGCAA GGTGACGGGC GTGAAGTCGA AGGCAATGAC GTCGCCGTGC GGCAAGATCC GCATTCCGAT CAACGAGGAA GGCTCGGAAA CAGCCGGCCA GATTCAGGAA TATCTCGACG CGTATCATGG CGAGGGCATC CAGCACATCG CGCTCGGTAC GAACGACATC TACCGGACGG TCGACGGCCT GCGCAACTCG AAGATCACGC TGCTCGACAC CATCGACACG TACTATGAGC TCGTCGACCG CCGCGTGCCG AACCACGGCG AGCCGCTCGA AGAACTGCGC AAGCGCAAAA TCCTGATCGA CGGCGCACGC GAAGACCTGT TACTGCAGAT ATTCACTGAG AACCAGATCG GACCGATCTT CTTCGAGATC ATCCAGCGCA AGGGTAATCA GGGATTCGGC GAAGGCAACT TCAAGGCGCT GTTCGAATCG ATCGATCTGG ATCAGATTCG TCGCGGTGTC GTGCAAGACA AGGCTTAA
|
Protein sequence | MKVSTWENPV GTDGFEFIEY TAPDPVALGK LFEQMGFTAI AKHRHKDVTL YRQGDINFIV NAEPDSFAQR FARLHGPSIC AIAFRVQDAA KAYRRALDLG AWGFDNKTGP MELNIPAIKG IGDSLIYFVD RWRGKNGAAP NSIGNISIYD VDFEPIPGAN ANPTGHGLTY IDHLTHNVHR GRMHEWAEFY ERLFNFREVR YFDIEGKVTG VKSKAMTSPC GKIRIPINEE GSETAGQIQE YLDAYHGEGI QHIALGTNDI YRTVDGLRNS KITLLDTIDT YYELVDRRVP NHGEPLEELR KRKILIDGAR EDLLLQIFTE NQIGPIFFEI IQRKGNQGFG EGNFKALFES IDLDQIRRGV VQDKA
|
| |
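The GC content and the protein sequence in this record can be recomputed from the gene sequence. The sketch below shows one way to do that in plain Python. The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the amino-acid string is the codon assignment of NCBI translation table 11, written in NCBI's TCAG codon order. Only the first 30 bases of the gene are used here; the full Gene sequence can be pasted in to compare against the reported values.

```python
# Codon -> amino acid map for translation table 11, built in NCBI's TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Drop the spaces that group the record's sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the Gene sequence field.
first_30 = "ATGAAGGTTT CAACCTGGGA GAATCCCGTC"
print(translate(first_30))  # MKVSTWENPV
```

The printed residues match the first block of the Protein sequence field. Calling `gc_percent` on the full gene sequence gives a value to compare with the GC content field.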