Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bphyt_3671 |
Symbol | |
ID | 6283539 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phytofirmans PsJN |
Kingdom | Bacteria |
Replicon accession | NC_010681 |
Strand | + |
Start bp | 4112425 |
End bp | 4113522 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642623260 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_001897285 |
Protein GI | 187925643 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0292973 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0000000943483 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGGTTT CAACTTGGGA AAATCCGCTC GGCACAGACG GCTTCGAGTT CATTGAATAC ACCGCGCCGG ATCCGAAAGC GCTCGGCAAG CTGTTCGAAC AGATGGGCTT CACCGCCGTG GCCCGGCATC GTCATAAGGA CGTGACGCTG TACCGCCAGG GAGAAATCAA CTTCATCGTC AACGGCGAGC CGGATTCGTT CGCGCAACGC TTCACGCGTT TGCACGGCCC TTCCATCTGC GCGATCGCTT TCCGCGTTCA GGACGCCGCC AAGGCGTACA AAGAAGCGCT GGAAAAGGGC GCCTGGGGCT TCGACAACAA AACCGGCCCG ATGGAATTGA ACATTCCGGC GATCAAGGGC ATTGGCGACT CGCTGATCTA TTTCGTCGAT CGGTGGCGCG GCAAGAACGG CGCGGAGCCG AACAGCATCG GCAACATCGA CATTTACGAT GTCGACTTCG AACCGATTGC CGGCGCGAAC CCGAATCCGG TCGGCCACGG CCTGACCTAC ATCGACCACC TGACGCATAA CGTGCATCGC GGCCGGATGC AGGAATGGGC GGAGTTCTAC GAGCGTCTGT TCAACTTCCG CGAAGTGCGT TATTTCGACA TCGAAGGCAA GGTGACGGGC GTGAAGTCGA AGGCAATGAC CTCGCCGTGC GGCAAGATTC GCATCCCGAT CAATGAAGAA GGTTCGGAAA CCGCCGGCCA GATTCAGGAA TATCTCGACG CGTATCACGG CGAAGGCATT CAGCACATTG CCCTCGGCAG CAACGACATC TACCGCACGG TGGACGGCTT GCGCGGATCG AATATCTCGC TGCTCGACAC GATCGACACG TATTACGAGC TAGTCGATCG CCGCGTGCCG AATCACGGCG AGCCGCTCGA CGAACTGCGC AAACGCAAGA TTCTGATCGA CGGCGCACCC GAAGATCTGC TGCTGCAGAT TTTCACCGAA AACCAGATTG GCCCGATCTT CTTCGAGATC ATTCAGCGCA AGGGCAATCA GGGCTTCGGC GAGGGCAACT TCAAGGCACT GTTCGAATCG ATCGAACTGG ACCAGATTCG CCGTGGCGTG GTGCAAGACA AGGTCTGA
|
Protein sequence | MQVSTWENPL GTDGFEFIEY TAPDPKALGK LFEQMGFTAV ARHRHKDVTL YRQGEINFIV NGEPDSFAQR FTRLHGPSIC AIAFRVQDAA KAYKEALEKG AWGFDNKTGP MELNIPAIKG IGDSLIYFVD RWRGKNGAEP NSIGNIDIYD VDFEPIAGAN PNPVGHGLTY IDHLTHNVHR GRMQEWAEFY ERLFNFREVR YFDIEGKVTG VKSKAMTSPC GKIRIPINEE GSETAGQIQE YLDAYHGEGI QHIALGSNDI YRTVDGLRGS NISLLDTIDT YYELVDRRVP NHGEPLDELR KRKILIDGAP EDLLLQIFTE NQIGPIFFEI IQRKGNQGFG EGNFKALFES IELDQIRRGV VQDKV
|
| |