Gene Information |
Locus tag | Bmul_0223 |
Symbol | |
ID | 5765722 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia multivorans ATCC 17616 |
Kingdom | Bacteria |
Replicon accession | NC_010084 |
Strand | - |
Start bp | 250987 |
End bp | 252084 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641312626 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_001578415 |
Protein GI | 161523403 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
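The length fields in the table above follow from the coordinates. The sketch below reproduces that arithmetic; it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of amino acids, assuming one terminal stop codon is excluded."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Gene Information table for Bmul_0223.
length = gene_length_bp(250987, 252084)
print(length)                     # 1098
print(protein_length_aa(length))  # 365
```

Both results agree with the Gene Length (1098 bp) and Protein Length (365 aa) fields of the record.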
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.376735 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0000014808 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGATCC CCACCTGGGA CAACCCCGTC GGCACCGACG GCTTCGAATT CATCGAATAC ACGGCACCGG ACCCGAAGGC GCTCGGACAA CTGTTCGAAC GGATGGGTTT CACCGCGATC GCGCGTCACC GCCACAAGGA CGTGACGGTC TACCGCCAGG GCGACATCAA CTTCATCATC AACGCCGAGC CCGACTCGTT CGCGCAGCGC TTCGCCCGGC TGCACGGCCC GTCGATCTGC GCGATCGCGT TTCGCGTGCA GGACGCCGCG AAGGCGTACA AGCGCGCGCT CGAACTCGGC GCATGGGGCT TCGACAACAA GACGGGCCCG ATGGAGCTGA ACATCCCGGC GATCAAGGGC ATCGGCGATT CGCTGATCTA CTTCGTCGAC CGCTGGCGCG GCAAGAACGG CGCGCAGCCG GGCGCGATCG GCGACATCAG CATCTACGAC GTCGACTTCG AGCCGATCCC CGGCGCCGAG CCGAACCCGG TCGGTCACGG GCTCACGTAC ATCGACCACC TGACGCACAA CGTGCATCGC GGCCGCATGC AGGAATGGGC GGAGTTCTAC GAACGCCTGT TCAACTTCCG CGAAGTGCGC TACTTCGACA TCGAAGGCAA GGTGACGGGT GTGAAGTCGA AGGCGATGAC GTCGCCGTGC GGCAAGATCC GCATCCCGAT CAACGAGGAA GGCTCGGACA CCGCCGGCCA GATCCAGGAA TACCTGGACG CGTACCACGG CGAAGGCATT CAGCACATCG CGCTCGGCAC CAACAACATC TATGGGGCGG TCGACGGCCT GCGCAGCAAG GAAGTGAAGC TGCTCGACAC GATCGACACC TATTACGAAC TGGTCGATCG CCGCGTGCCG AACCACGGCG AATCGCTGGA AGAGCTGAAG AAGCGCAAGA TCCTGATCGA CGGCGCACGC GACGATCTGC TGCTGCAGAT CTTCACCGAA AACCAGATCG GGCCGATCTT CTTCGAGATC ATCCAGCGCA AGGGCAATCA GGGCTTCGGC GAGGGCAACT TCAAGGCGCT GTTCGAATCG ATCGAGCTCG ATCAGATCCG CCGCGGCGTC GTGCAGGACA AGGCCTGA
|
Protein sequence | MQIPTWDNPV GTDGFEFIEY TAPDPKALGQ LFERMGFTAI ARHRHKDVTV YRQGDINFII NAEPDSFAQR FARLHGPSIC AIAFRVQDAA KAYKRALELG AWGFDNKTGP MELNIPAIKG IGDSLIYFVD RWRGKNGAQP GAIGDISIYD VDFEPIPGAE PNPVGHGLTY IDHLTHNVHR GRMQEWAEFY ERLFNFREVR YFDIEGKVTG VKSKAMTSPC GKIRIPINEE GSDTAGQIQE YLDAYHGEGI QHIALGTNNI YGAVDGLRSK EVKLLDTIDT YYELVDRRVP NHGESLEELK KRKILIDGAR DDLLLQIFTE NQIGPIFFEI IQRKGNQGFG EGNFKALFES IELDQIRRGV VQDKA
|
| |
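The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field could be cleaned, checked for GC content, and translated is given below. It uses only the Python standard library and makes these assumptions: the gene sequence is already in coding orientation (it starts with ATG and ends with TGA even though the gene lies on the minus strand), and the amino acid assignments of translation table 11 match the standard genetic code. Alternative start codons are not handled. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering (standard code;
# assumed sufficient here because table 11 uses the same assignments).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
fragment = "ATGCAGATCC CCACCTGGGA CAACCCCGTC"
print(translate(fragment))  # MQIPTWDNPV
```

The translated fragment matches the first block of the protein sequence in the record. Passing the full Gene sequence field to `translate` and `gc_percent` would be the way to compare against the Protein sequence and GC content fields.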