Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_0201 |
Symbol | |
ID | 4882752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 195239 |
End bp | 196159 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640126129 |
Product | phytanoyl-CoA dioxygenase (PhyH) family protein |
Protein accession | YP_001057254 |
Protein GI | 126439521 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG5285] Protein involved in biosynthesis of mitomycin antibiotics/polyketide fumonisin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGGGTG CGCGCCCGAC AGTTCCTTTG CGTTACATCA AATTCGCGGC GACGATCGCG CGCCGTTTTT CACTTCGCGG CATGGTGGCG AGCATGTCGC GCACGTCCGT TTTACAATCG TTCATCGATA ACAATGACGC CGACCACCCC ATGTCTTCCT TGCACCCGGA GTTGATTCAC ACGCAGGTCC AGACGCTGCG CGAGCGCGGC TTCGTCGTCG CACCGGGGCT CGTCGCACCC GAGCGGTGCG CGCAACTGAA GACGATCGCC GAGCGGCAGT TGCGCGAGGC GGCGCAGCCG CTCGAATTCG AGGCCGACCT GCGCTATCCG GGCGCGCCCG AATCGCGGCA CGCGCCGGGC GGCCATACGG TGCGGCGGCT GCTCGATGCA TACGCGCGCG ACGCGGCGTT CGCCGAGCGC GCGACCGCGC CCGAGATCGG CGCGTGGATG CGTGCGTACT TTGACGAAAC GCCGGTGCTC TCGCGCGCGC ATCACAACTG CGTGATGACG AAGCATCCCG CATACGGCAG CCTCACCGGC TGGCATCGCG ACGTGCGCTA TTGGTCGTTC GAGCGCCCGG ATCTCGTATC CGTGTGGCTC GCGCTCGGGC CGGAGACGGA CGACAACGGC GCGCTATGGC TCGTGCCGGG CTCGCATGAC GCGGAATTCG GGCCGGAGCA TTTCGACGAA GCGAAGTTCT TTCGCGGCGA CGTGCCGGCA AACCGTCGGT TGATCGAGCA GGCGGTGTGC CCGGCGCTCG CGGCGGGCGA TGTCGTGTTC TTCCACTGCA ATACGCTGCA TTCGGCGGGG CAGAACCGCA GCGATCAGGT GAAGTTCTCG CTCGTGTTCA CCTATCACGG CGACAGCAAT CGGCCGGTGC CCGGCTCGCG CTCGGCGTCG AAACCGGAGG TGCGGTTCTA G
|
Protein sequence | MAGARPTVPL RYIKFAATIA RRFSLRGMVA SMSRTSVLQS FIDNNDADHP MSSLHPELIH TQVQTLRERG FVVAPGLVAP ERCAQLKTIA ERQLREAAQP LEFEADLRYP GAPESRHAPG GHTVRRLLDA YARDAAFAER ATAPEIGAWM RAYFDETPVL SRAHHNCVMT KHPAYGSLTG WHRDVRYWSF ERPDLVSVWL ALGPETDDNG ALWLVPGSHD AEFGPEHFDE AKFFRGDVPA NRRLIEQAVC PALAAGDVVF FHCNTLHSAG QNRSDQVKFS LVFTYHGDSN RPVPGSRSAS KPEVRF
|
| |