Gene Information |
Locus tag | BURPS1106A_A1559 |
Symbol | |
ID | 4904706 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 1503697 |
End bp | 1504680 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640144664 |
Product | TauD/TfdA family dioxygenase |
Protein accession | YP_001075592 |
Protein GI | 126455633 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
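The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1503697
end_bp = 1504680

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = codon_count - 1

print(gene_length)     # 984
print(remainder)       # 0
print(protein_length)  # 327
```

Under these assumptions the result agrees with the reported Gene Length (984 bp) and Protein Length (327 aa).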
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATTCCG GGCGATATTT TTTTAATGCC TTTAATTCAA TCAGGAATCC GATTTTCGAT CGAATATCGA AAATAACGCC GCATTCGGAT TCCGATCGAA GCGCATCGCG TTCAAACATC GTCACCACAG GAATTCGCAT GATTTCACGC AAATTGTCCC CTGCGCTCGG CGCAGAGATT CGAGGCATCG ATTTTTCTGA ACCGCTGTCG TCGCAAGCGC GCGACGACGT CATCGGTTTG TTGTCCGAAC ATCAATTGCT CGTCTTTCCC GGCCAGCGCC TGTCGTGCGA ACAGCAGATC GCCGCGTGCG GCGCGTTCGG CGAGCTCGAG CCGCACCCGA TGACGACCAA TACGTCCTCG TTCCCGGAAA TGACGATCGT GTCGAACGTG ACGTCGGACG GCAAGCCGGT CGGCTATCCG ACGCCGCCGT TCGAGCTGTG GCATTCGGAT CTGTGCTATC TCGAGCACCC GGCGAAAATG ACGTTCTTCT ATGCCGAATC CGTGCCCGAC GCGCACGGCG ACACCTGGTT CGCAAACATG TTCCGCGCAT ACGAGACGCT GCCCGACGAA CTGAAAGCGG CGATCGACGG CAAGCATGCG GTCTTCAGTC TCGACAGCAG CCTCGTGAAG CGATGCAGGA AGATCGGCTT CGATCTCAAT ATCGCGGAAG ACGATTTCAA GCCGACCGTC TCGCATCCGG CGGTGCGCAC CCATCCGCAC ACGCGCCAAC GCTCGATCTT CGTCAACTGG GCGCACACCG ACCGGATCGA GGGCTATTCG CCCGAGGAAA GCGACGAGAT TCTCGATCGT ATCTTCGCGC ACTGCCGCAA CGAGGATTTC ATCTACCGTC ATCGCTACGC GAACGAAGAC CTCGTGATCT GGGACAACGC GTCGCTGATC CACACCAATT CGCCGAACCC GCCCGTCGGC AATCGCATCA TGCGGCGCGT GATGGTGTCC GGGCCGAAGC CGTTCTATCA GTAA
Protein sequence | MYSGRYFFNA FNSIRNPIFD RISKITPHSD SDRSASRSNI VTTGIRMISR KLSPALGAEI RGIDFSEPLS SQARDDVIGL LSEHQLLVFP GQRLSCEQQI AACGAFGELE PHPMTTNTSS FPEMTIVSNV TSDGKPVGYP TPPFELWHSD LCYLEHPAKM TFFYAESVPD AHGDTWFANM FRAYETLPDE LKAAIDGKHA VFSLDSSLVK RCRKIGFDLN IAEDDFKPTV SHPAVRTHPH TRQRSIFVNW AHTDRIEGYS PEESDEILDR IFAHCRNEDF IYRHRYANED LVIWDNASLI HTNSPNPPVG NRIMRRVMVS GPKPFYQ
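The relationship between the two sequence rows can be sketched in Python. This is a hedged illustration, not the database's own pipeline: it assumes the space-separated 10-character groups are display formatting only, and it uses the standard codon assignments, which translation table 11 shares with the standard code for all codons (table 11 differs only in the start codons it permits). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Standard codon assignments in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and last 12 nt of the Gene sequence row above.
head = "ATGTATTCCG GGCGATATTT TTTTAATGCC"
tail = "TTCTATCA GTAA"

print(translate(head))  # MYSGRYFFNA
print(translate(tail))  # FYQ*
```

The translated start matches the first ten residues of the Protein sequence row, and the gene ends with the residues FYQ followed by a TAA stop codon. Passing the full Gene sequence string to `gc_fraction` gives a value to compare against the reported GC content.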