Gene Information |
Locus tag | BURPS668_A2223 |
Symbol | tauD |
ID | 4888705 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 2150316 |
End bp | 2151323 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640132160 |
Product | taurine dioxygenase |
Protein accession | YP_001063217 |
Protein GI | 126443662 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
|
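The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon (the gene sequence in this record ends in TGA); the function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2150316
END_BP = 2151323


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1008 335
```

Under these assumptions the results agree with the Gene Length (1008 bp) and Protein Length (335 aa) fields listed in the record.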
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGACGGC GCATTCGGCG ATGGCGCCAA GCCGGCCGCA ATCCCCGCGC GACGATAAGC AATGCGGATT TCGTTGGTTC ACCAAACGAA CGCGTTTTGC GAGACTTGGC GGCTTTCCGC CATGCGAGGC GCGCGCCGGC GCACCGAGGC GCACCGACGA TATCCGGACC GACGATGACC CGACTGACAT TGACCCGACT CACGCCCGCG CTCGGCGCGA TCGTCGACGA CGTGGACCTC TCGAACGCGA CCGACGCCCT GCGCGACGAC ATCCGCGCCG CGCTCGCGCA CCATCAGGTG CTGTTCTTCC GCGGCCAGCG CCTGAGCGCG GCCCGGCATC GCGACTTCGC GGCCGGATTC GGCGATCTGC ACGTGCACCC GATCTATCCG TCGCATCCGG ACGCGCGCGA GATCATGGTG CTCGACAACG CCGTGTTCGA CCTGCAGGAC AACGCGATCT GGCATACGGA CGTGACATTC ACCGAGACGC CGCCGCGCGC GTCGATCCTC GCCGCGCACA CGCTGCCCGA GACGGGCGGC GACACGCTGT GGGGCAGCGG CTTCGCCGCG TACGACGCGC TGTCCGGGCG CGTGAAGGCG CAGCTCGACG GCCTCACCGC GCAGCACGAT TTCACGAAGT CGTTTCCGCT GAAACGCTTC GGCGTCACCG CCGAGGATCG CGCGCGCTGG GAGAAGACGC GTGCGACGCA TCCGAGCGTC GCGCATCCCG TCGTGCGCAC GCACCCGGAG ACCGGCCGCA AGACGCTGTT CGTCAACGAA GGCTTCACGA CCGAGATCGA CGGGCTGCCC GAAGAGGAAG GCGCCGCGCT GCTGCGCTTC CTGTTCGCGC ATCAGTCGCG GCCCGAGTTC ACGCTGCGCT GGCGCTGGCA GCCGGGCGAC GTCGCGTTCT GGGACAACCG CTCGACGATC CATTACGCGG TGAACGACTA CGGCAAAGCG CATCGGGTGA TGCACCGCGC GACGATCGTC GGCGACAGGC CGTATTGA
|
Protein sequence | MRRRIRRWRQ AGRNPRATIS NADFVGSPNE RVLRDLAAFR HARRAPAHRG APTISGPTMT RLTLTRLTPA LGAIVDDVDL SNATDALRDD IRAALAHHQV LFFRGQRLSA ARHRDFAAGF GDLHVHPIYP SHPDAREIMV LDNAVFDLQD NAIWHTDVTF TETPPRASIL AAHTLPETGG DTLWGSGFAA YDALSGRVKA QLDGLTAQHD FTKSFPLKRF GVTAEDRARW EKTRATHPSV AHPVVRTHPE TGRKTLFVNE GFTTEIDGLP EEEGAALLRF LFAHQSRPEF TLRWRWQPGD VAFWDNRSTI HYAVNDYGKA HRVMHRATIV GDRPY