Gene BURPS668_A2223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2223 
SymboltauD 
ID4888705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2150316 
End bp2151323 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content68% 
IMG OID640132160 
Producttaurine dioxygenase 
Protein accessionYP_001063217 
Protein GI126443662 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2175] Probable taurine catabolism dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGACGGC GCATTCGGCG ATGGCGCCAA GCCGGCCGCA ATCCCCGCGC GACGATAAGC 
AATGCGGATT TCGTTGGTTC ACCAAACGAA CGCGTTTTGC GAGACTTGGC GGCTTTCCGC
CATGCGAGGC GCGCGCCGGC GCACCGAGGC GCACCGACGA TATCCGGACC GACGATGACC
CGACTGACAT TGACCCGACT CACGCCCGCG CTCGGCGCGA TCGTCGACGA CGTGGACCTC
TCGAACGCGA CCGACGCCCT GCGCGACGAC ATCCGCGCCG CGCTCGCGCA CCATCAGGTG
CTGTTCTTCC GCGGCCAGCG CCTGAGCGCG GCCCGGCATC GCGACTTCGC GGCCGGATTC
GGCGATCTGC ACGTGCACCC GATCTATCCG TCGCATCCGG ACGCGCGCGA GATCATGGTG
CTCGACAACG CCGTGTTCGA CCTGCAGGAC AACGCGATCT GGCATACGGA CGTGACATTC
ACCGAGACGC CGCCGCGCGC GTCGATCCTC GCCGCGCACA CGCTGCCCGA GACGGGCGGC
GACACGCTGT GGGGCAGCGG CTTCGCCGCG TACGACGCGC TGTCCGGGCG CGTGAAGGCG
CAGCTCGACG GCCTCACCGC GCAGCACGAT TTCACGAAGT CGTTTCCGCT GAAACGCTTC
GGCGTCACCG CCGAGGATCG CGCGCGCTGG GAGAAGACGC GTGCGACGCA TCCGAGCGTC
GCGCATCCCG TCGTGCGCAC GCACCCGGAG ACCGGCCGCA AGACGCTGTT CGTCAACGAA
GGCTTCACGA CCGAGATCGA CGGGCTGCCC GAAGAGGAAG GCGCCGCGCT GCTGCGCTTC
CTGTTCGCGC ATCAGTCGCG GCCCGAGTTC ACGCTGCGCT GGCGCTGGCA GCCGGGCGAC
GTCGCGTTCT GGGACAACCG CTCGACGATC CATTACGCGG TGAACGACTA CGGCAAAGCG
CATCGGGTGA TGCACCGCGC GACGATCGTC GGCGACAGGC CGTATTGA
 
Protein sequence
MRRRIRRWRQ AGRNPRATIS NADFVGSPNE RVLRDLAAFR HARRAPAHRG APTISGPTMT 
RLTLTRLTPA LGAIVDDVDL SNATDALRDD IRAALAHHQV LFFRGQRLSA ARHRDFAAGF
GDLHVHPIYP SHPDAREIMV LDNAVFDLQD NAIWHTDVTF TETPPRASIL AAHTLPETGG
DTLWGSGFAA YDALSGRVKA QLDGLTAQHD FTKSFPLKRF GVTAEDRARW EKTRATHPSV
AHPVVRTHPE TGRKTLFVNE GFTTEIDGLP EEEGAALLRF LFAHQSRPEF TLRWRWQPGD
VAFWDNRSTI HYAVNDYGKA HRVMHRATIV GDRPY