Gene BURPS1106A_A1559 details

Gene Information

Locus tag: BURPS1106A_A1559
Symbol:
ID: 4904706
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 1503697
End bp: 1504680
Gene Length: 984 bp
Protein Length: 327 aa
Translation table: 11
GC content: 58%
IMG OID: 640144664
Product: TauD/TfdA family dioxygenase
Protein accession: YP_001075592
Protein GI: 126455633
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2175] Probable taurine catabolism dioxygenase
TIGRFAM ID:
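
The start, end, gene length, and protein length fields are related by simple arithmetic. The sketch below checks that relationship using the values listed above. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the record does not state either convention explicitly, although the listed numbers are consistent with both. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1503697
end_bp = 1504680

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS includes one stop codon that is not translated.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # 984 0 327
```

A remainder of 0 means the coding region is a whole number of codons, as expected for an unspliced, non-pseudo CDS.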


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGTATTCCG GGCGATATTT TTTTAATGCC TTTAATTCAA TCAGGAATCC GATTTTCGAT 
CGAATATCGA AAATAACGCC GCATTCGGAT TCCGATCGAA GCGCATCGCG TTCAAACATC
GTCACCACAG GAATTCGCAT GATTTCACGC AAATTGTCCC CTGCGCTCGG CGCAGAGATT
CGAGGCATCG ATTTTTCTGA ACCGCTGTCG TCGCAAGCGC GCGACGACGT CATCGGTTTG
TTGTCCGAAC ATCAATTGCT CGTCTTTCCC GGCCAGCGCC TGTCGTGCGA ACAGCAGATC
GCCGCGTGCG GCGCGTTCGG CGAGCTCGAG CCGCACCCGA TGACGACCAA TACGTCCTCG
TTCCCGGAAA TGACGATCGT GTCGAACGTG ACGTCGGACG GCAAGCCGGT CGGCTATCCG
ACGCCGCCGT TCGAGCTGTG GCATTCGGAT CTGTGCTATC TCGAGCACCC GGCGAAAATG
ACGTTCTTCT ATGCCGAATC CGTGCCCGAC GCGCACGGCG ACACCTGGTT CGCAAACATG
TTCCGCGCAT ACGAGACGCT GCCCGACGAA CTGAAAGCGG CGATCGACGG CAAGCATGCG
GTCTTCAGTC TCGACAGCAG CCTCGTGAAG CGATGCAGGA AGATCGGCTT CGATCTCAAT
ATCGCGGAAG ACGATTTCAA GCCGACCGTC TCGCATCCGG CGGTGCGCAC CCATCCGCAC
ACGCGCCAAC GCTCGATCTT CGTCAACTGG GCGCACACCG ACCGGATCGA GGGCTATTCG
CCCGAGGAAA GCGACGAGAT TCTCGATCGT ATCTTCGCGC ACTGCCGCAA CGAGGATTTC
ATCTACCGTC ATCGCTACGC GAACGAAGAC CTCGTGATCT GGGACAACGC GTCGCTGATC
CACACCAATT CGCCGAACCC GCCCGTCGGC AATCGCATCA TGCGGCGCGT GATGGTGTCC
GGGCCGAAGC CGTTCTATCA GTAA
 
Protein sequence
MYSGRYFFNA FNSIRNPIFD RISKITPHSD SDRSASRSNI VTTGIRMISR KLSPALGAEI 
RGIDFSEPLS SQARDDVIGL LSEHQLLVFP GQRLSCEQQI AACGAFGELE PHPMTTNTSS
FPEMTIVSNV TSDGKPVGYP TPPFELWHSD LCYLEHPAKM TFFYAESVPD AHGDTWFANM
FRAYETLPDE LKAAIDGKHA VFSLDSSLVK RCRKIGFDLN IAEDDFKPTV SHPAVRTHPH
TRQRSIFVNW AHTDRIEGYS PEESDEILDR IFAHCRNEDF IYRHRYANED LVIWDNASLI
HTNSPNPPVG NRIMRRVMVS GPKPFYQ
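
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that translation, not the database's own pipeline. It relies on the fact that NCBI table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons; because this gene begins with ATG, that difference does not matter here. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. How the record computes or rounds its GC content is not stated, so `gc_fraction` is only one plausible definition.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... paired with the amino acid
# assignments shared by NCBI translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (one plausible GC definition)."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGTATTCCG GGCGATATTT TTTTAATGCC TTTAATTCAA TCAGGAATCC GATTTTCGAT"
last_line = "GGGCCGAAGC CGTTCTATCA GTAA"

print(translate(first_line))  # MYSGRYFFNAFNSIRNPIFD
print(translate(last_line))   # GPKPFYQ
```

The two printed strings match the first 20 and last 7 residues of the protein sequence. Passing the full gene sequence to `translate` works the same way, since `clean` strips the grouping spaces and line breaks.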