Gene BURPS1710b_A0131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A0131 
Symbol 
ID3693266 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp160763 
End bp161746 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content58% 
IMG OID637730385 
ProductTauD/TfdA family dioxygenase 
Protein accessionYP_335290 
Protein GI76818747 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2175] Probable taurine catabolism dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.442379 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATTCCG GGCGATATTT TTTTAATGCC TTTAATTCAA TCAGGAATCC GATTTTCGAT 
CGAATATCGA AAATAACGCC GCATTCGGAT TCCGATCGAA GCGCATCGCG TTCAAACATC
GTCACCACAG GAATTCGCAT GATTTCACGC AAATTGTCCC CTGCGCTCGG CGCAGAGATT
CGAGGCATCG ATTTTTCTAA ACCGCTGTCG TCGCAAGCGC GCGACGACGT CATCGGTTTG
TTGTCCGAAC ATCAATTGCT CGTCTTTCCC GGCCAGTGCC TGTCGTGCGA ACAGCAGATC
GCCGCGTGCG GCGCGTTCGG CGAGCTCGAG CCGCACCCGA TGACGACCAA TACGTCCTCG
TTCCCGGAAA TGACGATCGT GTCGAACGTG ACGTCGGACG GCAAGCCGGT CGGCTATCCG
ACGCCGCCGT TCGAGCTGTG GCATTCGGAT CTGTGCTATC TCGAGCACCC GGCGAAAATG
ACGTTCTTCT ATGCCGAATC CGTGCCCGAC GCGCACGGCG ATACCTGGTT CGCGAACATG
TTCCGCGCAT ACGAGACGCT GCCCGACGAA CTGAAAGCGG CGATCGACGG CAAGCATGCG
GTCTTCAGTC TCGACAGCAG CCTCGTGAAG CGATGCAGGA AGATCGGCTT CGATCTCAAT
ATCGCGGAAG ACGATTTCAA GCCGACCGTC TCGCATCCGG CGGTGCGCAC CCATCCGCAC
ACGCGCCAAC GCTCGATCTT CGTCAACTGG GCGCACACCG ACCGGATCGA GGGCTATTCG
CCCGAGGAAA GCGACGAGAT TCTCGATCGC ATCTTCGCGC ACTGCCGCAA CGAGGATTTC
ATCTACCGTC ATCGCTACGC GAACGAAGAC CTCGTGATCT GGGACAACGC GTCGCTGATC
CACACCAATT CGCCGAACCC GCCCGTCGGC AATCGCATCA TGCGGCGCGT GATGGTGTCC
GGGCCGAAGC CGTTCTATCA GTAA
 
Protein sequence
MYSGRYFFNA FNSIRNPIFD RISKITPHSD SDRSASRSNI VTTGIRMISR KLSPALGAEI 
RGIDFSKPLS SQARDDVIGL LSEHQLLVFP GQCLSCEQQI AACGAFGELE PHPMTTNTSS
FPEMTIVSNV TSDGKPVGYP TPPFELWHSD LCYLEHPAKM TFFYAESVPD AHGDTWFANM
FRAYETLPDE LKAAIDGKHA VFSLDSSLVK RCRKIGFDLN IAEDDFKPTV SHPAVRTHPH
TRQRSIFVNW AHTDRIEGYS PEESDEILDR IFAHCRNEDF IYRHRYANED LVIWDNASLI
HTNSPNPPVG NRIMRRVMVS GPKPFYQ