Gene BURPS668_A1640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1640 
Symbol 
ID4887467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1568614 
End bp1569597 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content58% 
IMG OID640131579 
ProductTauD/TfdA family dioxygenase 
Protein accessionYP_001062636 
Protein GI126445134 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2175] Probable taurine catabolism dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATTCCG GGCGATATTT TTTTAATGCC TTTAATTCAA TCAGGAATCC GATTTTCGAT 
CGAATATCGA AAATAACGCC GCATTCGGAT TCCGATCGAA GCGCATCGCG TTCAAACATC
GTCACCACAG GAATTCGCAT GATTTCACGC AAATTGTCCC CTGCGCTCGG CGCAGAGATT
CGAGGCATCG ATTTTTCTGA ACCGCTGTCG TCGCAAGCGC GCGACGACGT CATCGGTTTG
TTGTCCGAAC ATCAATTGCT CGTCTTTCCC GGCCAGCGCC TGTCGTGCGA ACAGCAGATC
GCCGCGTGCG GCGCGTTCGG CGAGCTCGAG CCGCACCCGA TGACGACCAA TACGTCCTCG
TTCCCGGAAA TGACGATCGT GTCGAACGTG ACGTCGGACG GCAAGCCGGT CGGCTATCCG
ACGCCGCCGT TCGAGCTGTG GCATTCGGAT CTGTGCTATC TCGAGCACCC GGCGAAAATG
ACGTTCTTCT ATGCCGAATC CGTGCCCGAC GCGCACGGCG ATACCTGGTT CGCGAACATG
TTCCGCGCAT ACGAGACGCT GCCCGACGAA CTGAAAGCGG CGATCGACGG CAAGCATGCG
GTCTTCAGTC TCGACAGCAG CCTCGTGAAG CGATGCAGGA AGATCGGCTT CGATCTCAAT
ATCGCGGAAG ACGATTTCAA GCCGACCGTC TCGCATCCGG CGGTGCGCAC CCATCCGCAC
ACGCGCCAAC GCTCGATCTT CGTCAACTGG GCGCACACCG ACCGGATCGA GGGCTATTCG
CCCGAGGAAA GCGACGAGAT TCTCGATCGT ATCTTCGCGC ACTGCCGCAA CGAGGATTTC
ATCTACCGTC ATCGCTACGC GAACGAAGAC CTCGTGATCT GGGACAACGC GTCGCTGATC
CACACCAATT CGCCGAACCC GCCCGTCGGC AATCGCATCA TGCGGCGCGT GATGGTGTCC
GGGCCGAAGC CGTTCTATCA GTAA
 
Protein sequence
MYSGRYFFNA FNSIRNPIFD RISKITPHSD SDRSASRSNI VTTGIRMISR KLSPALGAEI 
RGIDFSEPLS SQARDDVIGL LSEHQLLVFP GQRLSCEQQI AACGAFGELE PHPMTTNTSS
FPEMTIVSNV TSDGKPVGYP TPPFELWHSD LCYLEHPAKM TFFYAESVPD AHGDTWFANM
FRAYETLPDE LKAAIDGKHA VFSLDSSLVK RCRKIGFDLN IAEDDFKPTV SHPAVRTHPH
TRQRSIFVNW AHTDRIEGYS PEESDEILDR IFAHCRNEDF IYRHRYANED LVIWDNASLI
HTNSPNPPVG NRIMRRVMVS GPKPFYQ