Gene Tbd_0306 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTbd_0306 
Symbol 
ID3672092 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiobacillus denitrificans ATCC 25259 
KingdomBacteria 
Replicon accessionNC_007404 
Strand
Start bp327923 
End bp331048 
Gene Length3126 bp 
Protein Length1041 aa 
Translation table11 
GC content69% 
IMG OID637708967 
Productarylsulfatase 
Protein accessionYP_314064 
Protein GI74316324 
COG category[M] Cell wall/membrane/envelope biogenesis
[P] Inorganic ion transport and metabolism 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.219055 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCAGCAC CGGAAACCCA AAGCAGCGTG AGCGCGCTCA CGGTTCTGAT CTGCACGCAC 
AACCGCGCCG ATCTGCTCGA GCGCACGCTC GCCTCGCTGA ACCGTGCGCA ACGGCCCGAG
ATGCCGGTGC GCATCCTCGT GGCGGCGAAC GCCTGCAGCG ACGACACGGT CGCGCGCATG
CAGCGCTACC AGGCCGTGCA GGCCCGGGAG AATCTGCTGC CGCTCGACGT CGTCGAAGTC
GCCACCCCCG GCAAATCCCA CGCCTTGAAC ACGGCGCTTC CGCTGATCGA AACCGAGCTC
ACGACCTTCG TCGACGACGA CCATCGCGTC GACGAGGCCT TTCTGGTGGC GATCGAACAG
GCGTCGCGGC GCTGGCCCGA AGCCGGGCTG TATTGCGGCC GCATCCTGCC CGACTGGGAC
GGCAGCGAGC CCGCGTGGGT GCACGACGAG GGACCGTACC GGATCTATCC GCTGCCGGTG
CCGCGTTACG ACCAAGGTGA TGAACCCGCG GCGATCACCG CCGAGCGCGG ACCGATACCG
GGCGGCGGCA ACCTGACGCT GCGTCGAGCC GTGTTCGACC TCGGCGGCGC GTTCTCGACC
GAGCTCGGGC CGCGCGGGCA CGATCTCGGC GGCGGCGAGG ACAGCGAATA CGTGCTGCGC
GTCCTCGCGC GCGGCGCGCG CTGCCAGTAC GCGCCGGCGA TCGTGCAATA CCACTACGTC
GACACCGAGC GGCTGCGCCT GCGCTACCTG CTCAAGAAAA GCTATCAGCG CACGCGCTCG
AGCGCGCGCA TCCAGGGCGG CGGCTCGGTT CCGCTTTACA TGTGGCGCAA GCTCGGCGAA
TACGCCTTTC ACGGCGTGTT CAGCCGGACC TGGGCGCAGC GTCGCTTCTA CTGGGTGCGG
ACGGCCGCCG CACTCGGCGA AATCCGCGGA CACCGGGAAT CGGGCCATCG GCGCAAGCGC
CTCGCCTTGC CGCAGGACCG CGGCGGCTTG CAGGTCGAAG CGCTCGGCCT CGTCGCCGTC
GCGGCGGGGC TGGTCGCGTG GTTGGCCGCG GGCGACGCAC GCTGGGCCGG CCTCGTCCCG
GCGATCGCCG TCGCCGGTGC GGGGACGGCC GCGCTCCTCG CCAAGTCGCT GCTCGACTTC
TCGCAGACCG GTCCGCGCAT CCGCGAGGAA ATCCTCACCC ACTACCGGCG CTACACGCTG
TTCGCGCTCG CCCGGCTGTC GGCCTGGGCC TTCGCCGTGA TGCTGTTCAC CGCCGGCGGC
GGCGTGCTCG TGTACTTCAT GCTCGCGAGC GCGCTGAGCA CGACCTGGTC GAGCGCGCTC
GCGACCGTCG CCGCCCTGCT CGGCCTGCTC GCCGGCTTCG CGCTGCAGTT CGTGCGCAAG
CTGCGGTTCA ATCCTGGCCT CCTCGTGGCG TCGATGCATT ACCGCGTGAG CCGGCTCTAT
GGCCTGTGGC GGGTGGCGAC GCCACAACGC ATCCGCGCCG CCGAATATAT CGGCGGCGGC
GCGCTCGTGC TCCTGCTGGC GGTCGCGTCG GTGCAACTCG CGCGCGCGAT GCGCCTCGAC
GACCTCGTGG CCCTGTGGGG AAGCATGCTG CTCTATCTGG GAACGATCGC CTGGGCGGCC
TGGGAGCCGC CGGCCGGGCC GTCGCGCAGG CGGCCGCCTC GCGGGCCGGG CCGGCCGCCC
AACATCCTGA TGATCGGCTC GGACACTCTA CGCGCCGACC GGCTCGGCGC GCTCGGCTAT
CGTCGCGCGC TGACGCCGAA CCTCGACCGC CTGGCCTCAA CGAGCGCGCT CTTCGCCAAC
TGCTATGTGC CCTGCGCCCG CACCGCACCG AGCCTGATCT CGATGCTGAC CGGGACCTGG
CCGCACACCC ACGGCATCCG CGACAATTTC GTCGACGACG AGAGCACGCG TCTGAAGGTC
GACGCACTGC CTGTGCTGCT CAAGCAGGCC GGCTACCGCA GCGCGGCGAT CTCGGACTGG
TGCGGGGCCG ACATGGGCAA GTTTTCCTTC GGCTTCGACT ACACCGATTT GCCCGAAGAC
CAGTGGAACC TGAAATATCT GATCCGACAG GGCCCCAAGG ACCTGCGGCT CTTCGTTTCG
CTGTTCACCC ACAACCGGCT CGGCCGGCTG CTCCTGCCCG AACTCTATTA TCTCGGCGGC
GTCCCGCTCA CCGGGCCGCT GGGAAGCCGG GCGCGGCGGC TCGTCTCGCG TCTAGCCGAA
AGCGACGCGC CCTTCCTGCT CAACGTCTTC TATTCGACCA CGCATCCGCC TTTCGCATCC
GAATGGCCGT GGTATACCCG CTTCGCCGAC CCGGGCTACG CCGGCGAGTC GAAATTCGCG
ATGGCGCGGC TGACCGATCC GTTCGAGATC ATCCGCCGGC AGGGCGCGCC CAAGGAGGAA
TTCGACCTCG ACCAGATCAT CGATCTCTAC GACGGCTGCG TCGCCGCCTT CGACACCGAG
ATCGGCAAGA TGCTCGCCCA CCTCGAGGCC TGCGGACTCG CCGACGACAC CATCGTCGCC
GTCTATTCGG ACCACGGCAT GGAATTCTTC GAACACGACA CCTGGGGCCA GGGCAACTCC
GCGGTCGGCG ACTTCAGCCC GCGCATCCCG CTCGTGATTC ACGACCCGAG GCGGCCCGGG
CGCGGCACGG TCGCCCAGGT CGTGCGCTCG ATCGATCTCG CCCCGACCCT GCTCGAACTC
GCCGGCCTGC CGGCGCCGGC GAGCATGGAC GGCGCGTCAC TCGTCGGCTG TCTGGCGCCG
GCGGGCCCCT GCCCCGATCT CGACGCCTTC AACGAAACCG GCATCTGGCT CGCCGACGTG
CCGGGCCTGC CCGAGCAGCA TTTGCGCTAC CCCGACGTGC TCGAACTCAT GGGCGTTCCC
AACCGCGAAA GCGGAACGCT CGCGATCAAG ACCCAGTACG CGCCGATCAC GCTGCAGGCC
AAGGACAGGA TGCTGAGGCG CGGGCGCTGG AAGCTCGTCT ATCAGCCGCT CGAAAACGGC
TGCCTGCTGC GGCTTTTTGA CGTCGAGAGC GACCCGGCCT GCCAGCACGA CGTGTCGGAC
GCGCATCCCG ACGTCAAGGC CGAACTCTGG GCACGGCTGC AAAGCTTCCT CGACAGCGGC
GGCTGA
 
Protein sequence
MAAPETQSSV SALTVLICTH NRADLLERTL ASLNRAQRPE MPVRILVAAN ACSDDTVARM 
QRYQAVQARE NLLPLDVVEV ATPGKSHALN TALPLIETEL TTFVDDDHRV DEAFLVAIEQ
ASRRWPEAGL YCGRILPDWD GSEPAWVHDE GPYRIYPLPV PRYDQGDEPA AITAERGPIP
GGGNLTLRRA VFDLGGAFST ELGPRGHDLG GGEDSEYVLR VLARGARCQY APAIVQYHYV
DTERLRLRYL LKKSYQRTRS SARIQGGGSV PLYMWRKLGE YAFHGVFSRT WAQRRFYWVR
TAAALGEIRG HRESGHRRKR LALPQDRGGL QVEALGLVAV AAGLVAWLAA GDARWAGLVP
AIAVAGAGTA ALLAKSLLDF SQTGPRIREE ILTHYRRYTL FALARLSAWA FAVMLFTAGG
GVLVYFMLAS ALSTTWSSAL ATVAALLGLL AGFALQFVRK LRFNPGLLVA SMHYRVSRLY
GLWRVATPQR IRAAEYIGGG ALVLLLAVAS VQLARAMRLD DLVALWGSML LYLGTIAWAA
WEPPAGPSRR RPPRGPGRPP NILMIGSDTL RADRLGALGY RRALTPNLDR LASTSALFAN
CYVPCARTAP SLISMLTGTW PHTHGIRDNF VDDESTRLKV DALPVLLKQA GYRSAAISDW
CGADMGKFSF GFDYTDLPED QWNLKYLIRQ GPKDLRLFVS LFTHNRLGRL LLPELYYLGG
VPLTGPLGSR ARRLVSRLAE SDAPFLLNVF YSTTHPPFAS EWPWYTRFAD PGYAGESKFA
MARLTDPFEI IRRQGAPKEE FDLDQIIDLY DGCVAAFDTE IGKMLAHLEA CGLADDTIVA
VYSDHGMEFF EHDTWGQGNS AVGDFSPRIP LVIHDPRRPG RGTVAQVVRS IDLAPTLLEL
AGLPAPASMD GASLVGCLAP AGPCPDLDAF NETGIWLADV PGLPEQHLRY PDVLELMGVP
NRESGTLAIK TQYAPITLQA KDRMLRRGRW KLVYQPLENG CLLRLFDVES DPACQHDVSD
AHPDVKAELW ARLQSFLDSG G