Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tbd_0306 |
Symbol | |
ID | 3672092 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thiobacillus denitrificans ATCC 25259 |
Kingdom | Bacteria |
Replicon accession | NC_007404 |
Strand | - |
Start bp | 327923 |
End bp | 331048 |
Gene Length | 3126 bp |
Protein Length | 1041 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637708967 |
Product | arylsulfatase |
Protein accession | YP_314064 |
Protein GI | 74316324 |
COG category | [M] Cell wall/membrane/envelope biogenesis [P] Inorganic ion transport and metabolism |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.219055 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCAGCAC CGGAAACCCA AAGCAGCGTG AGCGCGCTCA CGGTTCTGAT CTGCACGCAC AACCGCGCCG ATCTGCTCGA GCGCACGCTC GCCTCGCTGA ACCGTGCGCA ACGGCCCGAG ATGCCGGTGC GCATCCTCGT GGCGGCGAAC GCCTGCAGCG ACGACACGGT CGCGCGCATG CAGCGCTACC AGGCCGTGCA GGCCCGGGAG AATCTGCTGC CGCTCGACGT CGTCGAAGTC GCCACCCCCG GCAAATCCCA CGCCTTGAAC ACGGCGCTTC CGCTGATCGA AACCGAGCTC ACGACCTTCG TCGACGACGA CCATCGCGTC GACGAGGCCT TTCTGGTGGC GATCGAACAG GCGTCGCGGC GCTGGCCCGA AGCCGGGCTG TATTGCGGCC GCATCCTGCC CGACTGGGAC GGCAGCGAGC CCGCGTGGGT GCACGACGAG GGACCGTACC GGATCTATCC GCTGCCGGTG CCGCGTTACG ACCAAGGTGA TGAACCCGCG GCGATCACCG CCGAGCGCGG ACCGATACCG GGCGGCGGCA ACCTGACGCT GCGTCGAGCC GTGTTCGACC TCGGCGGCGC GTTCTCGACC GAGCTCGGGC CGCGCGGGCA CGATCTCGGC GGCGGCGAGG ACAGCGAATA CGTGCTGCGC GTCCTCGCGC GCGGCGCGCG CTGCCAGTAC GCGCCGGCGA TCGTGCAATA CCACTACGTC GACACCGAGC GGCTGCGCCT GCGCTACCTG CTCAAGAAAA GCTATCAGCG CACGCGCTCG AGCGCGCGCA TCCAGGGCGG CGGCTCGGTT CCGCTTTACA TGTGGCGCAA GCTCGGCGAA TACGCCTTTC ACGGCGTGTT CAGCCGGACC TGGGCGCAGC GTCGCTTCTA CTGGGTGCGG ACGGCCGCCG CACTCGGCGA AATCCGCGGA CACCGGGAAT CGGGCCATCG GCGCAAGCGC CTCGCCTTGC CGCAGGACCG CGGCGGCTTG CAGGTCGAAG CGCTCGGCCT CGTCGCCGTC GCGGCGGGGC TGGTCGCGTG GTTGGCCGCG GGCGACGCAC GCTGGGCCGG CCTCGTCCCG GCGATCGCCG TCGCCGGTGC GGGGACGGCC GCGCTCCTCG CCAAGTCGCT GCTCGACTTC TCGCAGACCG GTCCGCGCAT CCGCGAGGAA ATCCTCACCC ACTACCGGCG CTACACGCTG TTCGCGCTCG CCCGGCTGTC GGCCTGGGCC TTCGCCGTGA TGCTGTTCAC CGCCGGCGGC GGCGTGCTCG TGTACTTCAT GCTCGCGAGC GCGCTGAGCA CGACCTGGTC GAGCGCGCTC GCGACCGTCG CCGCCCTGCT CGGCCTGCTC GCCGGCTTCG CGCTGCAGTT CGTGCGCAAG CTGCGGTTCA ATCCTGGCCT CCTCGTGGCG TCGATGCATT ACCGCGTGAG CCGGCTCTAT GGCCTGTGGC GGGTGGCGAC GCCACAACGC ATCCGCGCCG CCGAATATAT CGGCGGCGGC GCGCTCGTGC TCCTGCTGGC GGTCGCGTCG GTGCAACTCG CGCGCGCGAT GCGCCTCGAC GACCTCGTGG CCCTGTGGGG AAGCATGCTG CTCTATCTGG GAACGATCGC CTGGGCGGCC TGGGAGCCGC CGGCCGGGCC GTCGCGCAGG CGGCCGCCTC GCGGGCCGGG CCGGCCGCCC AACATCCTGA TGATCGGCTC GGACACTCTA CGCGCCGACC GGCTCGGCGC GCTCGGCTAT CGTCGCGCGC TGACGCCGAA CCTCGACCGC CTGGCCTCAA CGAGCGCGCT CTTCGCCAAC TGCTATGTGC CCTGCGCCCG CACCGCACCG AGCCTGATCT CGATGCTGAC CGGGACCTGG CCGCACACCC ACGGCATCCG CGACAATTTC GTCGACGACG AGAGCACGCG TCTGAAGGTC GACGCACTGC CTGTGCTGCT CAAGCAGGCC GGCTACCGCA GCGCGGCGAT CTCGGACTGG TGCGGGGCCG ACATGGGCAA GTTTTCCTTC GGCTTCGACT ACACCGATTT GCCCGAAGAC CAGTGGAACC TGAAATATCT GATCCGACAG GGCCCCAAGG ACCTGCGGCT CTTCGTTTCG CTGTTCACCC ACAACCGGCT CGGCCGGCTG CTCCTGCCCG AACTCTATTA TCTCGGCGGC GTCCCGCTCA CCGGGCCGCT GGGAAGCCGG GCGCGGCGGC TCGTCTCGCG TCTAGCCGAA AGCGACGCGC CCTTCCTGCT CAACGTCTTC TATTCGACCA CGCATCCGCC TTTCGCATCC GAATGGCCGT GGTATACCCG CTTCGCCGAC CCGGGCTACG CCGGCGAGTC GAAATTCGCG ATGGCGCGGC TGACCGATCC GTTCGAGATC ATCCGCCGGC AGGGCGCGCC CAAGGAGGAA TTCGACCTCG ACCAGATCAT CGATCTCTAC GACGGCTGCG TCGCCGCCTT CGACACCGAG ATCGGCAAGA TGCTCGCCCA CCTCGAGGCC TGCGGACTCG CCGACGACAC CATCGTCGCC GTCTATTCGG ACCACGGCAT GGAATTCTTC GAACACGACA CCTGGGGCCA GGGCAACTCC GCGGTCGGCG ACTTCAGCCC GCGCATCCCG CTCGTGATTC ACGACCCGAG GCGGCCCGGG CGCGGCACGG TCGCCCAGGT CGTGCGCTCG ATCGATCTCG CCCCGACCCT GCTCGAACTC GCCGGCCTGC CGGCGCCGGC GAGCATGGAC GGCGCGTCAC TCGTCGGCTG TCTGGCGCCG GCGGGCCCCT GCCCCGATCT CGACGCCTTC AACGAAACCG GCATCTGGCT CGCCGACGTG CCGGGCCTGC CCGAGCAGCA TTTGCGCTAC CCCGACGTGC TCGAACTCAT GGGCGTTCCC AACCGCGAAA GCGGAACGCT CGCGATCAAG ACCCAGTACG CGCCGATCAC GCTGCAGGCC AAGGACAGGA TGCTGAGGCG CGGGCGCTGG AAGCTCGTCT ATCAGCCGCT CGAAAACGGC TGCCTGCTGC GGCTTTTTGA CGTCGAGAGC GACCCGGCCT GCCAGCACGA CGTGTCGGAC GCGCATCCCG ACGTCAAGGC CGAACTCTGG GCACGGCTGC AAAGCTTCCT CGACAGCGGC GGCTGA
|
Protein sequence | MAAPETQSSV SALTVLICTH NRADLLERTL ASLNRAQRPE MPVRILVAAN ACSDDTVARM QRYQAVQARE NLLPLDVVEV ATPGKSHALN TALPLIETEL TTFVDDDHRV DEAFLVAIEQ ASRRWPEAGL YCGRILPDWD GSEPAWVHDE GPYRIYPLPV PRYDQGDEPA AITAERGPIP GGGNLTLRRA VFDLGGAFST ELGPRGHDLG GGEDSEYVLR VLARGARCQY APAIVQYHYV DTERLRLRYL LKKSYQRTRS SARIQGGGSV PLYMWRKLGE YAFHGVFSRT WAQRRFYWVR TAAALGEIRG HRESGHRRKR LALPQDRGGL QVEALGLVAV AAGLVAWLAA GDARWAGLVP AIAVAGAGTA ALLAKSLLDF SQTGPRIREE ILTHYRRYTL FALARLSAWA FAVMLFTAGG GVLVYFMLAS ALSTTWSSAL ATVAALLGLL AGFALQFVRK LRFNPGLLVA SMHYRVSRLY GLWRVATPQR IRAAEYIGGG ALVLLLAVAS VQLARAMRLD DLVALWGSML LYLGTIAWAA WEPPAGPSRR RPPRGPGRPP NILMIGSDTL RADRLGALGY RRALTPNLDR LASTSALFAN CYVPCARTAP SLISMLTGTW PHTHGIRDNF VDDESTRLKV DALPVLLKQA GYRSAAISDW CGADMGKFSF GFDYTDLPED QWNLKYLIRQ GPKDLRLFVS LFTHNRLGRL LLPELYYLGG VPLTGPLGSR ARRLVSRLAE SDAPFLLNVF YSTTHPPFAS EWPWYTRFAD PGYAGESKFA MARLTDPFEI IRRQGAPKEE FDLDQIIDLY DGCVAAFDTE IGKMLAHLEA CGLADDTIVA VYSDHGMEFF EHDTWGQGNS AVGDFSPRIP LVIHDPRRPG RGTVAQVVRS IDLAPTLLEL AGLPAPASMD GASLVGCLAP AGPCPDLDAF NETGIWLADV PGLPEQHLRY PDVLELMGVP NRESGTLAIK TQYAPITLQA KDRMLRRGRW KLVYQPLENG CLLRLFDVES DPACQHDVSD AHPDVKAELW ARLQSFLDSG G
|
| |