Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tbd_0563 |
Symbol | |
ID | 3673673 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thiobacillus denitrificans ATCC 25259 |
Kingdom | Bacteria |
Replicon accession | NC_007404 |
Strand | - |
Start bp | 591642 |
End bp | 593474 |
Gene Length | 1833 bp |
Protein Length | 610 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637709234 |
Product | sulfate thiol esterase SoxB |
Protein accession | YP_314321 |
Protein GI | 74316581 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000246299 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.545825 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAACATG GGCAGGAGCT ACTATTTACC GGGCTCGCCG GCAAACTGAC AATAGAGCCG ATTCCAGCTA CCGATTCCGA GGAGACCCCC ATGCACATGA ACCGCCGCGA GTTCCTGCAA CTGCTCGCCG TCGCGGCCGC GTCCGGCATG GCGCTGGACA GCAAGTCCGC GCTCGCCGGC AAGGCCCCGG CCAATTTCTA CGACGTGCCG CGTCACGGCA ACGTCTCCTT CCTGCACTTC ACCGACTGCC ACGCCCAGCT GCTGCCGGTG TGGTTCCGCG AGCCCAACGT CAACCTCGGG GTCGGCGCCG CGCACGGCAA GGCGCCGCAC CTGGTCGGCC AGCATCTGCT CAAACAGTTC GGCATCAAGC CGGGCAGCGC CGAAGCCCAT GCCTTCACTT ACCTCGACTT CACCGAAGCG GCCAAGGTCT ACGGCAAGGT CGGGGGCTTC GCCCATCTGA AAACGCTCGT GGACAAGCTG CGCGCGCAGC GCCCGGGCGC GCTGCTGCTC GACGGCGGCG ATACCTGGCA GGGCTCGGCG ACTTCGCTGT GGACCAACGC TCAGGACATG GTCGACGCCT GCATCAAGCT TGGCGTCAAT GTCATGACAC CCCACTGGGA GTCGACCTTC GGCGCCGAGC GCGTGCTGGA AATCGTGAAC GGCGACTTCA AGAAGGCGAA CATCGATTTC GTCGCGCAGA ACGTCGTCAC CAACGACTTC GGCGACCAGG TGTTCAAGCC CTACGTCATG AAGGACATGA ACGGCGTCAA GGTCGCGGTG ATCGGCCAGG CTTTCCCCTA CACGCCGATC GCGAACCCGC GCCACATGGT GCCGGACTGG AGCTTCGGCA TCCGTGACGA CAGCATGCAG CAGTTCGTCG ACCAGGCCCG CGCCGAAGGC GCGAAGGTCG TCGTCGTGCT CTCGCACAAC GGCATGGACG TCGACCTCAA GATGGCGAGC CGCGTGACCG GCATCGACGC GATCTTCGGC GGACACACCC ACGACGGCGT GCCGCAGCCG ACGAAAGTCA AGAACGCAAA GGGCGTGACG CTGGTCACCA ATGCCGGCTC CAACGGCAAG TTCCTCGGTG TCATGGACTT CGACGTGCGC GGCGGCAAGG TGCAGAGCTA CAAATACCGC CTGCTGCCCG TGTTCTCCAA CCTGCTGCCC GCCGACCCGG GCATGGACGC CTTCATCAAG CAGGTGCGCG CGCCCTACGA AGCCAAGCTG AGCGAAAAGC TCGCCGTCAC CGACGACTTC CTCTACCGCC GCGGCAACTT CAACGGCACC TGGGACCAGT TGCTGGTCGA CGCGCTGATG GAGGTCAAGG GCGCCGACGC GGCGTTCTCG CCCGGTTTCC GCTGGGGGAC CACGCTGCTG CCCGGTGACG CGATCACGAT GGAACGGCTG ATGGACCAGA CCGCGATCAC CTATCCGCAG ACCACGCTCA CCGAGATGAC GGGCGAAACG ATCAAGACGA TCATGGAAGA CGTCGCCGAC AATCTGTTCA ACGCCGACCC GTACTACCAG CAGGGCGGCG ACATGGTCCG CGTCGGCGGC ATCGAGTACA CGATCGACCC GAACAAGAAG ATCGGCCAGC GCATCGGCGA CATGAAGCTG AACGGGAAAC CGGTCAGCGC CGACAAGACC TACATGGTCG CGGGCTGGGC GCCGGTCGGC GAAGGCGTCC AAGGCGAACC GGTATGGGAC GTCGTCGCCA CTTACCTGCG CGACAAGAAA GTGATCAAAG GCCTGAAGCT CAACGAACCG AAGATCGTCG GCGTGGGCAA ATCCAATCCA GGCATCGCTG CCTACTCGGG CGGCCTCTCG TAA
|
Protein sequence | MEHGQELLFT GLAGKLTIEP IPATDSEETP MHMNRREFLQ LLAVAAASGM ALDSKSALAG KAPANFYDVP RHGNVSFLHF TDCHAQLLPV WFREPNVNLG VGAAHGKAPH LVGQHLLKQF GIKPGSAEAH AFTYLDFTEA AKVYGKVGGF AHLKTLVDKL RAQRPGALLL DGGDTWQGSA TSLWTNAQDM VDACIKLGVN VMTPHWESTF GAERVLEIVN GDFKKANIDF VAQNVVTNDF GDQVFKPYVM KDMNGVKVAV IGQAFPYTPI ANPRHMVPDW SFGIRDDSMQ QFVDQARAEG AKVVVVLSHN GMDVDLKMAS RVTGIDAIFG GHTHDGVPQP TKVKNAKGVT LVTNAGSNGK FLGVMDFDVR GGKVQSYKYR LLPVFSNLLP ADPGMDAFIK QVRAPYEAKL SEKLAVTDDF LYRRGNFNGT WDQLLVDALM EVKGADAAFS PGFRWGTTLL PGDAITMERL MDQTAITYPQ TTLTEMTGET IKTIMEDVAD NLFNADPYYQ QGGDMVRVGG IEYTIDPNKK IGQRIGDMKL NGKPVSADKT YMVAGWAPVG EGVQGEPVWD VVATYLRDKK VIKGLKLNEP KIVGVGKSNP GIAAYSGGLS
|
| |