Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_4157 |
Symbol | |
ID | 8431171 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | - |
Start bp | 4332652 |
End bp | 4336266 |
Gene Length | 3615 bp |
Protein Length | 1204 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 645036350 |
Product | S-layer domain protein |
Protein accession | YP_003193448 |
Protein GI | 258517226 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00151258 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTTCTGG ATATGTTTGG GGGGATGTTA TTGAATCAAT ACATAAAGTT TTCTGTTCTG CTGGCCGGTT TAACACTGTT TTTTTGTCAG GGAATAGTTT TAAATCCTTT CCGGTGTACA GCAGAGGAGC AGATTTCGTT TTTAAATGAC CGGGCCAGGG ATATTGCCGG AGCAACGGCA GTAAACACTC CGGGATTTTT AGTTCCCGGC GGTCTAAGCG GGGAAGGGCA AATAGTTGCC ATTGCCGACA GCGGGCTGGA TAAAGGTTCT CTGGAGGACA TTCACCCCGA TTTAAAAAGC ACACCCGGCA AAATGCCTAA AGTGGTAATG CTCAAGTCCT GGGCAGGCCG TGACGTGCCG GATGACCCGA TTGGTCACGG TACTCACATG GCTGCTACGT TGGCCGGTAC AGGTGCCGCT TCCGGAGGCA AATTTCGCGG CATCGCCCCT GGCGCCAGCT TGTACTTTCA GGCTATTTTA AATAAGTATG GTGAATCGCA GCCGCCGGAG AATCTGGCGG ATCTGTTCCG GCCCGCCTAC CAGGCGGGAG CCAGAATTCA TGTGGACGGG TGGGGCGGCG GATCGGATGT TTACCGGGAA TCATCCTCTC AGGTGGATGA TTTTGTGCGC AGCCATCCGG ATTTTCTGGT TATCTTTGGT GCAGGCAACA GCGGGCCTTC GGACAGGACG ATTACTGCTG AGGCCAACAG TAAAAATGCT CTGGCCGTAG GGGCCTCCGT TTTGCCCCGG CCGGCTCTGG TGCCCGGCGC AGGCGATACT TCCTCCCAGG CCGTGTTTTC CTCACGCGGT CCCACAGGCG ACGGGAGAAT CAAGCCCGAA CTGCTGGCTC CCGGTTCAGC CGTGATCTCT GCCCGTTCCA GTTTGATAGA AGGAAACTTG ACCGGGTTTC CCGAATACAC GAGCATGCAG GGGACAAGCA TGGCATCGGC GGTTGCCGGG GGGTCGGCAG CTCTGCTGCG CGAATATATG AAGAAATATC TGATTATACC AGATCCGTCG GCGGCGCTGC TGAAGGCGAT TTTGATTAAC GGGGCCAGAA CTGCTGAAGG CGGGCCTTCC AAAGAAGGCT TTGGCGTGCT TGATTTGTCC GCAAGCACAA TTGCCCTAAA GGACGGTGCT TTTCAATTTA CCGATGAAAT TGCCGGAGTG GCGCAGGAGG AAGAAAAGAC ATATACCTTC CATGTCGCTG ATCCTTCGGC GCCTGTAAAG GTAACTCTGG ACTGGACTGA TCCGCCTGAC ACCGCGGGCA GCGGGAGCAC ACTGGTTAAC GATCTTGATT TAATTGTTAA GACACCGGAC GGAAAAGTCT TTTATGGCAA TCATTTTCTG GGTGCCAATA CCCCGGACCG GTTAAATAAT GTTGAGCAGG TATTTCTGCC CTCACCCGAG CCGGGTGAGT ATACAGTTCA TGTGGTCGGT GCGGCCGTGC TTAAGAATAC AGGGTATAAC AGCAGTAAAC CGGCGCAGGA CTATGCGCTT GTTTACGGGC AGGCGCCTGT TGAGGGAGTG CTGCAAAAGA CTGCCGGCAA ACCTGTTATT AAAAAAGACG GTAAAACACT GAGCATGCCC CAAAAACCGC TTATTAATCT GATAGATGAC GGTATTATTG CGGCGGATGA CGCGCACCTG TTTACCGGAG CGGAAGTCCT TATGACTGAA AAGCAGGTTT ATTTAGTATC TCGAGTCTGG CGGGCCAATG CTGTGAAGGT GCTTAACACT GCTGAAGGAA CGGTTTTTTC GGAAATCAAT CCTGATAACA GGTTGGGAGG CTTCTATTTA GCTCCGGACG GAGCGGACCT TTTGTTAAAT GACAGTCCGT CTTCTCCCGA TAAATTTCCC ACGGGAGTTG AAATCAATGC CGTAGTCAAC CCGCTTGACC AAAAAATCAG ATGGGCGCGT GCTGCCAACA GCGAGCGTAA AGGTGTAATC TTGGAAGTGC AGGATGAAAA CGGCTTAAAG AAGATATCTC TTGCCGGCGA TAAGACTTCC TATCAGGTTA TGCCCGGCGC CGTTTATTCC TATGAAGATG ATTACGGAAA ATCTGAGCTG GCTGATATGC CCTTTGGTAC CGGGGCGCTG GATGAATTGG AGGACGTGTT GCCCGGTATG CCGGTTACCT TCAGGCTTGC ACCCTCCACC AGACAGGTGC AATACCTGGC CGTGCAGAGG CAGGTAATTC TTGGGACTGT GCGCGGAATC ACCGCTGCTG GTGAAATTAA AATGGAAAAC GGTTCTCTTT TGCGGCTTTT TCCGGGTGCT CCGGTAAATA AAGATAAGGA AAGTTCTGAT GTAAGGAGTT TGAAACAAGG CGACCACATA TCAGCCGTAA TTTTGCCTGA CACAGGAGAA GCTATTGGAT TAGTAGCCTA TAGCAAGGTG TTTTATGGAA AGGTTATTGA CTGCAGCAAA AAAAGCGGCA AGCTCTATTT ACAGGATGAC AGTGGTTCCT ATCTTTCGTT TGATCTTTCT CCCCAGTCAA TTATATATCG TTGGGGTGTG AGAGGCTCTG CTGAGTCAAT TGATGTCGGG CTCAGAATCA GGATCACTGT GGACCCGCTG CAAAATGAAG TGTGGCGCTT GGATATAGCG GATACTGCTT TTGAGCAGGG AACACTGGCA GGCTATAATA AGACAGACAA TATTATTACC ATGAAAGAAT CCGGTAAGTA TCTGATTTCC GATTCGACCA GGTTCTCCAA AAACGGGTAT CAGGTTACGC CGAATGATTT GCTGACCGGT GAGAAGATTG AATTGGAATA TGCCGCGGTT CCGCAGCTCG GCAATGTTTT GCTTTCCGTA AGCGCTCAAA ATAAGGTGCC GGCTCCTTTA TTGACAACCG CCGGTTTATT TGCGGACAAC AAATTAAAAT TGTCAGGCAA AACCGATCCT GACACTAAAC TTTATATAAG AAATAAAGAT GGTTTAATCC GGACACCGGT TGTGGACGAC TCGGGAAGGT TTACTTTTAG CATGCCAATA AGGGAAAAAG AGGACCAAAC TATTAACCTG GTGGTTTTAA ACGAAAAAAC CGGGGGAATT AACGGCAGTC ACTTAACCCT GGCTAATCTT AATAACAATC CTATAAATGC AATGTCTTGG GCAACAACTG AAAAAAAGAC GACATTGATG AGCGGTACCT TGTTTGATTG GCCCTTGACC AGGAGCGAAG CCACAGTTGC AATGTCACAG GTGTTTAACT GGTCTGATAT AAGCAGTAGG AGGCTTTCTT TTTCCGATAT AAATCATTTA TCGCTGCCTT ACCGGACAGC TATTGCCGAA GCCAGTGCCC GCGGTATCTT TAAAGGCTAT GCGGACGGCA GCTTTCACCC CGACGGTATT CTGAATCGCG CTGAGGCCGC AGTGATTTTA GCGGCATTAA TAAAGGATTT GAATATTAAA AGCCAGCCTG CTTCTGCCGG GGTTTATTCG GACATTGGTG AAATACCGCA TTGGTCTGCT TCTGCTGTTG ACTTGACTAC AGCCTCAGGC ATTTTTCACG GACATGCCGA CGGCAGCTTT GCACCGGACG AGACGGTTAC CGCAAGAAAA TTTGAAACTC TTTTGGAGCG TGTGATTGAA TTATATATAA AATAA
|
Protein sequence | MFLDMFGGML LNQYIKFSVL LAGLTLFFCQ GIVLNPFRCT AEEQISFLND RARDIAGATA VNTPGFLVPG GLSGEGQIVA IADSGLDKGS LEDIHPDLKS TPGKMPKVVM LKSWAGRDVP DDPIGHGTHM AATLAGTGAA SGGKFRGIAP GASLYFQAIL NKYGESQPPE NLADLFRPAY QAGARIHVDG WGGGSDVYRE SSSQVDDFVR SHPDFLVIFG AGNSGPSDRT ITAEANSKNA LAVGASVLPR PALVPGAGDT SSQAVFSSRG PTGDGRIKPE LLAPGSAVIS ARSSLIEGNL TGFPEYTSMQ GTSMASAVAG GSAALLREYM KKYLIIPDPS AALLKAILIN GARTAEGGPS KEGFGVLDLS ASTIALKDGA FQFTDEIAGV AQEEEKTYTF HVADPSAPVK VTLDWTDPPD TAGSGSTLVN DLDLIVKTPD GKVFYGNHFL GANTPDRLNN VEQVFLPSPE PGEYTVHVVG AAVLKNTGYN SSKPAQDYAL VYGQAPVEGV LQKTAGKPVI KKDGKTLSMP QKPLINLIDD GIIAADDAHL FTGAEVLMTE KQVYLVSRVW RANAVKVLNT AEGTVFSEIN PDNRLGGFYL APDGADLLLN DSPSSPDKFP TGVEINAVVN PLDQKIRWAR AANSERKGVI LEVQDENGLK KISLAGDKTS YQVMPGAVYS YEDDYGKSEL ADMPFGTGAL DELEDVLPGM PVTFRLAPST RQVQYLAVQR QVILGTVRGI TAAGEIKMEN GSLLRLFPGA PVNKDKESSD VRSLKQGDHI SAVILPDTGE AIGLVAYSKV FYGKVIDCSK KSGKLYLQDD SGSYLSFDLS PQSIIYRWGV RGSAESIDVG LRIRITVDPL QNEVWRLDIA DTAFEQGTLA GYNKTDNIIT MKESGKYLIS DSTRFSKNGY QVTPNDLLTG EKIELEYAAV PQLGNVLLSV SAQNKVPAPL LTTAGLFADN KLKLSGKTDP DTKLYIRNKD GLIRTPVVDD SGRFTFSMPI REKEDQTINL VVLNEKTGGI NGSHLTLANL NNNPINAMSW ATTEKKTTLM SGTLFDWPLT RSEATVAMSQ VFNWSDISSR RLSFSDINHL SLPYRTAIAE ASARGIFKGY ADGSFHPDGI LNRAEAAVIL AALIKDLNIK SQPASAGVYS DIGEIPHWSA SAVDLTTASG IFHGHADGSF APDETVTARK FETLLERVIE LYIK
|
| |