Gene Dtox_4157 details

Gene Information

Locus tag: Dtox_4157
Symbol:
ID: 8431171
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 4332652
End bp: 4336266
Gene Length: 3615 bp
Protein Length: 1204 aa
Translation table: 11
GC content: 49%
IMG OID: 645036350
Product: S-layer domain protein
Protein accession: YP_003193448
Protein GI: 258517226
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
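
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(4332652, 4336266)
print(length, protein_length(length))  # 3615 1204
```

Both results match the Gene Length and Protein Length fields reported for Dtox_4157.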


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00151258
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTTCTGG ATATGTTTGG GGGGATGTTA TTGAATCAAT ACATAAAGTT TTCTGTTCTG 
CTGGCCGGTT TAACACTGTT TTTTTGTCAG GGAATAGTTT TAAATCCTTT CCGGTGTACA
GCAGAGGAGC AGATTTCGTT TTTAAATGAC CGGGCCAGGG ATATTGCCGG AGCAACGGCA
GTAAACACTC CGGGATTTTT AGTTCCCGGC GGTCTAAGCG GGGAAGGGCA AATAGTTGCC
ATTGCCGACA GCGGGCTGGA TAAAGGTTCT CTGGAGGACA TTCACCCCGA TTTAAAAAGC
ACACCCGGCA AAATGCCTAA AGTGGTAATG CTCAAGTCCT GGGCAGGCCG TGACGTGCCG
GATGACCCGA TTGGTCACGG TACTCACATG GCTGCTACGT TGGCCGGTAC AGGTGCCGCT
TCCGGAGGCA AATTTCGCGG CATCGCCCCT GGCGCCAGCT TGTACTTTCA GGCTATTTTA
AATAAGTATG GTGAATCGCA GCCGCCGGAG AATCTGGCGG ATCTGTTCCG GCCCGCCTAC
CAGGCGGGAG CCAGAATTCA TGTGGACGGG TGGGGCGGCG GATCGGATGT TTACCGGGAA
TCATCCTCTC AGGTGGATGA TTTTGTGCGC AGCCATCCGG ATTTTCTGGT TATCTTTGGT
GCAGGCAACA GCGGGCCTTC GGACAGGACG ATTACTGCTG AGGCCAACAG TAAAAATGCT
CTGGCCGTAG GGGCCTCCGT TTTGCCCCGG CCGGCTCTGG TGCCCGGCGC AGGCGATACT
TCCTCCCAGG CCGTGTTTTC CTCACGCGGT CCCACAGGCG ACGGGAGAAT CAAGCCCGAA
CTGCTGGCTC CCGGTTCAGC CGTGATCTCT GCCCGTTCCA GTTTGATAGA AGGAAACTTG
ACCGGGTTTC CCGAATACAC GAGCATGCAG GGGACAAGCA TGGCATCGGC GGTTGCCGGG
GGGTCGGCAG CTCTGCTGCG CGAATATATG AAGAAATATC TGATTATACC AGATCCGTCG
GCGGCGCTGC TGAAGGCGAT TTTGATTAAC GGGGCCAGAA CTGCTGAAGG CGGGCCTTCC
AAAGAAGGCT TTGGCGTGCT TGATTTGTCC GCAAGCACAA TTGCCCTAAA GGACGGTGCT
TTTCAATTTA CCGATGAAAT TGCCGGAGTG GCGCAGGAGG AAGAAAAGAC ATATACCTTC
CATGTCGCTG ATCCTTCGGC GCCTGTAAAG GTAACTCTGG ACTGGACTGA TCCGCCTGAC
ACCGCGGGCA GCGGGAGCAC ACTGGTTAAC GATCTTGATT TAATTGTTAA GACACCGGAC
GGAAAAGTCT TTTATGGCAA TCATTTTCTG GGTGCCAATA CCCCGGACCG GTTAAATAAT
GTTGAGCAGG TATTTCTGCC CTCACCCGAG CCGGGTGAGT ATACAGTTCA TGTGGTCGGT
GCGGCCGTGC TTAAGAATAC AGGGTATAAC AGCAGTAAAC CGGCGCAGGA CTATGCGCTT
GTTTACGGGC AGGCGCCTGT TGAGGGAGTG CTGCAAAAGA CTGCCGGCAA ACCTGTTATT
AAAAAAGACG GTAAAACACT GAGCATGCCC CAAAAACCGC TTATTAATCT GATAGATGAC
GGTATTATTG CGGCGGATGA CGCGCACCTG TTTACCGGAG CGGAAGTCCT TATGACTGAA
AAGCAGGTTT ATTTAGTATC TCGAGTCTGG CGGGCCAATG CTGTGAAGGT GCTTAACACT
GCTGAAGGAA CGGTTTTTTC GGAAATCAAT CCTGATAACA GGTTGGGAGG CTTCTATTTA
GCTCCGGACG GAGCGGACCT TTTGTTAAAT GACAGTCCGT CTTCTCCCGA TAAATTTCCC
ACGGGAGTTG AAATCAATGC CGTAGTCAAC CCGCTTGACC AAAAAATCAG ATGGGCGCGT
GCTGCCAACA GCGAGCGTAA AGGTGTAATC TTGGAAGTGC AGGATGAAAA CGGCTTAAAG
AAGATATCTC TTGCCGGCGA TAAGACTTCC TATCAGGTTA TGCCCGGCGC CGTTTATTCC
TATGAAGATG ATTACGGAAA ATCTGAGCTG GCTGATATGC CCTTTGGTAC CGGGGCGCTG
GATGAATTGG AGGACGTGTT GCCCGGTATG CCGGTTACCT TCAGGCTTGC ACCCTCCACC
AGACAGGTGC AATACCTGGC CGTGCAGAGG CAGGTAATTC TTGGGACTGT GCGCGGAATC
ACCGCTGCTG GTGAAATTAA AATGGAAAAC GGTTCTCTTT TGCGGCTTTT TCCGGGTGCT
CCGGTAAATA AAGATAAGGA AAGTTCTGAT GTAAGGAGTT TGAAACAAGG CGACCACATA
TCAGCCGTAA TTTTGCCTGA CACAGGAGAA GCTATTGGAT TAGTAGCCTA TAGCAAGGTG
TTTTATGGAA AGGTTATTGA CTGCAGCAAA AAAAGCGGCA AGCTCTATTT ACAGGATGAC
AGTGGTTCCT ATCTTTCGTT TGATCTTTCT CCCCAGTCAA TTATATATCG TTGGGGTGTG
AGAGGCTCTG CTGAGTCAAT TGATGTCGGG CTCAGAATCA GGATCACTGT GGACCCGCTG
CAAAATGAAG TGTGGCGCTT GGATATAGCG GATACTGCTT TTGAGCAGGG AACACTGGCA
GGCTATAATA AGACAGACAA TATTATTACC ATGAAAGAAT CCGGTAAGTA TCTGATTTCC
GATTCGACCA GGTTCTCCAA AAACGGGTAT CAGGTTACGC CGAATGATTT GCTGACCGGT
GAGAAGATTG AATTGGAATA TGCCGCGGTT CCGCAGCTCG GCAATGTTTT GCTTTCCGTA
AGCGCTCAAA ATAAGGTGCC GGCTCCTTTA TTGACAACCG CCGGTTTATT TGCGGACAAC
AAATTAAAAT TGTCAGGCAA AACCGATCCT GACACTAAAC TTTATATAAG AAATAAAGAT
GGTTTAATCC GGACACCGGT TGTGGACGAC TCGGGAAGGT TTACTTTTAG CATGCCAATA
AGGGAAAAAG AGGACCAAAC TATTAACCTG GTGGTTTTAA ACGAAAAAAC CGGGGGAATT
AACGGCAGTC ACTTAACCCT GGCTAATCTT AATAACAATC CTATAAATGC AATGTCTTGG
GCAACAACTG AAAAAAAGAC GACATTGATG AGCGGTACCT TGTTTGATTG GCCCTTGACC
AGGAGCGAAG CCACAGTTGC AATGTCACAG GTGTTTAACT GGTCTGATAT AAGCAGTAGG
AGGCTTTCTT TTTCCGATAT AAATCATTTA TCGCTGCCTT ACCGGACAGC TATTGCCGAA
GCCAGTGCCC GCGGTATCTT TAAAGGCTAT GCGGACGGCA GCTTTCACCC CGACGGTATT
CTGAATCGCG CTGAGGCCGC AGTGATTTTA GCGGCATTAA TAAAGGATTT GAATATTAAA
AGCCAGCCTG CTTCTGCCGG GGTTTATTCG GACATTGGTG AAATACCGCA TTGGTCTGCT
TCTGCTGTTG ACTTGACTAC AGCCTCAGGC ATTTTTCACG GACATGCCGA CGGCAGCTTT
GCACCGGACG AGACGGTTAC CGCAAGAAAA TTTGAAACTC TTTTGGAGCG TGTGATTGAA
TTATATATAA AATAA
 
Protein sequence
MFLDMFGGML LNQYIKFSVL LAGLTLFFCQ GIVLNPFRCT AEEQISFLND RARDIAGATA 
VNTPGFLVPG GLSGEGQIVA IADSGLDKGS LEDIHPDLKS TPGKMPKVVM LKSWAGRDVP
DDPIGHGTHM AATLAGTGAA SGGKFRGIAP GASLYFQAIL NKYGESQPPE NLADLFRPAY
QAGARIHVDG WGGGSDVYRE SSSQVDDFVR SHPDFLVIFG AGNSGPSDRT ITAEANSKNA
LAVGASVLPR PALVPGAGDT SSQAVFSSRG PTGDGRIKPE LLAPGSAVIS ARSSLIEGNL
TGFPEYTSMQ GTSMASAVAG GSAALLREYM KKYLIIPDPS AALLKAILIN GARTAEGGPS
KEGFGVLDLS ASTIALKDGA FQFTDEIAGV AQEEEKTYTF HVADPSAPVK VTLDWTDPPD
TAGSGSTLVN DLDLIVKTPD GKVFYGNHFL GANTPDRLNN VEQVFLPSPE PGEYTVHVVG
AAVLKNTGYN SSKPAQDYAL VYGQAPVEGV LQKTAGKPVI KKDGKTLSMP QKPLINLIDD
GIIAADDAHL FTGAEVLMTE KQVYLVSRVW RANAVKVLNT AEGTVFSEIN PDNRLGGFYL
APDGADLLLN DSPSSPDKFP TGVEINAVVN PLDQKIRWAR AANSERKGVI LEVQDENGLK
KISLAGDKTS YQVMPGAVYS YEDDYGKSEL ADMPFGTGAL DELEDVLPGM PVTFRLAPST
RQVQYLAVQR QVILGTVRGI TAAGEIKMEN GSLLRLFPGA PVNKDKESSD VRSLKQGDHI
SAVILPDTGE AIGLVAYSKV FYGKVIDCSK KSGKLYLQDD SGSYLSFDLS PQSIIYRWGV
RGSAESIDVG LRIRITVDPL QNEVWRLDIA DTAFEQGTLA GYNKTDNIIT MKESGKYLIS
DSTRFSKNGY QVTPNDLLTG EKIELEYAAV PQLGNVLLSV SAQNKVPAPL LTTAGLFADN
KLKLSGKTDP DTKLYIRNKD GLIRTPVVDD SGRFTFSMPI REKEDQTINL VVLNEKTGGI
NGSHLTLANL NNNPINAMSW ATTEKKTTLM SGTLFDWPLT RSEATVAMSQ VFNWSDISSR
RLSFSDINHL SLPYRTAIAE ASARGIFKGY ADGSFHPDGI LNRAEAAVIL AALIKDLNIK
SQPASAGVYS DIGEIPHWSA SAVDLTTASG IFHGHADGSF APDETVTARK FETLLERVIE
LYIK
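
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the sequences are pasted with the space-separated 10-character blocks used above, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE`, `clean`, and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; same assignments as table 11).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
print(translate("ATGTTTCTGG ATATGTTTGG GGGGATGTTA"))  # MFLDMFGGML
```

Applied to the first 30 bases, the sketch reproduces the first block of the protein sequence, and the final 15 bases (TTATATATAAAATAA) give the closing residues LYIK followed by the TAA stop codon.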