Gene Dtox_2331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_2331 
Symbol 
ID8429315 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp2503378 
End bp2504595 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content41% 
IMG OID645034638 
ProductSte24 endopeptidase 
Protein accessionYP_003191767 
Protein GI258515545 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0501] Zn-dependent protease with chaperone function 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0205695 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCTTA AATTAAATCC TTTATGGCTA TTACTAATTG TCTCGTCAGC TTTGTTTTGC 
CTGCTTTATT TATGGTTTAC TTTGTTTCCG GGACGGGTAA TTCCGGAAGC CTGGCAGTAC
TTCAGTACAG AACAAATTAC TAACGGCAGA GACTATAACC GGACACAACA GCTGCTTTTT
ATAAGCGGGT TTATTTTAAA GGCATTATTT TTAGGGTGGT TTGTTTTTAG CGGTAAAGCA
GTTGCTGTTT CCAAATATCT GGAGAGACTT ACCGGCAGCT ACCGCTGGAG CCTGCTTATA
TTTTTTATAT TGATTTGGCT GCTGCTAAAA GTTATAAACT TGCCTTTAAC GCTCTATGGA
AGTTACTTTT TTCAGCACCG GTGGGGTTTT TCCACACAGA GCCTGGGTTT CTGGTGGCTA
GACTATTTTA AAGGGTCGGT CCTGGATTTA ATACTATCTA TGGCAGGTTT TATAATATTA
TTTTGGAGTT TTGCTCGTTG GCCAAGAACC TGGTGGTTAG CCTGTGCAGT TCTTATTTCT
TTTTGGCTGC TAATTCAAAG TTTTTTATGG CCCGTATTAA TATCACCCCT CTTCAACCGC
TTTGAACCGG CGACTGACCC GGCAATCATC AATATGGTAC ATAATATTTC CCAAAAGGCG
GGTTTGGAGA TAGAGCAGGT GCTGGTTATG GATGCCAGTC GAAGAACTAC CAAGGCCAAT
GCTTATTTTA CCGGCCTGGG GCATACCAAG CGTATTGTGC TATATGATAA TTTATTGAAT
AACTATTCTC TCGATGAGGT CGAGGCAGTA ATTGCGCATG AAATGGCGCA CTGGAAGCAA
GGACATATTG TGCAGGGACT CCTCTGGGGC ATAGTGCTTA CCTTTATGCT TTGGTTAGTG
CTGTTTCTGC TGCTAAAACA AGCAGTACCG CTGAATACAC GCTTCCCGCC TTTTGTATGG
CCGTTAATTT TGCTGTATTT TTTACTGGTT TCTTTTGTCG GAAGCCCGGC AGAAAACTAT
ATCTCAAGGA GTATGGAAAA AGAAGCTGAT CAGGTTGCAG TTAAGCTCAC GGAAAACGAA
GCTGCGGCTA TACGCCTTCA GAAAAATTTA TCTGTTAAAA ATACATCGGA TGTATCACCC
CCGGCTTTTA TCAGGTGGTT TGATTATTCA CACCCACCGG TCATAGAAAG GATTAATAAT
ATAAAACAAA TTAAATAA
 
Protein sequence
MQLKLNPLWL LLIVSSALFC LLYLWFTLFP GRVIPEAWQY FSTEQITNGR DYNRTQQLLF 
ISGFILKALF LGWFVFSGKA VAVSKYLERL TGSYRWSLLI FFILIWLLLK VINLPLTLYG
SYFFQHRWGF STQSLGFWWL DYFKGSVLDL ILSMAGFIIL FWSFARWPRT WWLACAVLIS
FWLLIQSFLW PVLISPLFNR FEPATDPAII NMVHNISQKA GLEIEQVLVM DASRRTTKAN
AYFTGLGHTK RIVLYDNLLN NYSLDEVEAV IAHEMAHWKQ GHIVQGLLWG IVLTFMLWLV
LFLLLKQAVP LNTRFPPFVW PLILLYFLLV SFVGSPAENY ISRSMEKEAD QVAVKLTENE
AAAIRLQKNL SVKNTSDVSP PAFIRWFDYS HPPVIERINN IKQIK