Gene Dtox_2995 details

Gene Information

Locus tag: Dtox_2995
Symbol:
ID: 8429985
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 3186311
End bp: 3187609
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 42%
IMG OID: 645035249
Product: peptidase M48 Ste24p
Protein accession: YP_003192372
Protein GI: 258516150
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0501] Zn-dependent protease with chaperone function
TIGRFAM ID:
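
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence on this page ends in TGA); the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Dtox_2995
length_bp = gene_length(3186311, 3187609)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

With the record's coordinates this gives 1299 bp and 432 aa, matching the Gene Length and Protein Length fields.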


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0104331
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.541556
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGAAC TGTCATTTGC TCGAATTGTA ACCATGTCTT TCTACTACGC AGCTTCTTCT 
GAGGATAGGA CCTACTTAAT TAACAGGTTA AAACCTGTTC TTACCCAAAA AGGAACCAGC
GTGGATTTAA GCTTTAACAC ATTAAAGCAT GAGGCTCAGC TGTTAACTTC CAACCAGAAG
CTTTTATTAA TGACATATTT TCATATGTAT GTTCTCTTGC GGGATCGCAA CCTTTTGGAG
GATGCCGATA TAAAAAGCAT CCGGGATGTT TTTTTAGTAT CGGATGATTT TGTTGCCTAT
TATATGCGTT CAATGCGGAG GGCTCATTAT AAAAAATGGA TTGACGAGTC ATCGCTTTTA
GAGTGCATAT TAAACCCCTG CGAAGAATTT CAGAAATGGG CGGAAGATAC GACTTATGGA
GCAGTACAGG AAGGGTGCTA TATAAAAAAA CGAATTCTTC ACGGCTTATC AAAGACCGAG
TATGAACACC CCCAGGATAC AGCGGCCATG GAAGCTCTCT CAAAGATCCC GGGTATTGAT
AAATTGGTTC GAAAAGTAAA TGAGCATGGT CTTGATAAAC TATACCGGGT AGTTTATTCC
GGCAGTAACA TCAAGGTAAC AACCAGGAAC TTTCCTCAAT TATACAGGGC GCTGTTGACT
GTCTGTGAAG TCTTGAATGT AGGGAAAATA CCGGAATTTT ATGTGGAGCA GGGTTTTATT
AATGCCCTTA CAGTGGGAGT TGAAAATCCT ATTGTAGTGA TTAAGTCGGC GGCTATTAGC
TTGTTGTCTT ATGATGAGCT GCTGTTTTTG TTAGGACATG AAGTTGCTCA TATCAAGAGC
GAGCATATGC TCTATCATCA AATAGCCCAG ATATTCCCAT TTATTAGCGG CCTAATGGGC
GCTATAGGTT CTCTGGTTGG TTCCGGACTC CAGGTGGCTC TTCTTAACTG GTACCGCAAG
TCTGAGTATA CGGCAGACAG GGGTGGTCTT TTGGCATGTC AGAACATTAA TGCTGCTGTA
TCCGCGATGA TGAAGATAGC AGGTGCCCCT ATGAGGTATT ACAAGGCTTT AAATCCTGCC
GATTTTCTGG AACAGGCCAG GGAGTTCGAA GGAATGGATG ATGATAAGAT GAACACCATG
GCTAAATATC TAAGCATTAT GTTTGCCGAT CACCCCTGGA CGGTTATGAG AGCAAGCGAA
ATGGATAAAT GGGTTAATAA CGGTATATAC CGGAAGGTTG TTGAGAAATG TTCAGGCTGC
TCTTCTTTGC CGGAAAGTAT GAGGGTATAT GATGGATGA
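
The GC content field reported for this gene can in principle be recomputed from the sequence above. The following is a minimal sketch, assuming that GC content is the percentage of G and C bases over the whole gene and that the spaces and line breaks in the listing are layout only; `gc_content` is a hypothetical helper name, and only the first row of the sequence is used as sample input.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence as listed in this record
first_row = (
    "ATGGATGAAC TGTCATTTGC TCGAATTGTA "
    "ACCATGTCTT TCTACTACGC AGCTTCTTCT"
)
print(f"GC content of first row: {gc_content(first_row):.1f}%")
```

Passing the full 1299 bp sequence instead of `first_row` would give a value to compare against the GC content field.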
 
Protein sequence
MDELSFARIV TMSFYYAASS EDRTYLINRL KPVLTQKGTS VDLSFNTLKH EAQLLTSNQK 
LLLMTYFHMY VLLRDRNLLE DADIKSIRDV FLVSDDFVAY YMRSMRRAHY KKWIDESSLL
ECILNPCEEF QKWAEDTTYG AVQEGCYIKK RILHGLSKTE YEHPQDTAAM EALSKIPGID
KLVRKVNEHG LDKLYRVVYS GSNIKVTTRN FPQLYRALLT VCEVLNVGKI PEFYVEQGFI
NALTVGVENP IVVIKSAAIS LLSYDELLFL LGHEVAHIKS EHMLYHQIAQ IFPFISGLMG
AIGSLVGSGL QVALLNWYRK SEYTADRGGL LACQNINAAV SAMMKIAGAP MRYYKALNPA
DFLEQAREFE GMDDDKMNTM AKYLSIMFAD HPWTVMRASE MDKWVNNGIY RKVVEKCSGC
SSLPESMRVY DG
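
The protein sequence can be derived from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the amino-acid string published by NCBI for translation table 11, whose elongation codons are the same as in the standard code. It does not handle the alternative start codons of table 11, which is an acceptable simplification here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position),
# paired with the amino-acid string of translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate DNA in frame 1; unknown codons become 'X'."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon: end of the coding region
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record
print(translate("ATGGATGAAC TGTCATTTGC TCGAATTGTA"))  # MDELSFARIV
print(translate("GATGGATGA"))                         # DG
```

The two printed fragments correspond to the first ten and last two residues of the protein sequence listed above.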