Gene Dtox_1378 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_1378 
Symbol 
ID8428327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp1409607 
End bp1410746 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content47% 
IMG OID645033713 
ProductHtrA2 peptidase 
Protein accessionYP_003190877 
Protein GI258514655 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000042799 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGTATTTA AAAACAGAAA AATAACTTTA ACTAATTTAC TTCTGGTTAT CCTGATCTTC 
ACTGTCATAG CCGCTACGAT CACAGCCAAA AGGTCTTCCG CCGAAGAAGA CACAGGCAAT
ACGGCACAAA CCATCAGCAT GCCGGCAGTC GGCCCAAACA CCATAGCTGA TATGGTGGAT
AAAGCCAGTT CAGCAGTGGT AAAAATAAAC ACCACCGTTG AGCAGCAGGT TACCGGTGTC
AATCCCCTGT TTAGTGACCC GTTCTTCAGG GAGTTTTTCG GTCATCAATA TCAAGTGCCG
AGCAGAACCG AAGTACAGCA CGGTATTGGC TCCGGCTTTA TTATTTCTAA AGAAGGCTTG
ATTTTGACCA ACGAGCATGT CATCGACGGC GCCAGCAAGA TAGAAGTATT ACTGGATAAT
GATAAAAATC CCCTAACCGC CAAACTGGTT GGTAAAGATA AAGATCTGGA TTTAGCCGTG
TTAAAAATTG AACCGACTAA GGATTTACCG GTTTTAAAGC TTGGCAATTC CGACAATACC
AGGGTTGCTG ACTGGGTAGT GGCTATCGGT AATCCTTACG GGCTTGATCA TACTGTAACT
GTAGGTGTGG TCAGCGCTAA AAGCCGCCCG GTGGATATTG AAGACAGGCA TTATAAAAAC
CTATTGCAAA CTGACGCATC CATTAACCCC GGCAACAGCG GCGGCCCGCT CCTCAACTTG
AAGGGCGAGG TAATCGGTAT AAATACAGCC ATCAATGCCA GCGCGCAGGG TATCGGCTTT
GCTATACCCA GCAATACTGT CCAGGCAGTA CTAAATGATC TGGAAACCGG ACAATTGAAG
CATCCCTGGC TGGGAGTATC TGTACAGGCA TTAACTCAGG AGCTGGCTGA CGCTCTGGGC
TTGCAAAACA CTCAGGGAGC GCTGGTCGGC AGTGTCTCTT CCGGCGGCCC GGCGGAAAAA
GCCGGACTGC AGAGGGGGGA TGTTATTATC AAGTACAATG ATACACAAAT CGATAATGAA
CAAAAGCTGA TTGATTGCGT TCAGAAAAGC AAGGTGGGAG ATACCGCCGT AATGGTAGTT
GTCAGAAACA AAAACAATAT TTTTCTGACG GCAACTATTG AGGACAAGAA CAGCCAGTAG
 
Protein sequence
MVFKNRKITL TNLLLVILIF TVIAATITAK RSSAEEDTGN TAQTISMPAV GPNTIADMVD 
KASSAVVKIN TTVEQQVTGV NPLFSDPFFR EFFGHQYQVP SRTEVQHGIG SGFIISKEGL
ILTNEHVIDG ASKIEVLLDN DKNPLTAKLV GKDKDLDLAV LKIEPTKDLP VLKLGNSDNT
RVADWVVAIG NPYGLDHTVT VGVVSAKSRP VDIEDRHYKN LLQTDASINP GNSGGPLLNL
KGEVIGINTA INASAQGIGF AIPSNTVQAV LNDLETGQLK HPWLGVSVQA LTQELADALG
LQNTQGALVG SVSSGGPAEK AGLQRGDVII KYNDTQIDNE QKLIDCVQKS KVGDTAVMVV
VRNKNNIFLT ATIEDKNSQ