Gene Information |
Locus tag | Dtox_1378 |
Symbol | |
ID | 8428327 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | - |
Start bp | 1409607 |
End bp | 1410746 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 645033713 |
Product | HtrA2 peptidase |
Protein accession | YP_003190877 |
Protein GI | 258514655 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
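The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not state its coordinate convention) and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1409607
END_BP = 1410746


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1140 379
```

Under these assumptions the results agree with the Gene Length (1140 bp) and Protein Length (379 aa) fields.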
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000000042799 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGTATTTA AAAACAGAAA AATAACTTTA ACTAATTTAC TTCTGGTTAT CCTGATCTTC ACTGTCATAG CCGCTACGAT CACAGCCAAA AGGTCTTCCG CCGAAGAAGA CACAGGCAAT ACGGCACAAA CCATCAGCAT GCCGGCAGTC GGCCCAAACA CCATAGCTGA TATGGTGGAT AAAGCCAGTT CAGCAGTGGT AAAAATAAAC ACCACCGTTG AGCAGCAGGT TACCGGTGTC AATCCCCTGT TTAGTGACCC GTTCTTCAGG GAGTTTTTCG GTCATCAATA TCAAGTGCCG AGCAGAACCG AAGTACAGCA CGGTATTGGC TCCGGCTTTA TTATTTCTAA AGAAGGCTTG ATTTTGACCA ACGAGCATGT CATCGACGGC GCCAGCAAGA TAGAAGTATT ACTGGATAAT GATAAAAATC CCCTAACCGC CAAACTGGTT GGTAAAGATA AAGATCTGGA TTTAGCCGTG TTAAAAATTG AACCGACTAA GGATTTACCG GTTTTAAAGC TTGGCAATTC CGACAATACC AGGGTTGCTG ACTGGGTAGT GGCTATCGGT AATCCTTACG GGCTTGATCA TACTGTAACT GTAGGTGTGG TCAGCGCTAA AAGCCGCCCG GTGGATATTG AAGACAGGCA TTATAAAAAC CTATTGCAAA CTGACGCATC CATTAACCCC GGCAACAGCG GCGGCCCGCT CCTCAACTTG AAGGGCGAGG TAATCGGTAT AAATACAGCC ATCAATGCCA GCGCGCAGGG TATCGGCTTT GCTATACCCA GCAATACTGT CCAGGCAGTA CTAAATGATC TGGAAACCGG ACAATTGAAG CATCCCTGGC TGGGAGTATC TGTACAGGCA TTAACTCAGG AGCTGGCTGA CGCTCTGGGC TTGCAAAACA CTCAGGGAGC GCTGGTCGGC AGTGTCTCTT CCGGCGGCCC GGCGGAAAAA GCCGGACTGC AGAGGGGGGA TGTTATTATC AAGTACAATG ATACACAAAT CGATAATGAA CAAAAGCTGA TTGATTGCGT TCAGAAAAGC AAGGTGGGAG ATACCGCCGT AATGGTAGTT GTCAGAAACA AAAACAATAT TTTTCTGACG GCAACTATTG AGGACAAGAA CAGCCAGTAG
Protein sequence | MVFKNRKITL TNLLLVILIF TVIAATITAK RSSAEEDTGN TAQTISMPAV GPNTIADMVD KASSAVVKIN TTVEQQVTGV NPLFSDPFFR EFFGHQYQVP SRTEVQHGIG SGFIISKEGL ILTNEHVIDG ASKIEVLLDN DKNPLTAKLV GKDKDLDLAV LKIEPTKDLP VLKLGNSDNT RVADWVVAIG NPYGLDHTVT VGVVSAKSRP VDIEDRHYKN LLQTDASINP GNSGGPLLNL KGEVIGINTA INASAQGIGF AIPSNTVQAV LNDLETGQLK HPWLGVSVQA LTQELADALG LQNTQGALVG SVSSGGPAEK AGLQRGDVII KYNDTQIDNE QKLIDCVQKS KVGDTAVMVV VRNKNNIFLT ATIEDKNSQ
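The gene sequence begins with GTG while the protein begins with M, which is what translation table 11 (bacterial, archaeal and plant plastid code) gives when GTG is used as an initiation codon. The sketch below is a minimal hand-written translator, not the database's own pipeline: the helper name `translate_table11` is hypothetical, and the input is assumed to be the coding-strand sequence as displayed (already reverse-complemented for this minus-strand gene).

```python
from itertools import product

# Codon -> amino acid map in the usual TCAG ordering (third base varies fastest).
# Table 11 shares these assignments with the standard code; it differs in
# which codons may initiate translation.
BASES = "TCAG"
AMINOS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINOS)
}

# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in the record) is ignored,
    and a recognized start codon in first position is translated as M.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence shown above.
print(translate_table11("GTGGTATTTA AAAACAGAAA AATAACTTTA"))  # MVFKNRKITL
```

The output matches the first ten residues of the protein sequence; the last fifteen bases of the gene (AAGAACAGCCAGTAG) translate to KNSQ followed by a stop, matching the protein's C-terminus.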