Gene Information |
Locus tag | Dtox_0986 |
Symbol | |
ID | 8427925 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 1008738 |
End bp | 1009886 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 645033324 |
Product | HtrA2 peptidase |
Protein accession | YP_003190498 |
Protein GI | 258514276 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
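The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Values taken from the record for Dtox_0986.
    length = gene_length_bp(1008738, 1009886)
    print(length)                     # 1149
    print(protein_length_aa(length))  # 382
```

Under these assumptions the computed values agree with the listed gene length (1149 bp) and protein length (382 aa).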
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00462206 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGAACT GGAGGCGCAA TCTTTTGTTT GTGGCCATAG CTGCTTTTGT GGCAGGGTTG ATGTTTTCCG GTGCTTCCCT GCTGGTGCAA GATTTATCGC CCAAGGCAGA CAAATCTAAA TACTCGGCCA GTGGGTCAGC TGCAGGAGTA GGCCCTGATA CAATTGCCAA TATTGTGGAT AAGGCCGGTG CTTCAGTGGT TAAAATCAGT ACTACGGTTA CTGTTGATGT GAGAAGGCAG AATAACCCGT TTTTCAGCGA CCCGTTTTTT AGGCAGTTCT TCGGGCCAGG TTTATCTGAG CCCAGGCAGA GGCAGGAGAC AGGTTTGGGT TCCGGCTTTA TTATATCGCA GGATGGGTAT ATTGTGACTA ATGAGCACGT AATAGACGGG GCCGAGCAAA TAGAGGTTAC TATGAAGGGC AGCGATAAGC CTTCTAAAGC AACTGTGGTG GGTTCTGATT TTGATTTGGA TCTGGCGGTA ATAAAAATCG ACTCTTCAGA GAAGCTGCCG GTTTTGAAAA TGGGAGATTC AGAGCAGATA AAAGTGGGAA ACTGGGTAAT AGCTATAGGC AATCCTTATG GACTGGACCA TACTGTAACC ATCGGGGTGA TTAGTGCTAA AGGCAGGCCG GTTAATATAG AACAAAGGCA GTATAAAAAT TTGCTGCAAA CGGATGCCTC TATTAATCCC GGTAACAGCG GAGGCCCTCT CTTAAACCTG GACGGTGAAG TTGTGGGCAT AAATACAGCT ATTAATGCTG AGGCCCAGGG AATTGGCTTT GCTATTCCTA CCAGTACCGT GAAGTCTGTG CTTGATGAGT TAATTCAAAA AGGCAAGGTT GTTCATCCCT GGATGGGAGT GCAATTGCAA CCGGTTACCG AGCAAATTGC CGAATATTAT AGTTTAAAGA ATACGGATGG TGCTCTGGTA GCCGGTGTGG TAAAGGACAG CCCGGCAGAG AAAGTAGGTT TGCAGCAGGG TGATATTATC CTGGAAATTG ACGGTCAGAA AATTAAGTCT GTTGATAATT TGATAGATAT TGTAGGACAA ACTAAGGTGG GTCAAAAGCT CAAGCTTTTA GTTCACCGGG AAAAGGATTT TTATGTTAGC ATAATTGTCA ATGAGAAGCC CTCCCAGCTT ACTAAATAG
|
Protein sequence | MQNWRRNLLF VAIAAFVAGL MFSGASLLVQ DLSPKADKSK YSASGSAAGV GPDTIANIVD KAGASVVKIS TTVTVDVRRQ NNPFFSDPFF RQFFGPGLSE PRQRQETGLG SGFIISQDGY IVTNEHVIDG AEQIEVTMKG SDKPSKATVV GSDFDLDLAV IKIDSSEKLP VLKMGDSEQI KVGNWVIAIG NPYGLDHTVT IGVISAKGRP VNIEQRQYKN LLQTDASINP GNSGGPLLNL DGEVVGINTA INAEAQGIGF AIPTSTVKSV LDELIQKGKV VHPWMGVQLQ PVTEQIAEYY SLKNTDGALV AGVVKDSPAE KVGLQQGDII LEIDGQKIKS VDNLIDIVGQ TKVGQKLKLL VHREKDFYVS IIVNEKPSQL TK
|
| |
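The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how they might be cleaned up and checked, the code below strips the spacing, computes GC content, and translates DNA with the standard amino acid assignments, which translation table 11 shares with table 1. The helper names are hypothetical. The sketch does not handle alternative start codons, which is harmless here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    print(translate("ATGCAGAACT GGAGGCGCAA TCTTTTGTTT"))  # MQNWRRNLLF
```

The printed residues match the first block of the protein sequence listed in the record. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.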