Gene Dtox_0986 details


Gene Information

Locus tag: Dtox_0986
Symbol:
ID: 8427925
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 1008738
End bp: 1009886
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 45%
IMG OID: 645033324
Product: HtrA2 peptidase
Protein accession: YP_003190498
Protein GI: 258514276
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
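
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the source record. It assumes that the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Dtox_0986.
length = gene_length(1008738, 1009886)
print(length)                  # 1149
print(protein_length(length))  # 382
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields of the record.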


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.00462206
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAGAACT GGAGGCGCAA TCTTTTGTTT GTGGCCATAG CTGCTTTTGT GGCAGGGTTG 
ATGTTTTCCG GTGCTTCCCT GCTGGTGCAA GATTTATCGC CCAAGGCAGA CAAATCTAAA
TACTCGGCCA GTGGGTCAGC TGCAGGAGTA GGCCCTGATA CAATTGCCAA TATTGTGGAT
AAGGCCGGTG CTTCAGTGGT TAAAATCAGT ACTACGGTTA CTGTTGATGT GAGAAGGCAG
AATAACCCGT TTTTCAGCGA CCCGTTTTTT AGGCAGTTCT TCGGGCCAGG TTTATCTGAG
CCCAGGCAGA GGCAGGAGAC AGGTTTGGGT TCCGGCTTTA TTATATCGCA GGATGGGTAT
ATTGTGACTA ATGAGCACGT AATAGACGGG GCCGAGCAAA TAGAGGTTAC TATGAAGGGC
AGCGATAAGC CTTCTAAAGC AACTGTGGTG GGTTCTGATT TTGATTTGGA TCTGGCGGTA
ATAAAAATCG ACTCTTCAGA GAAGCTGCCG GTTTTGAAAA TGGGAGATTC AGAGCAGATA
AAAGTGGGAA ACTGGGTAAT AGCTATAGGC AATCCTTATG GACTGGACCA TACTGTAACC
ATCGGGGTGA TTAGTGCTAA AGGCAGGCCG GTTAATATAG AACAAAGGCA GTATAAAAAT
TTGCTGCAAA CGGATGCCTC TATTAATCCC GGTAACAGCG GAGGCCCTCT CTTAAACCTG
GACGGTGAAG TTGTGGGCAT AAATACAGCT ATTAATGCTG AGGCCCAGGG AATTGGCTTT
GCTATTCCTA CCAGTACCGT GAAGTCTGTG CTTGATGAGT TAATTCAAAA AGGCAAGGTT
GTTCATCCCT GGATGGGAGT GCAATTGCAA CCGGTTACCG AGCAAATTGC CGAATATTAT
AGTTTAAAGA ATACGGATGG TGCTCTGGTA GCCGGTGTGG TAAAGGACAG CCCGGCAGAG
AAAGTAGGTT TGCAGCAGGG TGATATTATC CTGGAAATTG ACGGTCAGAA AATTAAGTCT
GTTGATAATT TGATAGATAT TGTAGGACAA ACTAAGGTGG GTCAAAAGCT CAAGCTTTTA
GTTCACCGGG AAAAGGATTT TTATGTTAGC ATAATTGTCA ATGAGAAGCC CTCCCAGCTT
ACTAAATAG
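
The record reports a GC content of 45% for this gene. As an illustration of how such a figure could be recomputed from a sequence laid out in space-separated blocks of 10, here is a minimal sketch. The function name `gc_content` is hypothetical, and how the source database rounds the value is not stated in the record.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (spaces and newlines used for block layout) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Toy input in the same block layout as the gene sequence above.
print(gc_content("ATGC GGCC"))  # 0.75
```

To apply it to this gene, pass the full gene sequence text as a single string, block spacing and line breaks included.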
 
Protein sequence
MQNWRRNLLF VAIAAFVAGL MFSGASLLVQ DLSPKADKSK YSASGSAAGV GPDTIANIVD 
KAGASVVKIS TTVTVDVRRQ NNPFFSDPFF RQFFGPGLSE PRQRQETGLG SGFIISQDGY
IVTNEHVIDG AEQIEVTMKG SDKPSKATVV GSDFDLDLAV IKIDSSEKLP VLKMGDSEQI
KVGNWVIAIG NPYGLDHTVT IGVISAKGRP VNIEQRQYKN LLQTDASINP GNSGGPLLNL
DGEVVGINTA INAEAQGIGF AIPTSTVKSV LDELIQKGKV VHPWMGVQLQ PVTEQIAEYY
SLKNTDGALV AGVVKDSPAE KVGLQQGDII LEIDGQKIKS VDNLIDIVGQ TKVGQKLKLL
VHREKDFYVS IIVNEKPSQL TK
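
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is an illustration, not the database's own method. It builds the codon-to-amino-acid mapping, which table 11 shares with the standard code for elongation codons, and translates until the first stop codon. It assumes an ATG start codon, as is the case for this gene, and does not handle the alternative start codons that table 11 permits. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order; table 11 uses the same amino acid
# assignments as the standard code ('*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon.

    Whitespace used for block layout is ignored. Alternative start codons
    are not converted to M (assumption: the CDS starts with ATG).
    """
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGCAGAACT GGAGGCGCAA TCTTTTGTTT"))  # MQNWRRNLLF
print(translate("ACTAAATAG"))                          # TK
```

Both fragments match the corresponding start and end of the protein sequence listed above.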