Gene Dtox_3964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3964 
Symbol 
ID8430979 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4148228 
End bp4149358 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content47% 
IMG OID645036182 
ProductPeptidase M23 
Protein accessionYP_003193280 
Protein GI258517058 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value1.73675e-08 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value4.27958e-09 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAGGTTGG TCAAAAAGGT AATTGCTTGG GGCCTGACTG TTGTGCTGTT GGGTGCGGCT 
ATTGGTACTG CCGGTGCTAA AACGCTGGAA CAACAGCTGA AGGATACCCG TGCGCAGATA
CAGCAGACAC AGAAGGGTGT CAATGAAAAT AAATCAGATA TCAAGAACTT TACCTCTCAA
TTAGCATCAA TCAACCAGTC TATCAACCAG GTTAGCGTTG AAATCAGGGA GTTGGAAGGT
AAATTAGGTG TTACCCAGAA CAAGCTGCAG GGTATCATTA ACGAGCTTAG AAAGGTAGAG
GCCAGGCTTG CGGAGACACA AGAGGTTTTT AATACAAGGA TTAAGAATAT ATATGTAAAC
GGGAATGTCA GCTATCTGGC TGTTCTTCTT GACTCCCAGG ATTTCGGTGA TTTTGTAAAT
CGCTATGAAA TGTTAAAACG AGTGGCGGCC CGTGACGCGG CTATAGTTGA GCAGGTCGAA
AATGATCGCA AGCTTATTGT CAGCCAGAAG GCGGAAGTGG AAAAAGAGCG CAATCGCATT
TATGAGCTAA AGAGGCGGCA GGAAGACGCG AAGCATGCTT TGACTGCCCG TCAGGCTGAA
CGGGAGACGC TTCTGATCGA GGCCAATAAG GATTTGGCTA AGAAAGAAGC GGAAATGGAT
GCTCTGGAGG CTAAAGAACA GGAAATCATA AGGCAGATTG CCATTAGGAG CGCGCAGCAG
AATAAAGATA TTAAGAAATA TACCGGTCAG TTTATTTGGC CTGTGTCCGG CTATACCAGT
ATATCTTCTC CTTTCGGTTA TCGCAAGCAC CCGGTTTTAG GTACTGCGAA ATTTCATTCC
GGTATAGATA TTCCCGCGCC AAACGGAACC AGTGTAATGG CGGCTCAGAG TGGAACTGTT
ATTCAAGTGG GCAGCATGAC CGGTTACGGC AATATAGTGA TGATTAACCA CGGGGGCGGA
TTAATCACTC TGTACGCTCA CTTATCGCGA CAGCTTGTAA GCAGCGGCCA GCAGGTTACC
CGGGGACAAG TTATAGCCAA GGTAGGCAGC ACCGGAATGT CTACAGGCCC GCACCTGCAT
TTTGAAGTAA GACTCAACGG TTCCGCAGTC AACCCGATGG GGTATTTGTA A
 
Protein sequence
MRLVKKVIAW GLTVVLLGAA IGTAGAKTLE QQLKDTRAQI QQTQKGVNEN KSDIKNFTSQ 
LASINQSINQ VSVEIRELEG KLGVTQNKLQ GIINELRKVE ARLAETQEVF NTRIKNIYVN
GNVSYLAVLL DSQDFGDFVN RYEMLKRVAA RDAAIVEQVE NDRKLIVSQK AEVEKERNRI
YELKRRQEDA KHALTARQAE RETLLIEANK DLAKKEAEMD ALEAKEQEII RQIAIRSAQQ
NKDIKKYTGQ FIWPVSGYTS ISSPFGYRKH PVLGTAKFHS GIDIPAPNGT SVMAAQSGTV
IQVGSMTGYG NIVMINHGGG LITLYAHLSR QLVSSGQQVT RGQVIAKVGS TGMSTGPHLH
FEVRLNGSAV NPMGYL