Gene Dtox_3203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3203 
Symbol 
ID8430197 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp3404481 
End bp3405506 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content47% 
IMG OID645035449 
Productmembrane-associated zinc metalloprotease 
Protein accessionYP_003192568 
Protein GI258516346 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0750] Predicted membrane-associated Zn-dependent proteases 1 
TIGRFAM ID[TIGR00054] RIP metalloprotease RseP 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000432825 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000345421 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCACAT TTTTTGCGTC CGTATTTGTC TTTGCTATGT TGATATTTTT TCATGAGCTG 
GGGCACTTTG CTGTAGCAAA ATTAGCAGGT ATTAAGGTTC ATGAATTCAG TGTGGGCTTC
GGCCCCAAGC TTTTTGGCAA ACTACACGGT GAAACTACTT ATAACCTGAG ACTTTTGCCG
CTGGGAGGTT TTGTCCGTAT GGCCGGCATG GATCCTGCGG ATGAAGCGGA TTATGCTGAT
GAGAGGGCTT TTAACAAGAA ATCCATCCTC CAGCGTATGG CGGTAATCTT TGCCGGGCCG
CTGATGAATT TTTTTCTGGC AGCTCTGCTT TTGGCCTTTA TATTTATGGC TCAGGGTTAT
CCCGCCGGTA CCACCACCGG TGTGGATAAG GTGCTGCCCG GTTATCCGGC GGAAAAGATT
GGCCTGGTAT CGGGCGATAA AATTGTGGCT ATTGATGGCC GCAGCATGGA TAGCTGGGAG
CAGGTGGCTG AATATATTAA CCAGCGCCCG GATAAGCAAA TTGTTATTAC GGTAGAAAGA
GATGCGGCCA AGCGCAGCTT TGATATAGTT CCGGTTAAAG ATGAAAGCGG TCATGGCAAG
ATCGGCATTT ATCCCGCACA GGAAATGAAG AAGATGGGTT TTTTTACCGC TCTCTATTCC
GGTGCTGAGT ATACAGTTAA GGCAACCTGG TTTATAATTA GTTTTATTGG CAAGATGTTT
GTGCATGAAG CTCCTGTTGA TTTAGGCGGG CCGGTCAGGG TTGTTTGGGA AATCGGTCAG
GCGGCTAATA CTGGCTTTTA CCACCTGCTG CAGCTTGCTG CTTTCCTGAG TATTAACCTG
GGTCTTTTTA ACTTGTTTCC TATTCCTGCC TTAGACGGCA GCAGGGTGGT TTTCTTGTTC
TGGGAAGCGC TGCGCGGCAA ACCGGTGGAT CCCTCCAGGG AGAGCTTTAT TCACCTGGTT
GGTTTTGTCC TGCTGCTGGT TTTGATGGTG GTCATTACTT ATAATGATTT GTTGAATTTA
TTGTAA
 
Protein sequence
MSTFFASVFV FAMLIFFHEL GHFAVAKLAG IKVHEFSVGF GPKLFGKLHG ETTYNLRLLP 
LGGFVRMAGM DPADEADYAD ERAFNKKSIL QRMAVIFAGP LMNFFLAALL LAFIFMAQGY
PAGTTTGVDK VLPGYPAEKI GLVSGDKIVA IDGRSMDSWE QVAEYINQRP DKQIVITVER
DAAKRSFDIV PVKDESGHGK IGIYPAQEMK KMGFFTALYS GAEYTVKATW FIISFIGKMF
VHEAPVDLGG PVRVVWEIGQ AANTGFYHLL QLAAFLSINL GLFNLFPIPA LDGSRVVFLF
WEALRGKPVD PSRESFIHLV GFVLLLVLMV VITYNDLLNL L