Gene Dtox_3560 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3560 
Symbol 
ID8430562 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp3759766 
End bp3761106 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content41% 
IMG OID645035784 
ProductPeptidoglycan-binding LysM 
Protein accessionYP_003192895 
Protein GI258516673 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1388] FOG: LysM repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTATAA AAAATAAACT CAAAAAAAGT GCTGCAGGGA CTTTAGCCGG TATTTCACTA 
TTCATTTGCA GTACCGCCGG ACTGGCTGCA ACAAATTATC CGACTTATAA TGCTATAAGC
GGAGATACCC TATGGATAAT GTCCAATAAA TTATATACGT CCAGTGACAA AATTGCTTCT
GTGAACAGCA TTAATTCCGA TTCTATTATG GCAGGCCAAA GACTGGTAAT ACCCCAAAGT
AATATTTATA AAGTAGTATC GGGTGATACT CCTTACTTAA TAGCTAAGAG ATTTGGCATA
TCACTGCAGG TTTTTTTAAC TGCGAACAAC ATGACATCCT CAGACGTGCT CTATCCCGGA
CAGCAAGTAA ATCTACCGGG AATTCTTGCA TACAAGGTTG CCAATAAGGA TACCCTAAGC
CTCCTGGCCA CACAATTCGG AACAACCATT AATCAACTTA CTAAAATCAA TAACTTAACT
GATACCAATA TTATTACAGA TCAAAAAATA TATATACCGG TGGCTGAAAC CAAACAATAC
ACTGTTCAAT CAGGAGACAG CCTTTATCTA ATAGCTCAAA AATACAACTT ATCAGTCAGT
GATCTAACTA ATGTTAATTG GCTTAGTTCT ACAAACCTGA AAGCAGGGCA AATACTCATA
ATTCCAGGTA AAACGTCGAC AACAACGTCA GCAGCAACAA GCAGCAGCGG CCAATCGCAG
GCAACACTGT GGAATATCCC TTCCGGTGCC TTGCTTTATC ATGTTCAGGA AGGAGACAAC
CAGTGGAGTA TCGCCCAAAA ATATAATACC ACTACGGAAG CCATTAACAA AACAAATAAT
ATTAAAATTG ATCTCATACT TCCTGAACAG GCACTTTTTG TACCTAAAAA CTCCACACAA
CCTATATATG GTATTAAGTG CCCTTCTGTA AAAGCTAAAA CAGGTTACGG GGAACTTTTG
GATTGGGAAT ATGTCAACTG GTTTTTCAAC CCGGGAAGCA CAGCTGTAAT TGAGGATTTA
CAAACCGGTA TAAAATTTAA GGCCCACCGG ATCGGCGGCT CCAACCACGC TGATTGCGAA
CCCTTAAGCG CTGATGATAC CGCGATTATG AAAGGAATAT TTGGCGGTCA GTGGAACTGG
TCAACCAGAC CTGTGCTGGT TCGTTTTGAA GGAAGGGTCT TGGCCGCATC TATGGCCGGC
ATGCCGCATT CCTTTGACAC CCTTAGCAAT AACGCTTTTT ACGGTATGTT TGATCTTCAC
TTTTTAAACA GCCGCACCCA CAACACAAAT ACAATCGACC CCGATCACCA GGCATCAGTG
CGAAAAGCTG CCGGTTATTA A
 
Protein sequence
MSIKNKLKKS AAGTLAGISL FICSTAGLAA TNYPTYNAIS GDTLWIMSNK LYTSSDKIAS 
VNSINSDSIM AGQRLVIPQS NIYKVVSGDT PYLIAKRFGI SLQVFLTANN MTSSDVLYPG
QQVNLPGILA YKVANKDTLS LLATQFGTTI NQLTKINNLT DTNIITDQKI YIPVAETKQY
TVQSGDSLYL IAQKYNLSVS DLTNVNWLSS TNLKAGQILI IPGKTSTTTS AATSSSGQSQ
ATLWNIPSGA LLYHVQEGDN QWSIAQKYNT TTEAINKTNN IKIDLILPEQ ALFVPKNSTQ
PIYGIKCPSV KAKTGYGELL DWEYVNWFFN PGSTAVIEDL QTGIKFKAHR IGGSNHADCE
PLSADDTAIM KGIFGGQWNW STRPVLVRFE GRVLAASMAG MPHSFDTLSN NAFYGMFDLH
FLNSRTHNTN TIDPDHQASV RKAAGY