Gene Dtox_2979 details


Gene Information

Locus tag: Dtox_2979
Symbol:
ID: 8429969
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 3165891
End bp: 3167762
Gene Length: 1872 bp
Protein Length: 623 aa
Translation table: 11
GC content: 43%
IMG OID: 645035233
Product: CRISPR-associated protein, Crm2 family
Protein accession: YP_003192356
Protein GI: 258516134
COG category: [R] General function prediction only
COG ID: [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs)
TIGRFAM ID: [TIGR02577] CRISPR-associated protein, Crm2 family
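
The start, end, gene length, and protein length fields in this record are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is an untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3165891
end_bp = 3167762

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is assumed to be a stop
# codon, which does not contribute an amino acid.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1872 0 623
```

Under these assumptions the computed values agree with the Gene Length (1872 bp) and Protein Length (623 aa) fields.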


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.113889
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAAGA AAATTCTTCA TTTTACACTC GGCCCGGTAC AGAGTTTTGT CGCCCAGGCC 
CGGCGGACCC GCGATCTCTG GGCCGGCTCA TTTATACTCT CATATCTGGC CGGACAGGCT
ATGTATTTAG TACACAGAGC CGGGGGAAAG ATAATTTTTC CTGCTGTATA CGATAATAAC
GGGATTATAA ATGATCCACT GCTGAATGTA ATAGCCGACA AAAATGAGGG AAAGCCGATA
AGTAAAAAAC CTGTTGCCGG TACTTTACCC AATCGCTTTA AGGCTGAGAT TCCCGGTGAA
TTTAAGCCTG TTGAATGCAG GCAGGCAGTA ATAAGTGCCT GGAGAGAAAT AGCCGGTGCA
GTATGGAATA AGTATATAGA ACCAGTGGCT ATCTATGGGA AAGGTACTAA AGAAATATGG
GAGCGCCAGG TAAATAATTT TTGGGAAGTA TCCTGGGTGG TTAGTGATAA TGCAGAAAGT
GATGATTTAC TTGACCGGAG GAAAAATTGG CGCAGTTATG TGCAGGCTGA GGAACCTGGT
GATAAATGTA CTATCATGGG TAATCTTCAG GAACTTTCAG GTCATACCCG TTCCTTAAAT
AGTAAAAAAC AGGATGAATT TTGGGGGGAA CTGCGTAAAA AAGTTAAAGC GTTGGAGTTA
CGAAAAGATG AGCGGCTCTG TGCCGTGACT ATAATCAAAC GACTATTCCC CTTTACTGCA
AAAACAGTAT TGGGGTGGGA TGTCCCAACA AGTTACCCCT CAAATTCGAA TATCTCGGCA
GTTCACTGGG TAGAAAAGGT AATGCAAAAT GATACTGAAA AGGTTGTAAA TTTTTTAGAA
TATTCGCGCA ATTGCGGCCT GGCTCCGGAA GGCGGTAAAC AGTCAGCCCA AATTTACTGC
TTGCGGCAGA TGCAAAAAGA AAAACCGGAA ATAGCGGCGT TTGCAAAATT AAACGGCAAT
TTGTTTTTTG AAAGCACTTT ACAAAATGAT AATCTCTGGG ATGAAAGTAT TAAGGAGCAG
AGAAAACATT TAACGAAGTT ACTGAAGGAA TTTAAGCAGT CGCCCGAACC TTTTTATGCC
GTTCTAATAA TGGACGGCGA CTCTTTAGGG AATTTGCTGC GAATTAATGA CAGTGCGGAA
GTGAGCCAGG CGTTGTCACG TTTCAATGAC ATGGTTCCGA AAATACTGGA ACAAAATAAC
GGTGTCCTGG TTTATGCCGG CGGTGATGAT GTTCTGGCTC TTTTACCACT GGAAGATGCT
ATCAAGACAG CAGTAGAAAT ATCCAGGGCT TATTCCAATA GTTTTAGTGG TAGATTGGCA
AATGGGGAGG CTACAATTTC TGCGGGTATT GTTTATGCCC ACAACCACGC ACCCTTAAAG
TCCATATTAC AAGAAGCACA CCGATTGCTG GATGAAATAG CAAAAGAAGA AACCGGGCGC
TCCAGCCTGG CTGTTAGTGT CTGGAAAACT GCCGGCCGGG TATTACAGTG GTCTGCTCCC
TGGGAAGTAA TAACAAGTGG GGATAGTTGT TTGGTGGATG AATTGGTAGA GGATTTTCGG
GGAAAGTCCA GTTATGAGGA TCAATACAAC AGCTCATTCT TTTATAACCT TCGCAAGCGT
TTTGATTGGT TGTCCGGAGA CAATAATATT TTAACTTTTG AGGAAGCAGT TGACCTGTTG
GCAGCGGAAT ACCTTAAAAA CCGCGGGCGT GAAAATATTG CTGAGGCTAC AGCAAAAAGC
CGGGTCGAAA AATTATTAAT GTTTTGTCGA AGGTTTTTAC GCGGGCCTGA TGGAGTACAG
TCTTTAAATA AACTGAATGT AGACGGGGCC CTTTTAGTTC GTTTTTTGGC CGAAAAGGGG
GTTGTGGAAT GA
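
The gene sequence is printed in blocks of ten bases, so the whitespace has to be removed before any computation. The sketch below shows one way a GC fraction such as the GC content field could be derived from a blocked listing. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used, so the result is not the figure for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGAGTAAGA AAATTCTTCA TTTTACACTC GGCCCGGTAC AGAGTTTTGT CGCCCAGGCC"
seq = clean_sequence(first_line)

print(len(seq), round(gc_fraction(seq), 3))
```

To obtain a value for the full gene, the whole listing would be passed to `clean_sequence` in the same way.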
 
Protein sequence
MSKKILHFTL GPVQSFVAQA RRTRDLWAGS FILSYLAGQA MYLVHRAGGK IIFPAVYDNN 
GIINDPLLNV IADKNEGKPI SKKPVAGTLP NRFKAEIPGE FKPVECRQAV ISAWREIAGA
VWNKYIEPVA IYGKGTKEIW ERQVNNFWEV SWVVSDNAES DDLLDRRKNW RSYVQAEEPG
DKCTIMGNLQ ELSGHTRSLN SKKQDEFWGE LRKKVKALEL RKDERLCAVT IIKRLFPFTA
KTVLGWDVPT SYPSNSNISA VHWVEKVMQN DTEKVVNFLE YSRNCGLAPE GGKQSAQIYC
LRQMQKEKPE IAAFAKLNGN LFFESTLQND NLWDESIKEQ RKHLTKLLKE FKQSPEPFYA
VLIMDGDSLG NLLRINDSAE VSQALSRFND MVPKILEQNN GVLVYAGGDD VLALLPLEDA
IKTAVEISRA YSNSFSGRLA NGEATISAGI VYAHNHAPLK SILQEAHRLL DEIAKEETGR
SSLAVSVWKT AGRVLQWSAP WEVITSGDSC LVDELVEDFR GKSSYEDQYN SSFFYNLRKR
FDWLSGDNNI LTFEEAVDLL AAEYLKNRGR ENIAEATAKS RVEKLLMFCR RFLRGPDGVQ
SLNKLNVDGA LLVRFLAEKG VVE
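
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates this on the first and last blocks of the gene. It builds the codon table from the NCBI ordering of bases (`TCAG`) and the matching amino acid string. Table 11 assigns the same amino acids to internal codons as the standard code and differs only in permitted start codons, which does not matter here because the gene begins with ATG. The function name `translate` is illustrative.

```python
from itertools import product

# NCBI genetic code layout: codons enumerated with bases in the order TCAG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a blocked DNA listing in frame, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 bases of the gene sequence, copied from the listing above.
gene_start = "ATGAGTAAGA AAATTCTTCA TTTTACACTC"
gene_end = "GTTGTGGAAT GA"

print(translate(gene_start))  # MSKKILHFTL
print(translate(gene_end))    # VVE
```

Both results match the first block (MSKKILHFTL) and the final residues (VVE) of the protein sequence listed above.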