Gene Dtox_2981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_2981 
Symbol 
ID8429971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp3169054 
End bp3170709 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content41% 
IMG OID645035235 
ProductCRISPR-associated protein, TM1812 family 
Protein accessionYP_003192358 
Protein GI258516136 
COG category 
COG ID 
TIGRFAM ID[TIGR02221] CRISPR-associated protein, TM1812 family
[TIGR02549] CRISPR-associated DxTHG motif protein 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.124903 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGTAAGT TAATTACTTT TATCGGAACA GGTAAATATG AGGAAACTAC TTATATATTT 
GATACAAATA AAATATATAC CCGCTATTTT TCCGTATTTT CCACAGCAAT CATAAAACCT
GTTGAAGTTC TGGTAGTTAG AACCGAGGAG GCAGCTAATG AGCACTGGGG TCCTCTTTGT
CAGGAATTTT TAGAAAATGG TTTTTCTGAA CCTGTTCCGA TAGATGTACC GGAAGGAAAA
AGTACAAAGG AACTCTGGGA GATATTCAAT ACATTAATAG ATAAGGTCGA TGAACAGGAC
AGTGTGGTAT TTGATATAAC TCACGGTTTT CGATCTCTTT CCCTGGTATG CTTCCTTAGT
ATCGCCTACC TAAAATTTGT TAAGAATGTG CAAATTAAGG GTCTCTATTA TGGAGCATAT
GAAGCTAGGG ATAAAGAAAA AAATATTACT CCGGTTTTTG ATTTAACTGA TTTTTGCTCA
CTGCTTGATT GGATTGTTGG TGTAAATTCT TTTATCCAGC ACGGCAGTGC CCGGGAAATC
AGCAGTCTCC TGGTAATGGC TCAAAAAAAA GCTAAGGAAA AGCAGGGCCC GGCGCGTCGG
GAGCTAAATA GTTTTGGTAT GTTAATTGAA GATATTTCCC GGGCTTTGTT TACCACCAGA
CCTTTTGAAC TGGTCAACAA GGCTCGCAAT TTAAAATATT ATAAAGAGGG AACTGAGCAA
AGAAATATTC TGGAACGTGA TGTCTGGGAG TGGGCTGCTC CTTTTGGTGT CTTAATGGAT
AAAATCATAG AAGAGTATTA TGCCTTTGCA GGGGTGGAAG ATCATTATAA CCCTCTCAAT
CTTGAAAGGC ACCTGGCAGC GGTCCGCTGG TATGTTGAGC ATAATTATGC TCCTCAAGCA
CTATCAATGA TGAGAGAACT GGTAATATCA TTTACTATGT GCCAGAAAGG AGAGTACGGG
AAGGCTTTTG ATCGGGGTAT GCGTGAACTG ACAGCAGGAC AGCTGTACAG GGCTAAGGAA
AATAGTCTCT CAGGAAAGCT GTGGTCTAAG CTGTCTGACC CACGCAATGA TGTAATCCAT
GCTGGCTGGA GAACCAATCC CAGGTCATCC CATAAGGTAA TGCAGGAAAC CAGTGAATGT
CTTGCATTAT TGGAGGAATT ATTCAGTGAA GAAGGGCTGA TAGTTTGTCA CGAGATACCT
GATTCAGATG GAGAAGCTTC AAACTTAAAG GTTCTTATAA CACCTTTGGG AATGTCTCCG
GGACTTTTAT ATACCGCTAT AAATCACATT GAGCCGGATC GTATTTTGGT ACTCACGTCT
AAAGAGGGTA GGGCATTGCT GACAGAGATA ACAACTCAAG CAGGTTATAG CGGAGGAATA
GAAGTGGTAG AAGTGAAAGA TCCTTTTGCC GGATTCAATG AATTATCCGA AGTAATAGAA
AAAGTTATGG GTTATTTGGA AATGCTGCCG CCACACAAAA TTTACATAAA CTTGGCCGGG
GGTACTACTT TGCTGCAATA TATAACAACC CGTATAGCCG GGCTTAAGGT AGATAATTGT
GAGGAGTTAG TTAATGTAGT CATGATAGAT AAAAGACCGG TTCGTGAACA GCAGGAAAAC
CCCTATGTTA TGGGTGATAT GTTAGTTGTT GAATAA
 
Protein sequence
MCKLITFIGT GKYEETTYIF DTNKIYTRYF SVFSTAIIKP VEVLVVRTEE AANEHWGPLC 
QEFLENGFSE PVPIDVPEGK STKELWEIFN TLIDKVDEQD SVVFDITHGF RSLSLVCFLS
IAYLKFVKNV QIKGLYYGAY EARDKEKNIT PVFDLTDFCS LLDWIVGVNS FIQHGSAREI
SSLLVMAQKK AKEKQGPARR ELNSFGMLIE DISRALFTTR PFELVNKARN LKYYKEGTEQ
RNILERDVWE WAAPFGVLMD KIIEEYYAFA GVEDHYNPLN LERHLAAVRW YVEHNYAPQA
LSMMRELVIS FTMCQKGEYG KAFDRGMREL TAGQLYRAKE NSLSGKLWSK LSDPRNDVIH
AGWRTNPRSS HKVMQETSEC LALLEELFSE EGLIVCHEIP DSDGEASNLK VLITPLGMSP
GLLYTAINHI EPDRILVLTS KEGRALLTEI TTQAGYSGGI EVVEVKDPFA GFNELSEVIE
KVMGYLEMLP PHKIYINLAG GTTLLQYITT RIAGLKVDNC EELVNVVMID KRPVREQQEN
PYVMGDMLVV E