Gene Dtox_2974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_2974 
Symbol 
ID8429964 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp3159784 
End bp3161421 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content41% 
IMG OID645035228 
ProductCRISPR-associated protein Cas1 
Protein accessionYP_003192351 
Protein GI258516129 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR00372] CRISPR-associated protein Cas4 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGAAA TGTGTTGCGA TTCACAGTAT AAATATCTAC CTGTTTCAGC TGTAGCGGAA 
ATTCTTTATT GTCCGAGAAA CTTTTATTAT CGTATTGTGG AGGGCGCTAA AGAGTATAAC
GCTCACCTTT TAGAAGGGCA ACTGCAAGAA GAAAAACGGA ACAATCGTTT AGCAATAAGC
AGAGAAGAAT ACCAACAGAA TAAGTCTGTT ATGGTCTCTT CTGAAAGACT GCGGCTAATC
GGAGTATTGG ATGCGGTTGA AGAAGGAGAG GATATTTATC CTGTAGAGTA TAAAAAAGGT
GAACTCAAGG AAAGTCTGAA TGATGACGTT CAGGTTTGTG CTCAGGCCAT GCTTTTGGAG
GAAAAATTAG GTCGGGAAAT TCTCCGTGGT TACATATATT ACAGCCAATC ACATGCCCGC
CGTGTAGTGG TCTTTGATCG ATCACTGCGT GAATTGGTGG AGAATACGAT TAAAAGGGCT
TATGAAATAA TCTATTCGGG GCAAATACCT CAGCCGCTGG CAGATTTCCG CTGTTATGGC
TGTTCTCTTG CCGGTCGCTG TTTGCCTTAT GAGGTGAATT ATCTAACCGG AGCAGATGAT
AAAACACCTG CCCGCCCACT ACCGTCATTA AATCTTGGCA GGGTATTATA TGTTGATGAA
CCGGGTGCAT ATGTACGTAA GAAAGGTGAA CGAGTTCAGG TGACCAGAGA TAAAGAAGTG
CTGGTTGATA TACCCTTGTG TAATCTGGAG CAATTGGTGC TGGCCGGTAC AGTTAATATA
TCTGCGCAGG TAATCAAGCT TTTACTTGAT AGAGGAACAG AAGTGCATTT CGTTTCACGC
GCAGGTAAGT ATTACGGTTC ACTCCAGCCG GCACTGACGA AAAATTCTGC TCTGCGCATA
GCACAACATA AAGCATATCA GGATATGGAA TTGCGCTTAA AGTATGCCGT TCTTTTTGTG
CAGGGAAAGC TGGCCAATAT GCGTACAATA CTGCTTCGTT ATAATAGGGA TCTTAAAGAA
AAACAATTAG AAGAAGCTAT TTGTAGACTT AAGTCTTTGA GCAAAAATTT ATATAAAGCA
GATTCATTAA ATAGCTTAAT GGGTATAGAA GGTGCTGCTA CCCGTGAATA TTTCAGAGTT
TTTAATTACA TGATCAAGCA GCATGTGCCC TTCAATTTTC AACAGCGCAG CAGGCGACCT
CCGGGAGACC CTGTGAATGC TTTGCTTAGT TTTGCTTATA CTCTTTTGAC TAAAGACATG
ATTGCATCTG TGTCTATCGT GGGCTATGAT CCGTATATTG GTTTTTTGCA CCGCTCGGAT
TATGGCAGAC CAGCATTGGC ACTGGATTTT ATAGAAGAAT TTCGGCCAAT TGTTGCAGAT
TCAGTTGTTT TGACCGTTTT AAACAAGGGT ATGATAAATA CCGATGATTT TGAATACAAA
ATGGGTGGTT GTTTTTTAAA CGACAGTGGT CGCAAGAAAT TTTACCGTGC CTATGAAGAA
CGGAGGCATG AGATGATCTC ACATCCGTTG TTTGGTTACA GAATATCTTA TATGCGTGTT
TTTGAACTGC AAGCCCGCTT TTTTGCTAAG GTATTAAGGG GGGAATTGAA TGAATACAAG
CCTTTTATGG TGAGGTGA
 
Protein sequence
MDEMCCDSQY KYLPVSAVAE ILYCPRNFYY RIVEGAKEYN AHLLEGQLQE EKRNNRLAIS 
REEYQQNKSV MVSSERLRLI GVLDAVEEGE DIYPVEYKKG ELKESLNDDV QVCAQAMLLE
EKLGREILRG YIYYSQSHAR RVVVFDRSLR ELVENTIKRA YEIIYSGQIP QPLADFRCYG
CSLAGRCLPY EVNYLTGADD KTPARPLPSL NLGRVLYVDE PGAYVRKKGE RVQVTRDKEV
LVDIPLCNLE QLVLAGTVNI SAQVIKLLLD RGTEVHFVSR AGKYYGSLQP ALTKNSALRI
AQHKAYQDME LRLKYAVLFV QGKLANMRTI LLRYNRDLKE KQLEEAICRL KSLSKNLYKA
DSLNSLMGIE GAATREYFRV FNYMIKQHVP FNFQQRSRRP PGDPVNALLS FAYTLLTKDM
IASVSIVGYD PYIGFLHRSD YGRPALALDF IEEFRPIVAD SVVLTVLNKG MINTDDFEYK
MGGCFLNDSG RKKFYRAYEE RRHEMISHPL FGYRISYMRV FELQARFFAK VLRGELNEYK
PFMVR