Gene Dtox_3000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3000 
Symbol 
ID8429990 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp3200114 
End bp3201121 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content39% 
IMG OID645035253 
ProductCRISPR-associated protein Cas1 
Protein accessionYP_003192376 
Protein GI258516154 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0143403 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAAGA GCAGTGATAC CCGGTATATT TTTTCAGCCG GAGATTTGTA TCAAAAGGAT 
TTTTCCATAG CATTCCGTAA AGAAGATGGT AATTTTTATA TTCCCATCAA GGATACGCGG
GAACTATACT GCTTTAATGA TATCACCCTG AGCACAAAGT TGCTTCAACT TTTGGCCAAA
GCCGGTATTG TGGTGCATTT CTTCGGTTAT TATGAAAATT ATATAGGGAC ATTTTATCCC
AAAGAGCAAT TATTAAGCGG GAGGCTGACG GTTGCTCAGG CTTTGGCTTA TGAGCAAAAC
AGGTTGCAGA TCGCGGGACA AATTATTAAG GGAATTGCTA AAAATACTTA TTTTGTACTT
TACCATTATT ACCGACATGG GAAAAGTGAA CTAAAAGACT TTTTGGACTG GTTGCGTAAG
GATGTTTCGC GCTTGGTCGA TTCTGTCGGC AATATTAAAC AGCTACTGCG TATTGAAGGG
GAAATATGGG CCCGGTTTTA TCAGTCATTC CGGGTATTTC TGCCCGAGTC TTTTGCTATG
AACAAAAGAG TAAAACGTCC GCCCGATAAT CCCATTAACG CACTGATATC TTTTGGTAAT
ACCTTGTTGT ACACAAAAAC CATTACGCAA ATTTTCCATA CCCACTTGAA TCAAACTATT
TCCTTCTTGC ACGAACCGGC GGAACGACGA TTTTCCTTAA GCTTGGATTT GTCGGAAGTA
TTTAAACCGG TATTGGTTTG CAAAACTATT TTTGATTGTG TAAATAATCG CAAAATTATG
GTAGAAAAGC ATTTTGACAA AAAGCTGAAT TATGCCCTGC TAAACGAGCT TGGGCGCAAA
GTGTTTATCG AGGCTTTTGA AGAACGGTTG AACCAGACAT TTGAACATCA AACCTTAAGA
AGAAGGTGTA GTTACAAACA AGCTATCAGG TTAGACGGAT ATAAGCTCAT TAAGCATATT
TTGGAAGGAA AAAACTTCGT TCCTTTTTGC ATGGAGGATA AAAAGTGA
 
Protein sequence
MGKSSDTRYI FSAGDLYQKD FSIAFRKEDG NFYIPIKDTR ELYCFNDITL STKLLQLLAK 
AGIVVHFFGY YENYIGTFYP KEQLLSGRLT VAQALAYEQN RLQIAGQIIK GIAKNTYFVL
YHYYRHGKSE LKDFLDWLRK DVSRLVDSVG NIKQLLRIEG EIWARFYQSF RVFLPESFAM
NKRVKRPPDN PINALISFGN TLLYTKTITQ IFHTHLNQTI SFLHEPAERR FSLSLDLSEV
FKPVLVCKTI FDCVNNRKIM VEKHFDKKLN YALLNELGRK VFIEAFEERL NQTFEHQTLR
RRCSYKQAIR LDGYKLIKHI LEGKNFVPFC MEDKK