Gene Dtox_4286 details


Gene Information

Locus tag: Dtox_4286
Symbol:
ID: 8431300
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 4452915
End bp: 4453907
Gene Length: 993 bp
Protein Length: 330 aa
Translation table: 11
GC content: 35%
IMG OID: 645036478
Product: CRISPR-associated protein Cas1
Protein accession: YP_003193576
Protein GI: 258517354
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
            [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype
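
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the start and end coordinates are inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 4452915
end_bp = 4453907
gene_length_bp = 993
protein_length_aa = 330

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons. Assumption: the final codon
# is a stop codon and contributes no amino acid.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)
```

Under these assumptions the span equals the listed gene length, the length is a multiple of three, and the codon count minus the stop codon equals the listed protein length.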


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGAAGA CAATATATAT TTTTTCCAGC GGGGAATTGA AGCGAAAAGG GAACACGTTA 
TATTTTGAAA CGGAAGAAAA TAGAAAATAT ATTCCGGTAG AAGATACCGG GGAAATAATG
ATTTTCGGGG AAGTTGATAT TAATAAAAAA CTCCTGGAGT TTCTATCTCA AAAAGAAATA
TTAATGCACT ATTTTAATTA TAACGGTTAT TATATGGGTA CCTTCTATCC CCGGGAGCAC
CTTAATTCGG GGTATATGAT TTTGAAACAG TCTGAATATT ATATGGATGA GAAGAAAAGG
CTCGAACTGG CACGTCAATT CGTGATAGGT GCGGGTAAAA ACATCAGACA GGTATTAAAA
TACTATAACA ACCGGGATTG TGATTTGGAG GATGGACTAA AGAGAATTGA AGAAATTTTC
ACATCTATGC CGGAAATAAT TACAATTAAC CAGTTAATGG CTCTCGAAGG TAATATGCGG
GAGCAGTATT ACAAGTGTTT TGATACTATT ACCCAAAATC AGGACTTTGT ATTTGAGGAG
CGCAGTCGCC GACCACCCCG GAATTATATG AATACTCTGG TTAGCTTCGG TAACACTTTG
ATGTATACTA CGGTATTAAG CGAAATTTAT AAAACCCATC TGGATCCTAG AATTGGTTTT
TTGCATGCAA CAAATTTTCG CCGTTTTAGT TTAAATCTTG ATGTAGCGGA GATATTTAAG
CCCATTATTA TAGATAGGCT TATTTTTAGC TTAATCCGTA AGAATATGGT AACTAAAAAG
GACTTTGATA AAAATACAGA AGGAATAATG TTAAAGGACA AGGCAAAACA AGTCTTTGTT
CAGCAAATGG ACGAAAAATT GAAAGTTACA ATAAAGCATC GTGAGTTAGG CAGGCATGTT
TCTTATCGTA GGCTAATCAG GCTGGAATTG TATAAGTTAG CCAAGCATTT AATCGACGAA
AAAAAATACG AACCATTTAT TGCTAATTGG TGA
 
Protein sequence
MEKTIYIFSS GELKRKGNTL YFETEENRKY IPVEDTGEIM IFGEVDINKK LLEFLSQKEI 
LMHYFNYNGY YMGTFYPREH LNSGYMILKQ SEYYMDEKKR LELARQFVIG AGKNIRQVLK
YYNNRDCDLE DGLKRIEEIF TSMPEIITIN QLMALEGNMR EQYYKCFDTI TQNQDFVFEE
RSRRPPRNYM NTLVSFGNTL MYTTVLSEIY KTHLDPRIGF LHATNFRRFS LNLDVAEIFK
PIIIDRLIFS LIRKNMVTKK DFDKNTEGIM LKDKAKQVFV QQMDEKLKVT IKHRELGRHV
SYRRLIRLEL YKLAKHLIDE KKYEPFIANW
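
The gene and protein sequences above can be related to each other with a short script. This is a minimal sketch, not the database's own tooling: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is written out by hand. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in the permitted start codons), and it assumes the input contains only the bases A, C, G, and T.

```python
# Codon table in TCAG order; amino acid assignments are identical
# for the standard code and translation table 11.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq):
    """Remove the spaces and line breaks used in the block layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate in frame from the first base, stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence block, copied from the record.
first_line = (
    "ATGGAGAAGA CAATATATAT TTTTTCCAGC "
    "GGGGAATTGA AGCGAAAAGG GAACACGTTA"
)
print(translate(first_line))  # MEKTIYIFSSGELKRKGNTL
print(gc_fraction(first_line))
```

The translated fragment matches the first 20 residues of the protein sequence block. Passing the full gene sequence to the same functions should give the whole protein and a GC fraction comparable to the GC content field, although the record does not state how that field was rounded.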