Gene Information |
Locus tag | Dtox_2974 |
Symbol | |
ID | 8429964 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | - |
Start bp | 3159784 |
End bp | 3161421 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 645035228 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_003192351 |
Protein GI | 258516129 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR00372] CRISPR-associated protein Cas4 |
|
|
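The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3159784
end_bp = 3161421

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon per residue, plus a stop codon that is not counted.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1638, matching "Gene Length"
print(protein_length_aa)  # 545, matching "Protein Length"
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.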
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGAAA TGTGTTGCGA TTCACAGTAT AAATATCTAC CTGTTTCAGC TGTAGCGGAA ATTCTTTATT GTCCGAGAAA CTTTTATTAT CGTATTGTGG AGGGCGCTAA AGAGTATAAC GCTCACCTTT TAGAAGGGCA ACTGCAAGAA GAAAAACGGA ACAATCGTTT AGCAATAAGC AGAGAAGAAT ACCAACAGAA TAAGTCTGTT ATGGTCTCTT CTGAAAGACT GCGGCTAATC GGAGTATTGG ATGCGGTTGA AGAAGGAGAG GATATTTATC CTGTAGAGTA TAAAAAAGGT GAACTCAAGG AAAGTCTGAA TGATGACGTT CAGGTTTGTG CTCAGGCCAT GCTTTTGGAG GAAAAATTAG GTCGGGAAAT TCTCCGTGGT TACATATATT ACAGCCAATC ACATGCCCGC CGTGTAGTGG TCTTTGATCG ATCACTGCGT GAATTGGTGG AGAATACGAT TAAAAGGGCT TATGAAATAA TCTATTCGGG GCAAATACCT CAGCCGCTGG CAGATTTCCG CTGTTATGGC TGTTCTCTTG CCGGTCGCTG TTTGCCTTAT GAGGTGAATT ATCTAACCGG AGCAGATGAT AAAACACCTG CCCGCCCACT ACCGTCATTA AATCTTGGCA GGGTATTATA TGTTGATGAA CCGGGTGCAT ATGTACGTAA GAAAGGTGAA CGAGTTCAGG TGACCAGAGA TAAAGAAGTG CTGGTTGATA TACCCTTGTG TAATCTGGAG CAATTGGTGC TGGCCGGTAC AGTTAATATA TCTGCGCAGG TAATCAAGCT TTTACTTGAT AGAGGAACAG AAGTGCATTT CGTTTCACGC GCAGGTAAGT ATTACGGTTC ACTCCAGCCG GCACTGACGA AAAATTCTGC TCTGCGCATA GCACAACATA AAGCATATCA GGATATGGAA TTGCGCTTAA AGTATGCCGT TCTTTTTGTG CAGGGAAAGC TGGCCAATAT GCGTACAATA CTGCTTCGTT ATAATAGGGA TCTTAAAGAA AAACAATTAG AAGAAGCTAT TTGTAGACTT AAGTCTTTGA GCAAAAATTT ATATAAAGCA GATTCATTAA ATAGCTTAAT GGGTATAGAA GGTGCTGCTA CCCGTGAATA TTTCAGAGTT TTTAATTACA TGATCAAGCA GCATGTGCCC TTCAATTTTC AACAGCGCAG CAGGCGACCT CCGGGAGACC CTGTGAATGC TTTGCTTAGT TTTGCTTATA CTCTTTTGAC TAAAGACATG ATTGCATCTG TGTCTATCGT GGGCTATGAT CCGTATATTG GTTTTTTGCA CCGCTCGGAT TATGGCAGAC CAGCATTGGC ACTGGATTTT ATAGAAGAAT TTCGGCCAAT TGTTGCAGAT TCAGTTGTTT TGACCGTTTT AAACAAGGGT ATGATAAATA CCGATGATTT TGAATACAAA ATGGGTGGTT GTTTTTTAAA CGACAGTGGT CGCAAGAAAT TTTACCGTGC CTATGAAGAA CGGAGGCATG AGATGATCTC ACATCCGTTG TTTGGTTACA GAATATCTTA TATGCGTGTT TTTGAACTGC AAGCCCGCTT TTTTGCTAAG GTATTAAGGG GGGAATTGAA TGAATACAAG CCTTTTATGG TGAGGTGA
|
Protein sequence | MDEMCCDSQY KYLPVSAVAE ILYCPRNFYY RIVEGAKEYN AHLLEGQLQE EKRNNRLAIS REEYQQNKSV MVSSERLRLI GVLDAVEEGE DIYPVEYKKG ELKESLNDDV QVCAQAMLLE EKLGREILRG YIYYSQSHAR RVVVFDRSLR ELVENTIKRA YEIIYSGQIP QPLADFRCYG CSLAGRCLPY EVNYLTGADD KTPARPLPSL NLGRVLYVDE PGAYVRKKGE RVQVTRDKEV LVDIPLCNLE QLVLAGTVNI SAQVIKLLLD RGTEVHFVSR AGKYYGSLQP ALTKNSALRI AQHKAYQDME LRLKYAVLFV QGKLANMRTI LLRYNRDLKE KQLEEAICRL KSLSKNLYKA DSLNSLMGIE GAATREYFRV FNYMIKQHVP FNFQQRSRRP PGDPVNALLS FAYTLLTKDM IASVSIVGYD PYIGFLHRSD YGRPALALDF IEEFRPIVAD SVVLTVLNKG MINTDDFEYK MGGCFLNDSG RKKFYRAYEE RRHEMISHPL FGYRISYMRV FELQARFFAK VLRGELNEYK PFMVR
|
| |
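The gene sequence above is printed in space-separated blocks of 10 bases, as given 5' to 3' for the coding strand. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate the result. It assumes the standard codon assignments, which translation table 11 shares with the standard genetic code (table 11 differs in which codons may act as starts). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 bases of the record are used so the example stays short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the "Gene sequence" field.
prefix = clean("ATGGATGAAA TGTGTTGCGA TTCACAGTAT")
print(translate(prefix))  # MDEMCCDSQY
print(round(gc_fraction(prefix), 3))
```

The translated prefix matches the first 10 residues of the Protein sequence field. Passing the full Gene sequence field through `clean` and `gc_fraction` gives a value that can be compared with the reported GC content.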