Gene Information |
Locus tag | Dtox_3000 |
Symbol | |
ID | 8429990 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | - |
Start bp | 3200114 |
End bp | 3201121 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 645035253 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_003192376 |
Protein GI | 258516154 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype |
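
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon not counted in the protein length. Both are assumptions about this record's conventions, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3200114
end_bp = 3201121
reported_gene_length = 1008      # bp
reported_protein_length = 335    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length == reported_gene_length)        # coordinate span matches Gene Length
print(gene_length % 3 == 0)                       # length is a whole number of codons
print(protein_length == reported_protein_length)  # codons minus stop matches Protein Length
```

Under these assumptions all three checks hold for this record: 1008 bp is 336 codons, and 336 minus one stop codon gives 335 aa.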
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0143403 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTAAGA GCAGTGATAC CCGGTATATT TTTTCAGCCG GAGATTTGTA TCAAAAGGAT TTTTCCATAG CATTCCGTAA AGAAGATGGT AATTTTTATA TTCCCATCAA GGATACGCGG GAACTATACT GCTTTAATGA TATCACCCTG AGCACAAAGT TGCTTCAACT TTTGGCCAAA GCCGGTATTG TGGTGCATTT CTTCGGTTAT TATGAAAATT ATATAGGGAC ATTTTATCCC AAAGAGCAAT TATTAAGCGG GAGGCTGACG GTTGCTCAGG CTTTGGCTTA TGAGCAAAAC AGGTTGCAGA TCGCGGGACA AATTATTAAG GGAATTGCTA AAAATACTTA TTTTGTACTT TACCATTATT ACCGACATGG GAAAAGTGAA CTAAAAGACT TTTTGGACTG GTTGCGTAAG GATGTTTCGC GCTTGGTCGA TTCTGTCGGC AATATTAAAC AGCTACTGCG TATTGAAGGG GAAATATGGG CCCGGTTTTA TCAGTCATTC CGGGTATTTC TGCCCGAGTC TTTTGCTATG AACAAAAGAG TAAAACGTCC GCCCGATAAT CCCATTAACG CACTGATATC TTTTGGTAAT ACCTTGTTGT ACACAAAAAC CATTACGCAA ATTTTCCATA CCCACTTGAA TCAAACTATT TCCTTCTTGC ACGAACCGGC GGAACGACGA TTTTCCTTAA GCTTGGATTT GTCGGAAGTA TTTAAACCGG TATTGGTTTG CAAAACTATT TTTGATTGTG TAAATAATCG CAAAATTATG GTAGAAAAGC ATTTTGACAA AAAGCTGAAT TATGCCCTGC TAAACGAGCT TGGGCGCAAA GTGTTTATCG AGGCTTTTGA AGAACGGTTG AACCAGACAT TTGAACATCA AACCTTAAGA AGAAGGTGTA GTTACAAACA AGCTATCAGG TTAGACGGAT ATAAGCTCAT TAAGCATATT TTGGAAGGAA AAAACTTCGT TCCTTTTTGC ATGGAGGATA AAAAGTGA
Protein sequence | MGKSSDTRYI FSAGDLYQKD FSIAFRKEDG NFYIPIKDTR ELYCFNDITL STKLLQLLAK AGIVVHFFGY YENYIGTFYP KEQLLSGRLT VAQALAYEQN RLQIAGQIIK GIAKNTYFVL YHYYRHGKSE LKDFLDWLRK DVSRLVDSVG NIKQLLRIEG EIWARFYQSF RVFLPESFAM NKRVKRPPDN PINALISFGN TLLYTKTITQ IFHTHLNQTI SFLHEPAERR FSLSLDLSEV FKPVLVCKTI FDCVNNRKIM VEKHFDKKLN YALLNELGRK VFIEAFEERL NQTFEHQTLR RRCSYKQAIR LDGYKLIKHI LEGKNFVPFC MEDKK
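
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a record might be processed, the Python below strips the spacing, computes a GC fraction, and translates codons. It relies on the fact that translation table 11 assigns the same amino acids as the standard genetic code and differs only in permitted start codons; alternative start codons are not handled here. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The gene sequence is listed in coding orientation (it begins with ATG and ends with TGA) although the gene lies on the minus strand, so no reverse complement is applied.

```python
# Standard genetic code laid out in TCAG order; table 11 uses the same
# amino-acid assignments (it differs only in allowed start codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases (expects a non-empty sequence)."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGGGTAAGA GCAGTGATAC CCGGTATATT"
print(translate(first_blocks))  # MGKSSDTRYI
print(gc_fraction(first_blocks))
```

The translated prefix matches the first block of the listed protein sequence. Passing the full Gene sequence field to `gc_fraction` and `translate` would allow a comparison against the reported GC content and protein sequence.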