Gene Information |
Locus tag | Dde_0858 |
Symbol | |
ID | 3755658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. G20 |
Kingdom | Bacteria |
Replicon accession | NC_007519 |
Strand | - |
Start bp | 886595 |
End bp | 887362 |
Gene Length | 768 bp |
Protein Length | 255 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637781723 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_387354 |
Protein GI | 78355905 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
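
The coordinate and length fields in this table are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 886595
end_bp = 887362

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 768
print(gene_length % 3)  # 0, i.e. a whole number of codons
print(protein_length)   # 255
```

Because the gene lies on the minus strand, the listed gene sequence would correspond to the reverse complement of the replicon between these coordinates; the arithmetic above is unaffected by strand.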
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.41465 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACGTGTA TTCTTCTGGA ACCGGGTACA CGCGTCAGTC ATGCCGCGGT TGTTCTTGCT GCACGGTCAG GCACTCTTCT TGTCTGGACA GGAGAAGCCG GAGTGCGGTT ATACGCCTCC GGCCAACCGG GCGGAGCGCG CAGCGACAAA CTGCTTTATC AGGCGAAGCT GGCTCTGGAT GATACAGCCC GCCTTAAAGT TGTTCGGCGC ATGTTTGCCA TGCGCTTTCA GGAACAGGCA CCGGACAGAC GCAGCGTTAA CCAGTTGCGC GGTCTGGAAG GGGCGCGGGT ACGGGCTCTT TATAGCCTGC TTGCCAAACA ATATAAGGTG CCTTGGAAAG GCAGAAAATA TGACCCGAAG GATTGGGAAT CCGGCGACCT GCCTAACCGT TGCATAAGTG CGGCCACAGC GTGCCTGTAT GGAGTATGCG AAGCCGGAAT TCTGGCCGCT GGCTATGCTC CTGCCATTGG CTTTTTACAC ACGGGCAAAC CCCAGAGCTT TGTGTATGAT ATAGCAGACC TGTTCAAATT CGAAACGGTA GTTCCCGTCG CCTTTCGCGT GGCAGCAGGT AAACCCGTTA ACCCCGATCA GGCTGTCCGT CTGGCATGCA GAGATATGTT CCGCAAAACA AAGCTGTTGC AACGCATCAT CCCCACTATT GAAGAAGTGC TGGCCGCCGG TGAAATTGCC CCGCCTGAAA TTGAAGGAGT TGTTACGCCG GCAATACAGG AAAAAGAGGG TATAGGCGAT GACGGTCATC GTGGTTGA
Protein sequence | MTCILLEPGT RVSHAAVVLA ARSGTLLVWT GEAGVRLYAS GQPGGARSDK LLYQAKLALD DTARLKVVRR MFAMRFQEQA PDRRSVNQLR GLEGARVRAL YSLLAKQYKV PWKGRKYDPK DWESGDLPNR CISAATACLY GVCEAGILAA GYAPAIGFLH TGKPQSFVYD IADLFKFETV VPVAFRVAAG KPVNPDQAVR LACRDMFRKT KLLQRIIPTI EEVLAAGEIA PPEIEGVVTP AIQEKEGIGD DGHRG
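
The relationship between the gene sequence, the GC content field, and the protein sequence can be checked with a short script. This is a minimal sketch rather than the database's own method: the helper names (`clean`, `gc_percent`, `translate_table11`) are hypothetical, and it assumes that an initiator codon from the translation table 11 start set is read as Met, which is consistent with the listed gene beginning with GTG while the protein begins with M.

```python
# Sketch only: helper names are hypothetical, not part of the source record.
BASES = "TCAG"
# Amino acids for NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for table 11; as the first codon they are read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spacing used in the record and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a non-empty sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate_table11(seq):
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        codon = s[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, spacing kept as in the record.
prefix = "GTGACGTGTA TTCTTCTGGA ACCGGGTACA"
print(translate_table11(prefix))  # MTCILLEPGT
```

Passing the full Gene sequence value to `translate_table11` and `gc_percent` allows a comparison against the Protein sequence and GC content fields.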