Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Svir_17300 |
Symbol | |
ID | 8387059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharomonospora viridis DSM 43017 |
Kingdom | Bacteria |
Replicon accession | NC_013159 |
Strand | - |
Start bp | 1787032 |
End bp | 1788060 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644975803 |
Product | CRISPR-associated protein, Cas1 family |
Protein accession | YP_003133585 |
Protein GI | 257055753 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGTGG CCGATATCTG GTGGAAAGCC CACCCCCACG ACCTGCACCG GCTCACGGAC CGTGTCTCGA GCGTCTACAT CGAACGCAGT CACCTCGATC GCGCGGAGAA CGCGATCGCC ATCATCAACC GTCGTGAGAC CGTGCGGCTT CCCGCCGCAC TCGTCGCCGT GGTGCTGCTC GGGCCGGGAA CCCGTGTCAC TCACGGCGCG ATGCAGTTGC TCGCGGACTC GGGCACGGCG GTGTGCTGGG TCGGCGAACA AGGCGTCAGG ATGTATGCCG CGGGGCTCGG CCCCAGCCGG GGCGCGGCGC TGCTCCAGCG ACAAGCGTAT TTGGTCAGTC GCACCACAAC GCGGCTGGAG GTGGCCCGGG CCATGTATGC CATGCGGTTT CCCGGCGAAG ACGTCTCCAC GCTCACCATG CAGCAGCTAC GCGGCCGGGA AGGCGCACGC GTCCGCAAGG TCTATCGGCA ACAGGCGCGA CAACACGGTG TGCCCTGGAA CGGACGTGCC TACAAGGCAG GGGACGCCTT CGCGGTCGGC GATGACCTCA ATCGCCTGTT GTCCGCCGCC AACGCCGCTC TCTACGGCAT TTGCCACGCC GTCATCGTGG GGCTGGGAGC CAGCCCCGGT CTCGGGTTCA TCCACACCGG CTCGGCGACC TCCTTCGTGA TGGACATCGC CGACCTGTAC AAAGCCGAAT ACACCATCCC GCTGGCCTTC CAGCTCGCAG CGCGAGGTCT CCTTGAGGAA CGTGACGCCC GAACCGCACT GCGCGACCGT ATCGCCGGTA CCGGCCTGCT CCCGCGCATC ATCAAGGACG TCAAGACACT ACTGGCACCC GAAGGGGTCG ACCTGCCCGA TCCGGAGGTG AACCTGCTCT GGGACGAGCG AGGGAATCCC GTCCCCGGCG GGGTGAACTG GTCCGATGAC TTCGATTTCC CGGTGATCGA CCCCAGCATG GACCAAACCC ACATCTCTGT GATCGGTCCC GAATTCGACA CACCCACCAC AGAGGCCGGG GAGTCATGA
|
Protein sequence | MSVADIWWKA HPHDLHRLTD RVSSVYIERS HLDRAENAIA IINRRETVRL PAALVAVVLL GPGTRVTHGA MQLLADSGTA VCWVGEQGVR MYAAGLGPSR GAALLQRQAY LVSRTTTRLE VARAMYAMRF PGEDVSTLTM QQLRGREGAR VRKVYRQQAR QHGVPWNGRA YKAGDAFAVG DDLNRLLSAA NAALYGICHA VIVGLGASPG LGFIHTGSAT SFVMDIADLY KAEYTIPLAF QLAARGLLEE RDARTALRDR IAGTGLLPRI IKDVKTLLAP EGVDLPDPEV NLLWDERGNP VPGGVNWSDD FDFPVIDPSM DQTHISVIGP EFDTPTTEAG ES
|
| |