Gene Information |
Locus tag | Dshi_3220 |
Symbol | cas1 |
ID | 5712276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 3388955 |
End bp | 3389830 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641269147 |
Product | cas1: CRISPR-associated protein Cas1 |
Protein accession | YP_001534554 |
Protein GI | 159045760 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
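The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3388955
end_bp = 3389830

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 876 291
```

Under these assumptions the computed values agree with the Gene Length (876 bp) and Protein Length (291 aa) fields.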
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.615148 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCTCAAGG GGCGGCTCGG TCTCGACAGT GCGAAGGTCC CCCACGGGGA CCGCGCAGGG CTACTCTATC TCGCGCGCGG GGCGCTGACC GCCCGCGATG GCACACTTGC CTTCCAGCGC GGGTCCGCCG AAGGTCCCGA CGCCCTCGCG CAGGGGGACC ATGCGATCCC GCTCCAGGGC GTGTCGATGA TCCTGCTTGG ACCCGGCTCC ACCGTCAGCC ACGACGCCCT GCGCCTGCTC GCCCATGCCC GTACGGCCCT CGCCGCCGTG GGCGAGGACG GCGTGCGCCT CTACACCGCA CCGCCACTGA TCCCCGACCG CTCCGGCCTC GCCCGGCTTC AGGCCCGGCT CTGGGCCGAT GACGACATGC GTATCATGAT CGCGCGGCGC ATGTACGCGC TCCGGCTTGG CGAGGTCCTG CCACACCGCA CGCTGGATGT CCTGCGCGGG ATCGAAGGCG CCCGGGTCAA GGAAAGCTAT AAGATCACCG CAGAGCGCTT CGGCGTGCCC TGGCGCGGCC GGCGCTACGA CCGGGCCGAC CCGCTAGCCG CCGACCTGCC CAACCAGGCC CTCAACCACG CCGCCAGCGC CGTCGAGGCC GCTGCGGCCA TCGCCGTGAG CGCAACCGCG ACCGTGCCAC AACTGGGCTT CATCCACGAG GATCCCGGCC AGTCCTTCGT CCTCGATATC GCCGACCTCT GGCGCGAAAC TGTGACACTG CCCTGCGCCT TCCGCGCCGC CAAACGCGCG GCCGAGCGCC CGGACATGCC CGTCGAGCGT ATCGCGCGCC GCCTGACCGG CGAGACGCTG GCGAAGGAGC AGGTCATCCC CGCGATGATC GACCGGATCA AGACCCTGAT CGAAGAGGGC CCATGA
Protein sequence | MLKGRLGLDS AKVPHGDRAG LLYLARGALT ARDGTLAFQR GSAEGPDALA QGDHAIPLQG VSMILLGPGS TVSHDALRLL AHARTALAAV GEDGVRLYTA PPLIPDRSGL ARLQARLWAD DDMRIMIARR MYALRLGEVL PHRTLDVLRG IEGARVKESY KITAERFGVP WRGRRYDRAD PLAADLPNQA LNHAASAVEA AAAIAVSATA TVPQLGFIHE DPGQSFVLDI ADLWRETVTL PCAFRAAKRA AERPDMPVER IARRLTGETL AKEQVIPAMI DRIKTLIEEG P
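The sequences in this record are printed in space-separated blocks of ten characters. As a rough sketch of how such a field might be cleaned and checked, the helpers below strip the spacing, compute percent GC, and test whether a sequence has the shape of a coding sequence. The function names are hypothetical, and the stop codons listed are those of translation table 11; the example runs only on the first two blocks of the gene sequence, not on the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Check that the length is a multiple of three and the last codon is a stop."""
    stop_codons = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11
    return len(seq) % 3 == 0 and seq[-3:] in stop_codons


# First 20 bases of the gene sequence in this record (a prefix only).
prefix = clean_sequence("ATGCTCAAGG GGCGGCTCGG")
print(len(prefix), gc_content(prefix))  # prints: 20 70.0
```

Passing the complete Gene sequence field through `clean_sequence` and `gc_content` would give a value to compare against the GC content field of the record.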