Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeSA_A3092 |
Symbol | cas |
ID | 6515366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | - |
Start bp | 2983090 |
End bp | 2984007 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642748111 |
Product | crispr-associated protein Cas1 |
Protein accession | YP_002115888 |
Protein GI | 194734989 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.79761 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTGGC TGCCGCTGAA TCCCATCCCG TTGAAAGACC GTGTTTCGAT GATATTTCTC CAGTACGGAC AAATAGATGT GATCGACGGT GCGTTTGTGC TTATCGATAA AACGGGTGTA CGTACCCATA TTCCTGTTGG ATCGGTGGCC TGCATCATGC TGGAACCGGG GACGCGGGTT TCCCATGCCG CCGTGCGACT TGCGGCAACG GTGGGTACGT TACTGGTGTG GGTGGGGGAA GCGGGCGTAC GCGTTTACGC TTCGGGGCAG CCTGGTGGTG CCCGTTCCGA CAAATTGCTT TATCAGGCGA AACTTGCACT GGATGAAGAT TTGCGGCTGA AGGTCGTGCG TAAAATGTTT GAATTACGTT TTGGCGAACC CGCGCCGAGC CGTCGTTCTG TAGATCAATT GCGTGGTATT GAGGGGAGCC GTGTGCGGGC AACCTATGCA CTACTTGCTA AGCAGTATGG CGTGAAATGG CTGGGACGTC GCTACGATCC GAAAGACTGG GAGAAAGGCG ATGTCATTAA TCAGTGTATC AGCTCGGCAA CCTCCTGCCT CTATGGCGTA ACGGAGGCGG CAATACTGGC TGCCGGATAT GCGCCCGCGA TTGGATTTGT GCACACCGGC AAGCCGCTTT CTTTTGTCTA TGATATTGCC GATATCATTA AATTTGAGAC CGTTGTACCG AAAGCATTTG AAATTGCGCG ACGTAATCCT GCCGAGCCTG ATCGTGATGT CCGTATTGCC TGCCGGGATA TCTTCCGCAG TGGAAAAACA TTGGCGAAAT TGATTCCTCT TATTGAAGAT GTTCTCGCGG AAGGGGAAAT TCAACCGCCG TTACCTCCTG AAGATTCACA ACCCATAGCG ATCCCTCTTC CTGTTGCGTT GGGAGATTCC GGTCATCGGA GTACCTAA
|
Protein sequence | MSWLPLNPIP LKDRVSMIFL QYGQIDVIDG AFVLIDKTGV RTHIPVGSVA CIMLEPGTRV SHAAVRLAAT VGTLLVWVGE AGVRVYASGQ PGGARSDKLL YQAKLALDED LRLKVVRKMF ELRFGEPAPS RRSVDQLRGI EGSRVRATYA LLAKQYGVKW LGRRYDPKDW EKGDVINQCI SSATSCLYGV TEAAILAAGY APAIGFVHTG KPLSFVYDIA DIIKFETVVP KAFEIARRNP AEPDRDVRIA CRDIFRSGKT LAKLIPLIED VLAEGEIQPP LPPEDSQPIA IPLPVALGDS GHRST
|
| |