Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A3249 |
Symbol | cas1 |
ID | 6873274 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 3121641 |
End bp | 3122561 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642786264 |
Product | crispr-associated protein Cas1 |
Protein accession | YP_002216905 |
Protein GI | 198243044 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.222064 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 66 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGTTCG TACCGCTCAA CCCGATCCCG TTAAAAGATC GAACCTCGAT GATCTTCCTC CAGTACGGTC AAATTGACGT GCTGGATGGG GCATTCGTGC TGATCGATAA AACGGGAGTC CGCACGCATA TTCCCGTCGG TTCGGTCGCT TGTATCATGC TGGAACCGGG AACGCGGGTT TCCCATGCGG CAGTGCATTT AGCATCAACG GTCGGCACCC TGTTGGTATG GGTGGGCGAG GCGGGAGTGC GGGTCTATTC CTCCGGACAA CCCGGTGGCG CACGAGCCGA TAAGTTGCTT TATCAGGCAA AGCTGGCGTT AGATGATGAC CTGCGGCTTA AAGTAGTCCG CAAAATGTAT GAACTGCGTT TTCGTGAGCC GCCTCCCGCC CGTCGTTCCG TTGAGCAACT GCGCGGTATT GAAGGATCCC GTGTGCGGGC GACCTATGCA TTACTGGCGA AGCAGTATGG CGTGAAATGG CATGGTCGTA ACTATGATCC GAAAGACTGG GAGAAGGGGG ATGTCGTCAA CCGATGTATT AGCGCGGCGA CATCGTGCCT GTACGGGATT TCAGAAGCGG CTATCCTGGC GGCGGGATAT GCGCCGGCTA TCGGTTTTAT CCATAGCGGT AAGCCGCTTT CTTTTGTTTA TGACATTGCC GATATCATCA AATTTGAATC GGTGGTGCCC AAAGCATTTG AGATCGCCGC TCGTCACCCG GCGGAACCTG ATAAAGAAGT GCGCCTGGCC TGCCGGGATA TTTTTCGCAG TTCGAAGCTG ACCGGAAAAT TGATCCCGCT GATCGAAGAG GTGCTCGCTG CCGGTGAAAT TGAACCACCT CAGCCTGCGC CGGATATGCT GCCGCCAGCA ATACCGGAAC CTGAATCACT GGGTGATAGC GGCCATCGGG GGCATGGTTG A
|
Protein sequence | MTFVPLNPIP LKDRTSMIFL QYGQIDVLDG AFVLIDKTGV RTHIPVGSVA CIMLEPGTRV SHAAVHLAST VGTLLVWVGE AGVRVYSSGQ PGGARADKLL YQAKLALDDD LRLKVVRKMY ELRFREPPPA RRSVEQLRGI EGSRVRATYA LLAKQYGVKW HGRNYDPKDW EKGDVVNRCI SAATSCLYGI SEAAILAAGY APAIGFIHSG KPLSFVYDIA DIIKFESVVP KAFEIAARHP AEPDKEVRLA CRDIFRSSKL TGKLIPLIEE VLAAGEIEPP QPAPDMLPPA IPEPESLGDS GHRGHG
|
| |