Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2883 |
Symbol | cas1 |
ID | 6143521 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2952596 |
End bp | 2953519 |
Gene Length | 924 bp |
Protein Length | 307 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617752 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001744907 |
Protein GI | 170681830 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.921772 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGTTTG TACCACTGAG TCCGATCCCG TTAAAAGATC GCACATCTAT GATCTTCCTC CAGTACGGTC AGGTCGACGT ACTGGACGGC GCTTTCGTGC TGATCGACAA AACCGGGATC CGCACGCATA TTCCAGTGGG ATCGGTCGCC TGCATTATGC TCGAACCGGG AACAAGAGTT TCCCACGCGG CAGTCCATCT GGCCGCCACG GTGGGAACAC TGCTGGTCTG GGTCGGTGAA GCGGGCGTTC GCGTTTACTC CTCCGGACAA CCCGGAGGGG CGCGGGCAGA TAAATTACTC TACCAGGCAA AGCTGGCTTT AACGGAAGAT CTACGGTTGA AGGTGGTGCG CAAAATGTAT GAATTACGTT TTCGTGAACC ACCGCCAGCT CGCCGTTCAG TGGAGCAGTT ACGGGGAATT GAGGGATCCC GTGTTCGCCA GACCTATGCA TTACTGGCGA AGCAATATGG TGTGAAATGG AATGGTCGCA AATACGATCC TAAAGACTGG GAAAAAGGCG ATGTTGTCAA TCGCTGCATC AGTGCTGCCA CATCATGCCT GTACGGTATA TCTGAAGCGG CGGTATTAGC CGCGGGATAT GCGCCCGCTA TTGGATTTAT CCATAGTGGC AAACCGCTTT CATTTGTTTA TGACATAGCC GATATCATTA AATTTGATTT GGTTGTGCCA AAGGCATTTG AAATAGCGGC AAGACACCCG GCAGAACCTG ATAAAGAAGT GAGACTGGCC TGCCGTGATA TTTTCCGTAG TAGTAAATTA ACGGGCAAAT TAATTCCGTT AATTGAGGAA GTCCTTGCCG CCGGTGAAAT TGAACCACCA CAACCTGCGC CGGATATGTT ACCACCAGCT ATCCCGGAAC CTGAATTGCT AGGTGATAGT GGTCACCGGG GGCGCAGCGG ATGA
|
Protein sequence | MTFVPLSPIP LKDRTSMIFL QYGQVDVLDG AFVLIDKTGI RTHIPVGSVA CIMLEPGTRV SHAAVHLAAT VGTLLVWVGE AGVRVYSSGQ PGGARADKLL YQAKLALTED LRLKVVRKMY ELRFREPPPA RRSVEQLRGI EGSRVRQTYA LLAKQYGVKW NGRKYDPKDW EKGDVVNRCI SAATSCLYGI SEAAVLAAGY APAIGFIHSG KPLSFVYDIA DIIKFDLVVP KAFEIAARHP AEPDKEVRLA CRDIFRSSKL TGKLIPLIEE VLAAGEIEPP QPAPDMLPPA IPEPELLGDS GHRGRSG
|
| |