Gene Information |
Locus tag | EcSMS35_2888 |
Symbol | cse1 |
ID | 6144590 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2956508 |
End bp | 2958070 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641617757 |
Product | CRISPR-associated Cse1 family protein |
Protein accession | YP_001744912 |
Protein GI | 170683207 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02547] CRISPR system CASCADE complex protein CasA/Cse1 |
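
The length fields above can be cross-checked from the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the listed values are consistent with that convention) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of the source database.

```python
# Values copied from the record above.
START_BP = 2956508
END_BP = 2958070


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1563 520
```

Both results agree with the Gene Length (1563 bp) and Protein Length (520 aa) rows.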
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.447574 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTCGT TTTCACTTCT GACAACCCCG TGGTTGCCTG TTCGTTTCAA AGACGGAACG ACAGGCAAGC TGGCACCAGT CGATCTGGCT GATGAAAATG TTGTCGATAT CGCCGCACCG CGAGCAGATT TACAAGGTGC GGCCTGGCAG TTTTTGCTGG GACTACTACA AACCAGCTTC GCGCCAAAAG ATCATCGTCG TTGGGATGAT ATCTGGGAAG ACGGACTGGA AGCTGAAAAG CTACGAGAAG CATTGCTGTC TTTAGAACAT GCTTTCCAGT TTGGGCCTGA TTCATCTTCA TTTATGCAGG ATTTTGAGGC GCTCACGGGA GATAAAGTTC CGGTCGCTTC GCTACTGCCT GAGATTCCCG GCTCTCAAAC GACGAAGTTT AATAAAGACC ATTTTATCAA GCGGGGTGTA ACGGAATATC TGTGCCCCCA TTGCTTAGCA TTGGCCTTGT TCTCTTTACA GTTGAATGCA CCCGCGGGCG GCAAAGGCTA TCGCACCGGT TTACGAGGCG GTGGGCCGAT GACGACTCTC ATTGAATTAC AGGAATATCA GGGCAATCAA CAAACACCCT TGTGGCGCAA ACTGTGGCTC AACGTGATGC CGCAGGATGA AGCCGACTTA CCGCTACCCA AAAAATTTGA CGATCTGATT TTCCCCTGGC TAGGCCCGAC GCGTACCAGC GAACTGGTCG GTGCGGTGGT AACCCATGAT CAGGTCAATA AACTCCAGGC GTACTGGGGA ATGCCGCGGC GTATACGTAT TGATTTTAAT ACCACGACAG TCGGCAACTG CGATATTTGC GACGAGCAGA ACGACGCGCT CCTGAGCCTG ATGACGACCA AAAATTATGG TGCCAATTAT GCCATGTGGC AGCATCCCTT AACGCCTTAC CGTGTACCAC TTAAAGAGGG TGGCGAGTTT TACTCCGTTA AACCACAACC GGGCGGTTTA ATCTGGCGCG ACTGGTTAGG CCTTATCGAA ACGGGTAAAT CAGAAAACAA TACGGAACTT CCCGCGTTGG TGGTGAAACT CTTTAATGCC AGCAGTCTGA AACAGGCAAA AGTGGGCCTA TGGGGATTTG GTTATGATTT TGACAACATG AAAGCGCGCT GTTGGTACGA ACACCATTTC CCGCTACTGC TCAATAAAAA AGAAGGCCAG ATACCGAAGC TGCGGCTGGC TGCGCAAACG GCTTCACGGA TTCTGAGTCT GTTACGGAGT GCATTGAAAG AAGCATGGTT CTCCGATCCA AAAGGTGCAA GGGGTGATTT CAGTTTTGTG GATATCGACT TCTGGAACAA AACTCAGCAT CGCTTCCTGA GGTTAGTGCG CCAAATTGAA GAAGGTCAGG ATGCGGATGA ATTACTCGGC AAATGGCAAA AGGAAATGTG GTTATTCGCG CGTCAGGATT TTGACGAGCG TGTATTCACT AATCCTTATG AGCCCGTTGA TTTGAAACGC GTCATGACCG CGCGCAAGAA ATATTTTACA ACATCGGCGG AGAAGCAAAG TGCTAAAGCC GCCAGGGAGA AAAAGCAGGA GGCTGCTGAA TGA
|
Protein sequence | MNSFSLLTTP WLPVRFKDGT TGKLAPVDLA DENVVDIAAP RADLQGAAWQ FLLGLLQTSF APKDHRRWDD IWEDGLEAEK LREALLSLEH AFQFGPDSSS FMQDFEALTG DKVPVASLLP EIPGSQTTKF NKDHFIKRGV TEYLCPHCLA LALFSLQLNA PAGGKGYRTG LRGGGPMTTL IELQEYQGNQ QTPLWRKLWL NVMPQDEADL PLPKKFDDLI FPWLGPTRTS ELVGAVVTHD QVNKLQAYWG MPRRIRIDFN TTTVGNCDIC DEQNDALLSL MTTKNYGANY AMWQHPLTPY RVPLKEGGEF YSVKPQPGGL IWRDWLGLIE TGKSENNTEL PALVVKLFNA SSLKQAKVGL WGFGYDFDNM KARCWYEHHF PLLLNKKEGQ IPKLRLAAQT ASRILSLLRS ALKEAWFSDP KGARGDFSFV DIDFWNKTQH RFLRLVRQIE EGQDADELLG KWQKEMWLFA RQDFDERVFT NPYEPVDLKR VMTARKKYFT TSAEKQSAKA AREKKQEAAE
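
The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate the DNA. It relies on two assumptions: the gene sequence is already in coding orientation (as listed, it starts with ATG and ends with TGA even though the gene is on the minus strand), and the amino-acid assignments of translation table 11 match the standard genetic code, with the two tables differing only in permitted start codons. The helper names are hypothetical, and only the first 30 bases of the record are used for brevity.

```python
BASES = "TCAG"
# Amino acids for codons in the order TTT, TTC, TTA, TTG, TCT, ..., GGG
# ('*' marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def ungroup(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence in the record.
PREFIX = "ATGAACTCGT TTTCACTTCT GACAACCCCG"
print(translate(ungroup(PREFIX)))  # MNSFSLLTTP
```

The translated prefix matches the first block of the protein sequence. Applying `gc_fraction` to the full ungrouped gene sequence would give a value to compare with the GC content row.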