Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4013 |
Symbol | cse1 |
ID | 6968976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3707127 |
End bp | 3708689 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643387781 |
Product | CRISPR-associated protein, Cse1 family |
Protein accession | YP_002272224 |
Protein GI | 209395699 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02547] CRISPR system CASCADE complex protein CasA/Cse1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.246071 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTCGT TTTCACTTCT GACAACCCCG TGGTTGCCCG TTCGTTTTAA AGACGGAACA ACAGGCAAGC TGGCGCCAGT CGATCTGGCG GATGAAAATG TTGTCGATAT CGCTGCGCCG CGGGCAGATC TCCAGGGGGC GGCATGGCAG TTTTTGCTGG GGTTACTACA AAGCAGTTTC GCGCCAAAAG ATTATCGTCG TTGGGATGAT ATCTGGGAAG ACGGGCTGGA AGCTGAAAAG CTACGGGAAG CATTGCTGTC ATTAGAACAC CCTTTCCAGT TTGGCCCAGA TTCACCTTCA TTTATGCAGG ATTTCGAGGT GCTCATGGGC GATAAAGTTC AGGTCGCTTC GCTACTGCCT GAGATTCCCG GCGCTCAAAC AACGAAGTTT AATAAAGACC ACTTTATTAA GCGTGGCGTG ACTGAACACG TATGCTCTCA TTGTTCTGCG TTAGCTCTGT TCTCCCTACA GTTAAATGCG CCGTCAGGTG GCAAAGGCTA TCGCACCGGT TTACGCGGCG GTGGGCCGAT GACGACTCTG ATTGAATTGC AGGAGTATCA GGGCAATCAA CAAGCCCCCT TGTGGCGCAA ACTGTGGCTC AACGTGATGC CGCAGGATGA AGCCGACTTA CCGCTACCCA AAAAATTTGA CGATCTGGTT TTCCCCTGGC TTGGCCCGAC GCGTACCAGC GAACTGGCCG GTGCGGTGGT AACCGATGAT CAGGTCAATA AACTCCAGGC GTACTGGGGA ATGCCGCGGC GTATTCGTAT TGATTTTAAT ACCACGACAG TCGGCAACTG CGATATTTGC GGTGAGCAGA GTGACGCGCT TCTGAGTTTG ATGACTACCA AAAATTACGG TGCGAATTAT GCCATGTGGC AGCATCCCTT AACGCCTTAC CGTGTACCAC TTAAAGAGGG CGGTGAGTTT TACTCCGTTA AACCACAACC GGGCGGTTTA ATCTGGCGCG ACTGGTTAGG CCTTATCGAA ACGGGTAAGT CAGAAAACAA TACGGAACTT CCCGCGCTGG TGGTGAAACT CTTTAATGCC AGCAGTCTGA AACAGGCAAA AGTGGGCCTG TGGGGATTTG GTTATGATTT CGACAACATG AAAGCGCGCT GTTGGTACGA ACACCATTTC CCGCTGCTGC TCAATAAAAA AGAAGGCCAG ATACCGAAGC TGCGGCTGGC TGCGCAAACG GCTTCACGGA TTCTGAGTCT GTTACGGAGT GCATTGAAAG AAGCATGGTT CTCCGATCCA AAAGGTGCAA GGGGTGATTT CAGTTTTGTG GATATCGACT TCTGGAACAA AACTCAGCAT CGCTTCCTGA GGTTAGTGCG CCAAATTGAA GAAGGTCAGG ATGCGGATGA ATTACTCGGC AAATGGCAAA AGGAAATTTG GTTATTCGCA CGTCAGGATT TTGACGAGCG TGTATTCACC AATCCTTATG AGCCCGTTGA TTTGGAACGC GTCATGACCG CGCGCAAGAA ATATTTTACA ACATCGGCGG AGAAGCAAAG TGCTAAAGCC GCCAGGGAGA AAAAGCAGGA GGCTGCTGAA TGA
|
Protein sequence | MNSFSLLTTP WLPVRFKDGT TGKLAPVDLA DENVVDIAAP RADLQGAAWQ FLLGLLQSSF APKDYRRWDD IWEDGLEAEK LREALLSLEH PFQFGPDSPS FMQDFEVLMG DKVQVASLLP EIPGAQTTKF NKDHFIKRGV TEHVCSHCSA LALFSLQLNA PSGGKGYRTG LRGGGPMTTL IELQEYQGNQ QAPLWRKLWL NVMPQDEADL PLPKKFDDLV FPWLGPTRTS ELAGAVVTDD QVNKLQAYWG MPRRIRIDFN TTTVGNCDIC GEQSDALLSL MTTKNYGANY AMWQHPLTPY RVPLKEGGEF YSVKPQPGGL IWRDWLGLIE TGKSENNTEL PALVVKLFNA SSLKQAKVGL WGFGYDFDNM KARCWYEHHF PLLLNKKEGQ IPKLRLAAQT ASRILSLLRS ALKEAWFSDP KGARGDFSFV DIDFWNKTQH RFLRLVRQIE EGQDADELLG KWQKEIWLFA RQDFDERVFT NPYEPVDLER VMTARKKYFT TSAEKQSAKA AREKKQEAAE
|
| |