Gene Information |
Locus tag | EcE24377A_3062 |
Symbol | cse1 |
ID | 5590399 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 3061527 |
End bp | 3063089 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640926706 |
Product | CRISPR-associated Cse1 family protein |
Protein accession | YP_001464082 |
Protein GI | 157156933 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02547] CRISPR system CASCADE complex protein CasA/Cse1 |
|
|
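The coordinate and length fields above are related by simple arithmetic, which the sketch below checks. It assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the record does not state either convention explicitly. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature whose start and end are both inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length(length_bp):
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(3061527, 3063089)
print(length_bp)                  # 1563
print(protein_length(length_bp))  # 520
```

Both values agree with the Gene Length and Protein Length fields in the record.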
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTCGT TTTCACTTCT GACAACCCCG TGGTTGCCTG TTCGTTTTAA AGACGGAACG ACAGGCAAGC TGGCACCAGT CGATCTGGCT GATGAAAATG TTGTCGATAT CGCTGCGCCG CGGGCAGATC TCCAGGGTGC GGCCTGGCAG TTTTTGCTGG GGCTACTACA AACCAGCTTC GCGCCCAAAA ATCACGGTCG TTGGGATGAT ATCTGGGAAG ACGGACTGGA GGCTGAAAAG CTACGGGAAG CATTGCTGTC CTTAGAACAC GCTTTCCAGT TTGGGGCAGA TTCCCCTTCA TTTATGCAGG ATTTCGAGGC GCTCAAGGGA GATAAAGTTC AGGTCGCTTC GCTACTGCCT GAGATTCCCG GCGCTCAAAC AACGAAGTTT AATAAAGACC ACTTTATTAA GCGTGGCGTG ACTGAACACG TATGCCCTCA TTGTTCCGCG TTAGCTCTGT TCTCCCTACA GTTAAATGCG CCATCAGGCG GCAAAGGCTA TCGCACCGGT TTGCGCGGCG GTGGGCCGAT GACCACTCTG ATTGAATTAC AGGAATATCA AGGCAATCAA CAAACGCCCT TGTGGCGCAA ACTGTGGCCC AATGTGATGC CGCAGGATGA AGCCGACTTA CCGCTACCCA AAAAATTTGA CGATCTGGTT TTCCCCTGGC TTGGCCCGAC GCGAACCAGC GAACTGGCCG GTGCGGTGGT AACCCATGAT CAGGTCAATA AACTCCAGGC TTACTGGGGA ATGCCGCGCC GTATTCGTAT TGATTTTAAT ACCACGACAG TCGGCAACTG CGATATTTGC GGTGAACAGA GCGACGCGCT CCTGAGCCTG ATGACGACCA AAAATTACGG TGCGAATTAT GCCATGTGGC AGCATCCCTT AACGCCTTAC CGTATACCAC TTAAAGAGGG TGGCGAGTTT TACTCGGTCA AACCACAACC GGGCGGTTTA ATCTGGCGCG ACTGGTTAGG CCTTATCGAA ACGGGTAAAT CAGAAAACAA TACGGAACTT CCCGCGCTGG TGGTGAAACT CTTTAATGCC AGCAGTCTGA AACAGGCAAA AGTAGGCCTG TGGGGATTTG GCTATGATTT CGACAACATG AAAGCGCGCT GTTGGTACGA ACACCATTTC CCGCTGCTGC TCAAGAAAAA AGAAGGCCAG ATACCGAAAC TGCGTCTGGC TGCGCAAACG GCTTCACGGA TTCTAAGTCT GTTACGAAGT GCATTGAAAG AAGCGTGGTT CTCCGATCCA AAAGGTGCAC GGGGTGATTT CAGTTTTGTG GATATCGACT TCTGGAACAA AACTCAGCAT CGCTTCCTGA GGTTAGTGCG CCAAATTGAA GAAGGTCAGG ATGCGGACGA ATTACTCGGC AAATGGCAAA AGGAAATTTG GTTATTCGCG CGTCAGGATT TTGACGAGCG TGTATTCACC AATCCATATG AGCCTGTTGA TTTGAAACGC GTTATGACCG CGCGCAAGAA ATATTTCACA ACATCGGCGG AGAAGCAAAG TGCTAAAGCC GCCAGGGAGA AAAAGCAGGA GGCTGCTGAA TGA
Protein sequence | MNSFSLLTTP WLPVRFKDGT TGKLAPVDLA DENVVDIAAP RADLQGAAWQ FLLGLLQTSF APKNHGRWDD IWEDGLEAEK LREALLSLEH AFQFGADSPS FMQDFEALKG DKVQVASLLP EIPGAQTTKF NKDHFIKRGV TEHVCPHCSA LALFSLQLNA PSGGKGYRTG LRGGGPMTTL IELQEYQGNQ QTPLWRKLWP NVMPQDEADL PLPKKFDDLV FPWLGPTRTS ELAGAVVTHD QVNKLQAYWG MPRRIRIDFN TTTVGNCDIC GEQSDALLSL MTTKNYGANY AMWQHPLTPY RIPLKEGGEF YSVKPQPGGL IWRDWLGLIE TGKSENNTEL PALVVKLFNA SSLKQAKVGL WGFGYDFDNM KARCWYEHHF PLLLKKKEGQ IPKLRLAAQT ASRILSLLRS ALKEAWFSDP KGARGDFSFV DIDFWNKTQH RFLRLVRQIE EGQDADELLG KWQKEIWLFA RQDFDERVFT NPYEPVDLKR VMTARKKYFT TSAEKQSAKA AREKKQEAAE
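The gene and protein sequences above are related by the genetic code named in the Translation table field. The following is a minimal sketch of that relationship in plain Python, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first three 10-base blocks of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces that separate the 10-character blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence as listed in the record.
prefix = "ATGAACTCGT TTTCACTTCT GACAACCCCG"
print(translate(prefix))  # MNSFSLLTTP
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.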