Gene Information |
Locus tag | ECH74115_4011 |
Symbol | cse4 |
ID | 6967722 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3705527 |
End bp | 3706582 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643387779 |
Product | CRISPR-associated protein, Cse4 family |
Protein accession | YP_002272222 |
Protein GI | 209399904 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01869] CRISPR system CASCADE complex protein CasC/Cse4 |
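The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the terminal stop codon. The variable names are made up for this example and are not part of the database record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3705527
end_bp = 3706582
gene_length_bp = 1056
protein_length_aa = 351

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS holds one codon per residue plus one stop codon.
codons = span // 3
residues = codons - 1

print(span, codons, residues)  # 1056 352 351
```

Under those assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.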
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.453377 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACAT TTATTCAGCT TCATTTGTTA ACCGCTTACC CTGCAGCCAA CCTTAACCGT GATGACACCG GAGCGCCGAA AACCGTGGTC CTGGGTGGAG CAACGCGACT GCGCGTTTCC TCGCAAAGTC TGAAACGTGC GTGGCGCACT TCTGCACTTT TTGAACAGGC ACTGGCGGGC CATATTGGTA TTCGCAGTGG GCGTATTGCG CGTGAGGCGG CAACTATCCT GATTGAGAAA GGCATCGAAG AGAAAAAAGC CATCGAATGG GCGGCAAAAA TTGCGGATTA TCTTGGGAAA GCTAAAAACG ACAAAAAACC AAAAGATCCG CTCACTAACG CCGAAACTGA ACAATTAGTC CATATCAGCC CGGCAGAATT TGACGCCGTA AAAGCGCTGG CCCATCAACT GGCCGAAGAA AAGCGCGCGC CAAAAGAGGA AGATCTCGCT TTGTTACGTA AAGATCGCAT GGCAGTAGAT ATCGCTATGT TTGGTCGTAT GCTGGCGAAT AAACCCGAAT TTAATGTTGA AGCCGCCTGC CAGGTCGCGC ATGCATTTGG TGTCAGTGAA ACGATTGTCG AAGATGATTT TTTCACCGCC GTTGATGATT TGCGCCAGGC TTCTGAAGAT GCCGGTGCCG GGCATCTTGG TGAAACCGGA TTCGGTTCTG CGCTGTTCTA CACCTATATC TGCATCGATA AAGATCTGCT GGTCGAAAAC CTCGGCGGTG ACGAAGCGTT AGCTAATCAG ACCTTGCGCG CCTTTACGGA AGCCGCACTT AAAGTCTCCC CAACCGGCAA ACAGAACAGC TTTGCCAGCC GTGCCTACGC CTCCTGGGCG CTGGCAGAAA AAGGCACCGA ACAACCACGT TCTCTGGCGG CGGCTTTCTA TGAACCCATT AATGGCACCC GGCAGTTAGA TGTGGCGGTG CAGCGTATTA CAACGCTTCG CGAAAATATG AATACGGTCT ATGAACAGAA GACCGAATGC GCAAGCTTTG ACGTGATGAA CAAACAGGGA AGCATGAAGG ACGTGCTGGA CTTTATCTGC GCGTAA
Protein sequence | MTTFIQLHLL TAYPAANLNR DDTGAPKTVV LGGATRLRVS SQSLKRAWRT SALFEQALAG HIGIRSGRIA REAATILIEK GIEEKKAIEW AAKIADYLGK AKNDKKPKDP LTNAETEQLV HISPAEFDAV KALAHQLAEE KRAPKEEDLA LLRKDRMAVD IAMFGRMLAN KPEFNVEAAC QVAHAFGVSE TIVEDDFFTA VDDLRQASED AGAGHLGETG FGSALFYTYI CIDKDLLVEN LGGDEALANQ TLRAFTEAAL KVSPTGKQNS FASRAYASWA LAEKGTEQPR SLAAAFYEPI NGTRQLDVAV QRITTLRENM NTVYEQKTEC ASFDVMNKQG SMKDVLDFIC A
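The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the method the database used: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is built from the standard NCBI ordering. Translation table 11 assigns the same amino acids to codons as the standard table and differs only in its permitted start codons, so the same mapping is used here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Table 11 shares these amino acid assignments with the standard table.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGACGACAT TTATTCAGCT TCATTTGTTA"
print(translate(first_blocks))  # MTTFIQLHLL
```

The gene sequence is listed in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand), so no reverse complement is needed before translating. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.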