Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_3060 |
Symbol | cse4 |
ID | 5589843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 3059927 |
End bp | 3060982 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640926704 |
Product | CRISPR-associated Cse4 family protein |
Protein accession | YP_001464080 |
Protein GI | 157157725 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01869] CRISPR system CASCADE complex protein CasC/Cse4 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACAT TTATTCAGCT TCATTTGTTA ACCGCTTACC CTGCTGCCAA CCTTAACCGT GATGATACCG GTGCGCCAAA AACTGTGGTG CTGGGTGGAG CAACGCGTCT GCGCGTTTCC TCGCAAAGTC TGAAACGTGC GTGGCGCACT TCTGCACTTT TTGAACAGGC ACTGGCGGGC CATATTGGTA TTCGCAGTGG GCGTATTGCG CGTGAGGCGG CAACTATCCT GATTGAGAAA GGAATCGAAG AGAAAAAAGC CATCGAATGG GCGGCAAAAA TTGCAGATTA TCTTGGGAAA GCTAAAAACG ACAAAAAACC AAAAGATCCG CTCACTAACG CCGAAACTGA ACAGTTAGTC CATATCAGCC CGGCAGAATT TGACGCCGTA AAAGTGCTGG CCCATCAGCT TGCAGAAGAA AAGCGCGCGC CAAAAGAGGA AGATCTCGCG TTGTTACGCA AAGACCGTAT GGCGGTAGAT ATTGCCATGT TTGGTCGTAT GCTGGCGAAT AAACCCGAGT TTAATGTTGA AGCCGCCTGC CAGGTAGCGC ACGCATTTGG TGTCAGTGAA ACGATTGTCG AAGATGATTT TTTCACCGCC GTTGATGATT TGCGCCAGGC TTCTGAAGAT GCCGGTGCCG GGCATCTTGG TGAAACCGGA TTCGGTTCTG CACTGTTCTA CACCTATATC TGCATTGATA AAGATCTGCT GGTCGAAAAC CTTGGCGGTG ACGAAGCGTT AGCTAATCAG ACCATACGTG CCTTTACGGA AGCCGCACTT AAAGTCTCCC CAACCGGCAA ACAGAACAGC TTTGCCAGCC GTGCCTACGC CTCCTGGGCG CTGGCAGAAA AAGGCACCGA TCAACCGCGT TCTCTGGCGG CAGCTTTCTA TGAACCCATT AATGGCACCC GTCAGTTAGA GGTGGCGGTG CAGCGTATTA CAACTCTTCG CGAAAATATG AATACGGTCT ACGAACAGAA GACCGATTAC GCAAGTTTTG ACGTGATGAA CAAACAGGGA AGCATGAAGG ACGTGCTGGA CTTTATCTGC GCGTAA
|
Protein sequence | MTTFIQLHLL TAYPAANLNR DDTGAPKTVV LGGATRLRVS SQSLKRAWRT SALFEQALAG HIGIRSGRIA REAATILIEK GIEEKKAIEW AAKIADYLGK AKNDKKPKDP LTNAETEQLV HISPAEFDAV KVLAHQLAEE KRAPKEEDLA LLRKDRMAVD IAMFGRMLAN KPEFNVEAAC QVAHAFGVSE TIVEDDFFTA VDDLRQASED AGAGHLGETG FGSALFYTYI CIDKDLLVEN LGGDEALANQ TIRAFTEAAL KVSPTGKQNS FASRAYASWA LAEKGTDQPR SLAAAFYEPI NGTRQLEVAV QRITTLRENM NTVYEQKTDY ASFDVMNKQG SMKDVLDFIC A
|
| |