Gene Information |
Locus tag | EcHS_A2897 |
Symbol | cse4 |
ID | 5592513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2898284 |
End bp | 2899375 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640922014 |
Product | CRISPR-associated Cse4 family protein |
Protein accession | YP_001459525 |
Protein GI | 157162207 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01869] CRISPR system CASCADE complex protein CasC/Cse4 |
|
|
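The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the Start bp and End bp values are 1-based and inclusive (which is consistent with the listed Gene Length) and that the CDS is unspliced and ends in a stop codon. The function names are hypothetical and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag EcHS_A2897.
length = gene_length(2898284, 2899375)
print(length)                  # 1092
print(protein_length(length))  # 363
```

Both results agree with the Gene Length (1092 bp) and Protein Length (363 aa) fields listed in the record.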
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 0.317201 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTAACT TTATCAATAT TCATGTTCTG ATCTCTCACA GCCCTTCATG TCTGAACCGC GACGATATGA ACATGCAGAA AGACGCTATT TTCGGCGGGA AAAGACGAGT AAGAATTTCA AGTCAAAGCC TTAAACGTGC GATGCGTAAA AGTGATTATT ATGCACAAAA TATTGGTGAA TCCAGTCTCA GAACAATTCA TCTTGCACAA TTACGTGATG TTCTTCGGCA AAAACTTGGT GAACGTTTTG AGCAAAAAAT CATCGATAAG ACATTAGCTC TGCTCTCCGG TAAATCAGTT GATGAAGCCG AAAAGATTTC TGCCGATGCG GTTACTCCCT GGGTTGTGGG AGAAATAGCC TGGTTCTGTG AGCAGGTTGC AAAAGCAGAG GCTGATAATC TGGATGATAA AAAGCTGCTC AAAGTTCTTA AGGAAGATAT TGCCGCCATA CGTGTGAATT TACAGCAGGG TGTTGATATT GCGCTTAGTG GAAGAATGGC AACCAGCGGC ATGATGAGTG AGTTGGGAAA AGTTGATGGT GCAATGTCCA TTGCGCATGC GATCACTACC CATCAGGTTG ATTCTGATAT TGACTGGTTC ACCGCCGTAG ATGATTTACA GGAACAAGGT TCTGCACATC TGGGAACACA GGAATTTTCA TCGGGTGTTT TTTATCGTTA TGCCAATATT AACCTCGCTC AACTTCAGGA AAATTTAGGC GGTGCCTCCA GGGAGCAGGC TCTGGAAATT GCAACCCATG TTGTTCATAT GCTGGCAACA GAGGTCCCTG GAGCAAAACA GCGTACTTAT GCCGCTTTTA ACCCTGCGGA TATGGTAATG GTTAATTTCT CCGATATGCC ACTTTCTATG GCAAATGCTT TTGAAAAAGC GGTGAAAGCG AAAGATGGCT TTTTGCAACC GTCTTTACAG GCGTTTAATC AATATTGGGA TCGCGTTGCC AATGGATATG GTCTGAACGG AGCCGCTGCG CAATTCAGCT TGTCTGATGT AGACCCTATT ACTGGTCAGG TACAGCAAAT GCCTACTTTA GAACAGTTAA AGTCCTGGGT TCGTAATAAT GGCGAGGCGT GA
|
Protein sequence | MSNFINIHVL ISHSPSCLNR DDMNMQKDAI FGGKRRVRIS SQSLKRAMRK SDYYAQNIGE SSLRTIHLAQ LRDVLRQKLG ERFEQKIIDK TLALLSGKSV DEAEKISADA VTPWVVGEIA WFCEQVAKAE ADNLDDKKLL KVLKEDIAAI RVNLQQGVDI ALSGRMATSG MMSELGKVDG AMSIAHAITT HQVDSDIDWF TAVDDLQEQG SAHLGTQEFS SGVFYRYANI NLAQLQENLG GASREQALEI ATHVVHMLAT EVPGAKQRTY AAFNPADMVM VNFSDMPLSM ANAFEKAVKA KDGFLQPSLQ AFNQYWDRVA NGYGLNGAAA QFSLSDVDPI TGQVQQMPTL EQLKSWVRNN GEA
|
| |
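The sequence fields above are printed in space-separated blocks of ten characters. The following is a minimal sketch, under stated assumptions, of how such a field could be cleaned, translated, and checked for GC content. It assumes the codon-to-amino-acid assignments of translation table 11 match the standard code (table 11 differs in which codons may act as start codons, and this gene starts with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
import itertools

# Codon table in the conventional TCAG ordering; amino acid assignments
# are assumed identical for the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO)
}


def clean(raw: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(cds: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the Gene sequence field from the record.
raw = "ATGTCTAACT TTATCAATAT TCATGTTCTG"
cds = clean(raw)
print(translate(cds))  # MSNFINIHVL
print(gc_content(cds))
```

The translated prefix matches the first block of the Protein sequence field. Passing the full Gene sequence field through the same functions would allow the result to be compared against the listed protein sequence and GC content value.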