Gene ECH74115_4013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4013 
Symbolcse1 
ID6968976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3707127 
End bp3708689 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content50% 
IMG OID643387781 
ProductCRISPR-associated protein, Cse1 family 
Protein accessionYP_002272224 
Protein GI209395699 
COG category 
COG ID 
TIGRFAM ID[TIGR02547] CRISPR system CASCADE complex protein CasA/Cse1 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.246071 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTCGT TTTCACTTCT GACAACCCCG TGGTTGCCCG TTCGTTTTAA AGACGGAACA 
ACAGGCAAGC TGGCGCCAGT CGATCTGGCG GATGAAAATG TTGTCGATAT CGCTGCGCCG
CGGGCAGATC TCCAGGGGGC GGCATGGCAG TTTTTGCTGG GGTTACTACA AAGCAGTTTC
GCGCCAAAAG ATTATCGTCG TTGGGATGAT ATCTGGGAAG ACGGGCTGGA AGCTGAAAAG
CTACGGGAAG CATTGCTGTC ATTAGAACAC CCTTTCCAGT TTGGCCCAGA TTCACCTTCA
TTTATGCAGG ATTTCGAGGT GCTCATGGGC GATAAAGTTC AGGTCGCTTC GCTACTGCCT
GAGATTCCCG GCGCTCAAAC AACGAAGTTT AATAAAGACC ACTTTATTAA GCGTGGCGTG
ACTGAACACG TATGCTCTCA TTGTTCTGCG TTAGCTCTGT TCTCCCTACA GTTAAATGCG
CCGTCAGGTG GCAAAGGCTA TCGCACCGGT TTACGCGGCG GTGGGCCGAT GACGACTCTG
ATTGAATTGC AGGAGTATCA GGGCAATCAA CAAGCCCCCT TGTGGCGCAA ACTGTGGCTC
AACGTGATGC CGCAGGATGA AGCCGACTTA CCGCTACCCA AAAAATTTGA CGATCTGGTT
TTCCCCTGGC TTGGCCCGAC GCGTACCAGC GAACTGGCCG GTGCGGTGGT AACCGATGAT
CAGGTCAATA AACTCCAGGC GTACTGGGGA ATGCCGCGGC GTATTCGTAT TGATTTTAAT
ACCACGACAG TCGGCAACTG CGATATTTGC GGTGAGCAGA GTGACGCGCT TCTGAGTTTG
ATGACTACCA AAAATTACGG TGCGAATTAT GCCATGTGGC AGCATCCCTT AACGCCTTAC
CGTGTACCAC TTAAAGAGGG CGGTGAGTTT TACTCCGTTA AACCACAACC GGGCGGTTTA
ATCTGGCGCG ACTGGTTAGG CCTTATCGAA ACGGGTAAGT CAGAAAACAA TACGGAACTT
CCCGCGCTGG TGGTGAAACT CTTTAATGCC AGCAGTCTGA AACAGGCAAA AGTGGGCCTG
TGGGGATTTG GTTATGATTT CGACAACATG AAAGCGCGCT GTTGGTACGA ACACCATTTC
CCGCTGCTGC TCAATAAAAA AGAAGGCCAG ATACCGAAGC TGCGGCTGGC TGCGCAAACG
GCTTCACGGA TTCTGAGTCT GTTACGGAGT GCATTGAAAG AAGCATGGTT CTCCGATCCA
AAAGGTGCAA GGGGTGATTT CAGTTTTGTG GATATCGACT TCTGGAACAA AACTCAGCAT
CGCTTCCTGA GGTTAGTGCG CCAAATTGAA GAAGGTCAGG ATGCGGATGA ATTACTCGGC
AAATGGCAAA AGGAAATTTG GTTATTCGCA CGTCAGGATT TTGACGAGCG TGTATTCACC
AATCCTTATG AGCCCGTTGA TTTGGAACGC GTCATGACCG CGCGCAAGAA ATATTTTACA
ACATCGGCGG AGAAGCAAAG TGCTAAAGCC GCCAGGGAGA AAAAGCAGGA GGCTGCTGAA
TGA
 
Protein sequence
MNSFSLLTTP WLPVRFKDGT TGKLAPVDLA DENVVDIAAP RADLQGAAWQ FLLGLLQSSF 
APKDYRRWDD IWEDGLEAEK LREALLSLEH PFQFGPDSPS FMQDFEVLMG DKVQVASLLP
EIPGAQTTKF NKDHFIKRGV TEHVCSHCSA LALFSLQLNA PSGGKGYRTG LRGGGPMTTL
IELQEYQGNQ QAPLWRKLWL NVMPQDEADL PLPKKFDDLV FPWLGPTRTS ELAGAVVTDD
QVNKLQAYWG MPRRIRIDFN TTTVGNCDIC GEQSDALLSL MTTKNYGANY AMWQHPLTPY
RVPLKEGGEF YSVKPQPGGL IWRDWLGLIE TGKSENNTEL PALVVKLFNA SSLKQAKVGL
WGFGYDFDNM KARCWYEHHF PLLLNKKEGQ IPKLRLAAQT ASRILSLLRS ALKEAWFSDP
KGARGDFSFV DIDFWNKTQH RFLRLVRQIE EGQDADELLG KWQKEIWLFA RQDFDERVFT
NPYEPVDLER VMTARKKYFT TSAEKQSAKA AREKKQEAAE