Gene ECH74115_4011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4011 
Symbolcse4 
ID6967722 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3705527 
End bp3706582 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content51% 
IMG OID643387779 
ProductCRISPR-associated protein, Cse4 family 
Protein accessionYP_002272222 
Protein GI209399904 
COG category 
COG ID 
TIGRFAM ID[TIGR01869] CRISPR system CASCADE complex protein CasC/Cse4 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.453377 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACAT TTATTCAGCT TCATTTGTTA ACCGCTTACC CTGCAGCCAA CCTTAACCGT 
GATGACACCG GAGCGCCGAA AACCGTGGTC CTGGGTGGAG CAACGCGACT GCGCGTTTCC
TCGCAAAGTC TGAAACGTGC GTGGCGCACT TCTGCACTTT TTGAACAGGC ACTGGCGGGC
CATATTGGTA TTCGCAGTGG GCGTATTGCG CGTGAGGCGG CAACTATCCT GATTGAGAAA
GGCATCGAAG AGAAAAAAGC CATCGAATGG GCGGCAAAAA TTGCGGATTA TCTTGGGAAA
GCTAAAAACG ACAAAAAACC AAAAGATCCG CTCACTAACG CCGAAACTGA ACAATTAGTC
CATATCAGCC CGGCAGAATT TGACGCCGTA AAAGCGCTGG CCCATCAACT GGCCGAAGAA
AAGCGCGCGC CAAAAGAGGA AGATCTCGCT TTGTTACGTA AAGATCGCAT GGCAGTAGAT
ATCGCTATGT TTGGTCGTAT GCTGGCGAAT AAACCCGAAT TTAATGTTGA AGCCGCCTGC
CAGGTCGCGC ATGCATTTGG TGTCAGTGAA ACGATTGTCG AAGATGATTT TTTCACCGCC
GTTGATGATT TGCGCCAGGC TTCTGAAGAT GCCGGTGCCG GGCATCTTGG TGAAACCGGA
TTCGGTTCTG CGCTGTTCTA CACCTATATC TGCATCGATA AAGATCTGCT GGTCGAAAAC
CTCGGCGGTG ACGAAGCGTT AGCTAATCAG ACCTTGCGCG CCTTTACGGA AGCCGCACTT
AAAGTCTCCC CAACCGGCAA ACAGAACAGC TTTGCCAGCC GTGCCTACGC CTCCTGGGCG
CTGGCAGAAA AAGGCACCGA ACAACCACGT TCTCTGGCGG CGGCTTTCTA TGAACCCATT
AATGGCACCC GGCAGTTAGA TGTGGCGGTG CAGCGTATTA CAACGCTTCG CGAAAATATG
AATACGGTCT ATGAACAGAA GACCGAATGC GCAAGCTTTG ACGTGATGAA CAAACAGGGA
AGCATGAAGG ACGTGCTGGA CTTTATCTGC GCGTAA
 
Protein sequence
MTTFIQLHLL TAYPAANLNR DDTGAPKTVV LGGATRLRVS SQSLKRAWRT SALFEQALAG 
HIGIRSGRIA REAATILIEK GIEEKKAIEW AAKIADYLGK AKNDKKPKDP LTNAETEQLV
HISPAEFDAV KALAHQLAEE KRAPKEEDLA LLRKDRMAVD IAMFGRMLAN KPEFNVEAAC
QVAHAFGVSE TIVEDDFFTA VDDLRQASED AGAGHLGETG FGSALFYTYI CIDKDLLVEN
LGGDEALANQ TLRAFTEAAL KVSPTGKQNS FASRAYASWA LAEKGTEQPR SLAAAFYEPI
NGTRQLDVAV QRITTLRENM NTVYEQKTEC ASFDVMNKQG SMKDVLDFIC A