Gene ECH74115_4010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4010 
Symbolcas5e 
ID6970641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3704770 
End bp3705516 
Gene Length747 bp 
Protein Length248 aa 
Translation table11 
GC content55% 
IMG OID643387778 
ProductCRISPR-associated protein Cas5 
Protein accessionYP_002272221 
Protein GI209396791 
COG category 
COG ID 
TIGRFAM ID[TIGR01868] CRISPR system CASCADE complex protein CasD/Cas5e
[TIGR02593] CRISPR-associated protein Cas5, N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAAT ATTTGATTTT TCAGCTTCAT GGGCCAATGG CATCCTGGGG CGTCGATGCC 
CCCGGCGAAG TGCGTCATAC CCATGAACTG CCTTCGCGCT CAGCATTGCT GGGGCTGCTG
GCTGCCGGGG TAGGGATTCG GCGTGATGAT ACCGAACGAT TAAACGCGTT TAACCGCCAC
TATTCACTGG TGGTTTGCGC CAGCCGTAAC CCGCGCTGGG CACGGGATTA TCACACGGTC
CAGATGCCAA AAGAGGTGCG TAAAGCGCGT TATTTCAGCC GTCGCGAAGA GTTGAGCGCC
CCTGATCTTC TGAGCGCGAT TATCTCCCGG CGCGACTACT ACACCGATGC CTGGTGGATG
GTGGCGGTGG CAACAACCCC CGATGCGCCT TACAGCCTTG AACAGTTGCA GGACGGTTTA
CGTCATCCGG TTTTTCCGCT TTATCTGGGG CGAAAAAGTC ATCCGCTGGC GTTACCACTT
GCGCCGTTAC TGCTCGAAGG CAACGCGTCT GATGTCTTAC GTAACGCATA CCAACAGTAT
CAGGATAGTT TCCGCGAACT GAAAGTCTCA CTCCCGAAAC TTCAGGATGA ATGCTGGTGG
GAAGGGGAAC ACGACGGCCT GGTCGCGAGC AAAATATTAC GTCGCCGGGA TGTTCCTTTA
AATCGTCAGC AGTGGCTGTT TGGGGAACGC ACCATCAATC AGGGGCCGTG GCTCAGCAAG
GAGGAACCAT GTACCTCTCA AGAATAA
 
Protein sequence
MSQYLIFQLH GPMASWGVDA PGEVRHTHEL PSRSALLGLL AAGVGIRRDD TERLNAFNRH 
YSLVVCASRN PRWARDYHTV QMPKEVRKAR YFSRREELSA PDLLSAIISR RDYYTDAWWM
VAVATTPDAP YSLEQLQDGL RHPVFPLYLG RKSHPLALPL APLLLEGNAS DVLRNAYQQY
QDSFRELKVS LPKLQDECWW EGEHDGLVAS KILRRRDVPL NRQQWLFGER TINQGPWLSK
EEPCTSQE