Gene EcSMS35_2888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2888 
Symbolcse1 
ID6144590 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2956508 
End bp2958070 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content50% 
IMG OID641617757 
ProductCRISPR-associated Cse1 family protein 
Protein accessionYP_001744912 
Protein GI170683207 
COG category 
COG ID 
TIGRFAM ID[TIGR02547] CRISPR system CASCADE complex protein CasA/Cse1 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.447574 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTCGT TTTCACTTCT GACAACCCCG TGGTTGCCTG TTCGTTTCAA AGACGGAACG 
ACAGGCAAGC TGGCACCAGT CGATCTGGCT GATGAAAATG TTGTCGATAT CGCCGCACCG
CGAGCAGATT TACAAGGTGC GGCCTGGCAG TTTTTGCTGG GACTACTACA AACCAGCTTC
GCGCCAAAAG ATCATCGTCG TTGGGATGAT ATCTGGGAAG ACGGACTGGA AGCTGAAAAG
CTACGAGAAG CATTGCTGTC TTTAGAACAT GCTTTCCAGT TTGGGCCTGA TTCATCTTCA
TTTATGCAGG ATTTTGAGGC GCTCACGGGA GATAAAGTTC CGGTCGCTTC GCTACTGCCT
GAGATTCCCG GCTCTCAAAC GACGAAGTTT AATAAAGACC ATTTTATCAA GCGGGGTGTA
ACGGAATATC TGTGCCCCCA TTGCTTAGCA TTGGCCTTGT TCTCTTTACA GTTGAATGCA
CCCGCGGGCG GCAAAGGCTA TCGCACCGGT TTACGAGGCG GTGGGCCGAT GACGACTCTC
ATTGAATTAC AGGAATATCA GGGCAATCAA CAAACACCCT TGTGGCGCAA ACTGTGGCTC
AACGTGATGC CGCAGGATGA AGCCGACTTA CCGCTACCCA AAAAATTTGA CGATCTGATT
TTCCCCTGGC TAGGCCCGAC GCGTACCAGC GAACTGGTCG GTGCGGTGGT AACCCATGAT
CAGGTCAATA AACTCCAGGC GTACTGGGGA ATGCCGCGGC GTATACGTAT TGATTTTAAT
ACCACGACAG TCGGCAACTG CGATATTTGC GACGAGCAGA ACGACGCGCT CCTGAGCCTG
ATGACGACCA AAAATTATGG TGCCAATTAT GCCATGTGGC AGCATCCCTT AACGCCTTAC
CGTGTACCAC TTAAAGAGGG TGGCGAGTTT TACTCCGTTA AACCACAACC GGGCGGTTTA
ATCTGGCGCG ACTGGTTAGG CCTTATCGAA ACGGGTAAAT CAGAAAACAA TACGGAACTT
CCCGCGTTGG TGGTGAAACT CTTTAATGCC AGCAGTCTGA AACAGGCAAA AGTGGGCCTA
TGGGGATTTG GTTATGATTT TGACAACATG AAAGCGCGCT GTTGGTACGA ACACCATTTC
CCGCTACTGC TCAATAAAAA AGAAGGCCAG ATACCGAAGC TGCGGCTGGC TGCGCAAACG
GCTTCACGGA TTCTGAGTCT GTTACGGAGT GCATTGAAAG AAGCATGGTT CTCCGATCCA
AAAGGTGCAA GGGGTGATTT CAGTTTTGTG GATATCGACT TCTGGAACAA AACTCAGCAT
CGCTTCCTGA GGTTAGTGCG CCAAATTGAA GAAGGTCAGG ATGCGGATGA ATTACTCGGC
AAATGGCAAA AGGAAATGTG GTTATTCGCG CGTCAGGATT TTGACGAGCG TGTATTCACT
AATCCTTATG AGCCCGTTGA TTTGAAACGC GTCATGACCG CGCGCAAGAA ATATTTTACA
ACATCGGCGG AGAAGCAAAG TGCTAAAGCC GCCAGGGAGA AAAAGCAGGA GGCTGCTGAA
TGA
 
Protein sequence
MNSFSLLTTP WLPVRFKDGT TGKLAPVDLA DENVVDIAAP RADLQGAAWQ FLLGLLQTSF 
APKDHRRWDD IWEDGLEAEK LREALLSLEH AFQFGPDSSS FMQDFEALTG DKVPVASLLP
EIPGSQTTKF NKDHFIKRGV TEYLCPHCLA LALFSLQLNA PAGGKGYRTG LRGGGPMTTL
IELQEYQGNQ QTPLWRKLWL NVMPQDEADL PLPKKFDDLI FPWLGPTRTS ELVGAVVTHD
QVNKLQAYWG MPRRIRIDFN TTTVGNCDIC DEQNDALLSL MTTKNYGANY AMWQHPLTPY
RVPLKEGGEF YSVKPQPGGL IWRDWLGLIE TGKSENNTEL PALVVKLFNA SSLKQAKVGL
WGFGYDFDNM KARCWYEHHF PLLLNKKEGQ IPKLRLAAQT ASRILSLLRS ALKEAWFSDP
KGARGDFSFV DIDFWNKTQH RFLRLVRQIE EGQDADELLG KWQKEMWLFA RQDFDERVFT
NPYEPVDLKR VMTARKKYFT TSAEKQSAKA AREKKQEAAE