Gene EcHS_A2897 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2897 
Symbolcse4 
ID5592513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2898284 
End bp2899375 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content44% 
IMG OID640922014 
ProductCRISPR-associated Cse4 family protein 
Protein accessionYP_001459525 
Protein GI157162207 
COG category 
COG ID 
TIGRFAM ID[TIGR01869] CRISPR system CASCADE complex protein CasC/Cse4 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value0.317201 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAACT TTATCAATAT TCATGTTCTG ATCTCTCACA GCCCTTCATG TCTGAACCGC 
GACGATATGA ACATGCAGAA AGACGCTATT TTCGGCGGGA AAAGACGAGT AAGAATTTCA
AGTCAAAGCC TTAAACGTGC GATGCGTAAA AGTGATTATT ATGCACAAAA TATTGGTGAA
TCCAGTCTCA GAACAATTCA TCTTGCACAA TTACGTGATG TTCTTCGGCA AAAACTTGGT
GAACGTTTTG AGCAAAAAAT CATCGATAAG ACATTAGCTC TGCTCTCCGG TAAATCAGTT
GATGAAGCCG AAAAGATTTC TGCCGATGCG GTTACTCCCT GGGTTGTGGG AGAAATAGCC
TGGTTCTGTG AGCAGGTTGC AAAAGCAGAG GCTGATAATC TGGATGATAA AAAGCTGCTC
AAAGTTCTTA AGGAAGATAT TGCCGCCATA CGTGTGAATT TACAGCAGGG TGTTGATATT
GCGCTTAGTG GAAGAATGGC AACCAGCGGC ATGATGAGTG AGTTGGGAAA AGTTGATGGT
GCAATGTCCA TTGCGCATGC GATCACTACC CATCAGGTTG ATTCTGATAT TGACTGGTTC
ACCGCCGTAG ATGATTTACA GGAACAAGGT TCTGCACATC TGGGAACACA GGAATTTTCA
TCGGGTGTTT TTTATCGTTA TGCCAATATT AACCTCGCTC AACTTCAGGA AAATTTAGGC
GGTGCCTCCA GGGAGCAGGC TCTGGAAATT GCAACCCATG TTGTTCATAT GCTGGCAACA
GAGGTCCCTG GAGCAAAACA GCGTACTTAT GCCGCTTTTA ACCCTGCGGA TATGGTAATG
GTTAATTTCT CCGATATGCC ACTTTCTATG GCAAATGCTT TTGAAAAAGC GGTGAAAGCG
AAAGATGGCT TTTTGCAACC GTCTTTACAG GCGTTTAATC AATATTGGGA TCGCGTTGCC
AATGGATATG GTCTGAACGG AGCCGCTGCG CAATTCAGCT TGTCTGATGT AGACCCTATT
ACTGGTCAGG TACAGCAAAT GCCTACTTTA GAACAGTTAA AGTCCTGGGT TCGTAATAAT
GGCGAGGCGT GA
 
Protein sequence
MSNFINIHVL ISHSPSCLNR DDMNMQKDAI FGGKRRVRIS SQSLKRAMRK SDYYAQNIGE 
SSLRTIHLAQ LRDVLRQKLG ERFEQKIIDK TLALLSGKSV DEAEKISADA VTPWVVGEIA
WFCEQVAKAE ADNLDDKKLL KVLKEDIAAI RVNLQQGVDI ALSGRMATSG MMSELGKVDG
AMSIAHAITT HQVDSDIDWF TAVDDLQEQG SAHLGTQEFS SGVFYRYANI NLAQLQENLG
GASREQALEI ATHVVHMLAT EVPGAKQRTY AAFNPADMVM VNFSDMPLSM ANAFEKAVKA
KDGFLQPSLQ AFNQYWDRVA NGYGLNGAAA QFSLSDVDPI TGQVQQMPTL EQLKSWVRNN
GEA