Gene EcSMS35_2889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2889 
Symbolcas3 
ID6147428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2958168 
End bp2960867 
Gene Length2700 bp 
Protein Length899 aa 
Translation table11 
GC content51% 
IMG OID641617758 
ProductCRISPR-associated helicase Cas3 
Protein accessionYP_001744913 
Protein GI170680605 
COG category[R] General function prediction only 
COG ID[COG1203] Predicted helicases 
TIGRFAM ID[TIGR01587] CRISPR-associated helicase Cas3 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0793506 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAAAT ATCCTTTAAG TTTACTGAAG GATAAAAATA TTGTGACTTT TTTTGATTTC 
TGGGGAAAAA CCCGACGCGG CGAGAAAGAC GGGGGCGACG ACTATCACCT TCTTTGCTGG
CATTCGCTGG ATGTAGCCGC GATGGGCTAT TTAATGGTTA AAAGCAATTG CTTCGGACTG
GCTGACTATT TTCGTCAATT AGGTTTCGCT GACACGGAGC AGGCAGCACA ATTTTTCGCC
TGGCTGCTGT GCTGGCACGA TATTGGAAAA TTTGCCCGCT CTTTTCAGCA ACTTTACCTG
CACCCACAAC TCAAGGTCCC GGAAGGCGCA AGAAAGAATT ACGAAAAGAT CTCTCATTCA
ACGCTGGGCT ACTGGTTGTG GAACCATTAT TTAAGTGAAT GTCAGGAGTT GCTTCCTTCA
TCTTCACTCT CTCCGCGCAA ACTTAAACGG GTGATGGAGA TGTGGATGCC CATGACTACC
GGGCATCATG GTCGACCCCC TGAGCGTATG GATGAACTGG ATAATTTTCT GCCTGAAGAC
AAAGGTGCCG CACGAGATTT TCTCCTTGCA ATCAAGGTAC TGTTTCCGCT CATAGAGATC
CCCACATTCT GGGATGATGA CGAGGGCGTT GAACTTATAA AACAACTTTC CTGGTATATC
TCTGCAACCG TCGTACTTGC AGACTGGACG GGATCGTCAA CGCGATATTT TCCACGCGTC
GCACAGACAA TGGATATTGA AGATTACTGG CAGAAAACTT TAGTTCAGGC TCAAAACGCC
TTAACCGTCT TTCCTCCAAA AGCAGAAAGC GCGCCTTTCA CCGGAATTAA TACGCTGTTT
CCTTTTATTG AGAATCCGAC ACCGCTACAG CAAAAGGTGC TGGATCTGGA TATCAGCCAG
CCGGGACCAC AGTTATTTAT TCTGGAGGAT GTAACCGGCG CAGGTAAAAC AGAAGCGGCG
CTTATCCTGG CGCACAGGTT GATGGCTGCG GGGAAAGCGC AGGGTTTGTT TTTTGGCCTG
CCAACAATGG CAACGGCCAA TGCCATGTAC GATCGACTGG TCAAAACCTG GCTTGCTTTC
TATTCGCCAG AGTCCCGCCC CAGCCTGGTG CTGGCACACA GTGCCCGAAC ATTAATGGAC
CGCTTTAATG AATCACTCTG GTCCGGTGAT TTAGTCGGGT CAGAAGAACC GGATGAACAA
ACGTTCAGTC AGGGATGTGC GGCCTGGTTT GCCGACAGTA ACAAGAAGGC GCTACTGGCT
GAAATTGGCG TCGGCACGCT GGATCAGGCG ATGATGGCTG TGATGCCGTT CAAACATAAT
AATCTGCGGC TTCTGGGACT GAGTAACAAA ATCCTGCTGG CTGATGAGAT CCATGCCTGT
GACGCTTACA TGTCATGCAT TCTTGAAGGG CTGATCGAGC GGCAGGCGCG TGGCGGAAAC
AGCGTCATTT TGCTTTCTGC CACGTTATCT CAACAGCAGC GCGACAAACT CGTTGCTGCC
TTTGCGCGTG GTATAGAGGG CCAGCAAGAA GCTCCGTTCC TGGAAAAGGA TGATTACCCC
TGGCTGACGC ACGTTACGAA ATCCGATGTG CATTCACACC GGGTAGCGAC GCGCAAAGAC
GTTGAGCGCA GCGTGAGTGT GGGTTGGCTT CATAGCGAAC AAGAGTGTAT TGCGCGTATC
GCATCGGCGG TAAGTCAGGG GAAATGTATC GCCTGGATCC GAAATTCTGT CGATGACGCC
ATCCAGGTTC ATCGTCAGTT ACTTGCCCGC GGCGTCATTC CTGCTTCCAG CCTTTCACTT
TTTCATAGCC GCTTTGCTTT TAGCGATCGC CAGCGAATTG AAACGGAGAC GCTGGCACGC
TTTGGTAAAG AAGACTGTTC ACAGCGTGCC GGAAAAGTCC TCATTTGTAC TCAAGTCCTG
GAGCAGAGCG TTGACTGCGA CCTGGACGAA ATGATTTCCG ACCTTGCCCC TGTTGATTTA
CTGATCCAGC GAGCAGGGCG ATTACAGCGA CATATCCGCG ATATGAATGG CCAGTTAAAG
CGTGACGGTA AAGACGAGCG TCACCCTCCT GAATTGCTGA TTCTGGCCCC CGTCTGGGAC
GACTCTCCGG GTGACGAATG GTTTGGCAGT GCCATGCGTA ACAGTGCATT TGTCTATCCC
GATCATGGGC GAATCTGGCT GACGCAGCGT GTGCTGCGTG AGCAAGGTGC TATTCAGATG
CCACACTCAG CCCGGCTTCT GATTGAATCA GTCTACGGTG AGGACGTAGT AATGCCGGAA
GGGTTTGCCC GCAGCGAGCA GGAGCAAGTG GGCAAATATT ACTGCGATCG CGCAATGGCT
AAAAAGTTTG TACTGAACTT CAGGCCCGGC TATGCCGCCA ATATCAACGA TTACCTCCCG
GAAAAACTGT CGACACGTCT GGCTGAAGAG TCTGTTTCCC TGTGGCTGGC TACCTGTATT
GACGGTGTGG TGAAGCCTTA TGCCACAGGT GCACACGCAT GGGAAATGAG CGTTGTCAGA
GTGCGCCGAA GCTGGTGGAA AAAACATCGG GATGAGTTTT CTTTACTGGA AGGGGATGCG
TTCAGGCAGT GGTGCGTTGA ACAGCGGCAT GATCCGGAGA TGGCAAACGT GATTTTAGTC
ACCGATGACG AAAGTTGTGG GTATTCGGCT ACGGAGGGAT TGATTGGCAA GGTTGGTTGA
 
Protein sequence
MRKYPLSLLK DKNIVTFFDF WGKTRRGEKD GGDDYHLLCW HSLDVAAMGY LMVKSNCFGL 
ADYFRQLGFA DTEQAAQFFA WLLCWHDIGK FARSFQQLYL HPQLKVPEGA RKNYEKISHS
TLGYWLWNHY LSECQELLPS SSLSPRKLKR VMEMWMPMTT GHHGRPPERM DELDNFLPED
KGAARDFLLA IKVLFPLIEI PTFWDDDEGV ELIKQLSWYI SATVVLADWT GSSTRYFPRV
AQTMDIEDYW QKTLVQAQNA LTVFPPKAES APFTGINTLF PFIENPTPLQ QKVLDLDISQ
PGPQLFILED VTGAGKTEAA LILAHRLMAA GKAQGLFFGL PTMATANAMY DRLVKTWLAF
YSPESRPSLV LAHSARTLMD RFNESLWSGD LVGSEEPDEQ TFSQGCAAWF ADSNKKALLA
EIGVGTLDQA MMAVMPFKHN NLRLLGLSNK ILLADEIHAC DAYMSCILEG LIERQARGGN
SVILLSATLS QQQRDKLVAA FARGIEGQQE APFLEKDDYP WLTHVTKSDV HSHRVATRKD
VERSVSVGWL HSEQECIARI ASAVSQGKCI AWIRNSVDDA IQVHRQLLAR GVIPASSLSL
FHSRFAFSDR QRIETETLAR FGKEDCSQRA GKVLICTQVL EQSVDCDLDE MISDLAPVDL
LIQRAGRLQR HIRDMNGQLK RDGKDERHPP ELLILAPVWD DSPGDEWFGS AMRNSAFVYP
DHGRIWLTQR VLREQGAIQM PHSARLLIES VYGEDVVMPE GFARSEQEQV GKYYCDRAMA
KKFVLNFRPG YAANINDYLP EKLSTRLAEE SVSLWLATCI DGVVKPYATG AHAWEMSVVR
VRRSWWKKHR DEFSLLEGDA FRQWCVEQRH DPEMANVILV TDDESCGYSA TEGLIGKVG