Gene ECH74115_4014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4014 
Symbolcas3 
ID6966553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3708787 
End bp3711486 
Gene Length2700 bp 
Protein Length899 aa 
Translation table11 
GC content50% 
IMG OID643387782 
ProductCRISPR-associated helicase Cas3 
Protein accessionYP_002272225 
Protein GI209398890 
COG category[R] General function prediction only 
COG ID[COG1203] Predicted helicases 
TIGRFAM ID[TIGR01587] CRISPR-associated helicase Cas3 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.317175 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAAAT ATCCTTTAAG TTTACTGAAG GATAAAAATA TTGTGACTTT CTTTGATTTC 
TGGGGAAAAA CCCGACGTGG CGAGAAAGAG GGTGGCGACG GCTATCACCT TCTTTGCTGG
CATTCGCTGG ATGTGGCCGC AATGGGCTAT TTAATGGTTA AAAGAAATTG CTTCGGGCTG
GCTGATTACT TTCGTCAATT AGGGATTTCT GACAAGGAAC AGGCGGCTCA ATTTTTCGCT
TGGTTGCTGT GCTGGCACGA TATTGGAAAA TTTGCCCGCT CTTTTCAGCA ACTTTACCTG
GCCCCTGAAC TCAAGATTCC GGAAGGTTCC AGAAAGAATT ACGAAAAGAT CTCTCATTCA
ACGCTGGGTT ACTGGCTGTG GAATTATTAT TTAAGTGAAT GTGAGGAGTT GCTTCCTTCA
TCTTCACTCT CTTCTCGTAA ACTTACACGT GTAATAGAGA TGTGGATGTC CATAACTACC
GGGCATCATG GTCGACCACC TGACCGTATT GATGAGCTGG ATAATTTTCT GCCTGAAGAC
AAAGCTGCCG CGCGAGATTT TCTCCTTGAA ATCAAGGCAC TGTTTCCGCT CATAGAGATT
CCCACATTCT GGGATGATGA CGAGGGCGTT GAACTTTTAA AACAACTTTC CTGGTATATC
TCTGCAACAG TCGTACTCGC AGACTGGACG GGTTCGTCAA CGCGATTTTT TCCACGCGTC
GCACACCCAA TGGATATTAA AGATTACTGG CAGAAAACTT TAGTTCAGGC TCAAAACGCC
TTAACCGTCT TTCCTCCAAA AGCAGAAACC GCACCTTTCA CCGGAATTAA TACGCTGTTT
CCTTTTATTG AGCACCCGAC ACCATTACAG CAAAAGGTAC TGGATCTGGA TATCAGCCAG
CCAGGGCCAC AGTTATTTAT TCTGGAAGAC GTGACTGGCG CAGGTAAAAC AGAAGCGGCG
CTTATCCTGG CGCACAGGTT GATGGCTGCG AGGAAAGCAC AGGGTTTGTT TTTTGGCCTG
CCAACAATGG CAACGGCCAA TGCCATGTAC GATCGGCTGG TCAAAACCTG GCTTGCTTTC
TATTCGCCAG AGTCCCGCCC CAGCTTGGTG CTGGCACACA GTGCCCGCAC ATTAATGGAC
CGCTTCAATG AATCACTCTG GTCCGGTGAT TTAGTCGGGT CAGAAGAACC GGATGAACAA
ACATTCAGTC AGGGATGTGC GGCCTGGTTT GCCAACAGTA ACAAGAAGGC GCTACTGGCT
GAAATTGGCG TCGGCACGCT GGATCAGGCG ATGATGGCAG TGATGCCGTT TAAACATAAT
AATCTGCGGC TTCTGGGGTT GAGTAACAAA ATCCTGCTGG CTGATGAGAT CCATGCCTGT
GATGCTTACA TGTCGTGCAT TCTTGAAGGG CTGATCGAGC GGCAGGCGCG TGGCGGAAAC
AGCGTCATTT TGCTTTCTGC TACGTTATCC CAACAGCAGC GCGACAAACT CGTCGCCGCC
TTTGCGCGTG GCACAGAGGG CCAGCAAGAA GCTCCGTTCC TTGAAAAGGA TGATTACCCC
TGGCTGACGC ATGTCACGAA ATCCGATGTG AACTCACACC GGGTAGCGAC GCGCAAAGAC
GTTGAGCGTA GCGTCAGCGT GGGTTGGCTT CATAGTGAAC AAGAGAGTAT TGCGCGTATC
GAATCGGCGG TAAGTCAGGG AAAATGCATC GCCTGGATCC GGAATTCTGT CGATGACGCT
ATTAAGGTTC ATCGTCAGCT GCTTGCCCGC GGCGTCATTC CCGCTTCCAG CCTTTCACTC
TTTCATAGCC GCTTTGCTTT TAGCGATCGC CAGCGAATTG AAATGGAGAC GCTGGCACGC
TTTGGTAAAG AAGACGGTTC ACAGCGTGCC GGAAAAGTCC TCATTTGTAC TCAGGTCTTA
GAGCAGAGCG TTGATTGTGA CCTGGACGAA ATGATCTCCG ACCTGGCCCC TGTTGATTTG
CTGATTCAGC GAGCGGGGCG ATTACAGCGG CATATCCGCG ATATTAATGG TCAGTTAAAG
CGTGACGGAA AAGACGAGCG TTCCCCTCCT GAATTGCTGA TTCTGGCCCC CGTCTGGGAC
GACGCTCCTG GTGACGAATG GTTCGGCAGT GCCATGCGTA ACAGTGCATA TGTCTATCCC
GATCATGGAC GAATCTGGCT GACGCAGCGT GTACTGCGTG AGCAAGGCGC TATTCAAATG
CCACACGCAG CCCGCCTTCT TATTGAATCA GTCTACGGTG AGGACGTGGT AATGCCGGAA
GGATTTGCCC GCAGCGAGCA GGAGCAAGTG GGCAAATATT ACTGCGATCG CGCAATGGCT
AAAAAGTTTG TCCTGAACTT CAAGCCTGGC TATGCCGCCA ATATCAACGA TTACCTTCCG
GAAAAGCTGT CGACACGTCT GGCTGAGGAA TCTGTTTCCC TGTGGCTGGC TACCTGTATT
GCCGGTGTGG TGAAGCCTTA TGCCACCGGT GCTCACGCAT GGGAAATGAG CGTTGTCAGA
GTGCGTCGAA GCTGGTGGAA AAAACATCGG GATGAGTTTT CTTTACTGGA AGGGGAAGCG
TTCAGGCAGT GGTGCATTGA ACAGCGGCAA GATCCGGAAA TGGCAAACGT GATTTTAGTC
ACTGATGACG AAAGTTGCGG GTATTCGGCC AGGGAGGGAT TGATTGGCAA GGTTGATTGA
 
Protein sequence
MRKYPLSLLK DKNIVTFFDF WGKTRRGEKE GGDGYHLLCW HSLDVAAMGY LMVKRNCFGL 
ADYFRQLGIS DKEQAAQFFA WLLCWHDIGK FARSFQQLYL APELKIPEGS RKNYEKISHS
TLGYWLWNYY LSECEELLPS SSLSSRKLTR VIEMWMSITT GHHGRPPDRI DELDNFLPED
KAAARDFLLE IKALFPLIEI PTFWDDDEGV ELLKQLSWYI SATVVLADWT GSSTRFFPRV
AHPMDIKDYW QKTLVQAQNA LTVFPPKAET APFTGINTLF PFIEHPTPLQ QKVLDLDISQ
PGPQLFILED VTGAGKTEAA LILAHRLMAA RKAQGLFFGL PTMATANAMY DRLVKTWLAF
YSPESRPSLV LAHSARTLMD RFNESLWSGD LVGSEEPDEQ TFSQGCAAWF ANSNKKALLA
EIGVGTLDQA MMAVMPFKHN NLRLLGLSNK ILLADEIHAC DAYMSCILEG LIERQARGGN
SVILLSATLS QQQRDKLVAA FARGTEGQQE APFLEKDDYP WLTHVTKSDV NSHRVATRKD
VERSVSVGWL HSEQESIARI ESAVSQGKCI AWIRNSVDDA IKVHRQLLAR GVIPASSLSL
FHSRFAFSDR QRIEMETLAR FGKEDGSQRA GKVLICTQVL EQSVDCDLDE MISDLAPVDL
LIQRAGRLQR HIRDINGQLK RDGKDERSPP ELLILAPVWD DAPGDEWFGS AMRNSAYVYP
DHGRIWLTQR VLREQGAIQM PHAARLLIES VYGEDVVMPE GFARSEQEQV GKYYCDRAMA
KKFVLNFKPG YAANINDYLP EKLSTRLAEE SVSLWLATCI AGVVKPYATG AHAWEMSVVR
VRRSWWKKHR DEFSLLEGEA FRQWCIEQRQ DPEMANVILV TDDESCGYSA REGLIGKVD