Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4014 |
Symbol | cas3 |
ID | 6966553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3708787 |
End bp | 3711486 |
Gene Length | 2700 bp |
Protein Length | 899 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643387782 |
Product | CRISPR-associated helicase Cas3 |
Protein accession | YP_002272225 |
Protein GI | 209398890 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR01587] CRISPR-associated helicase Cas3 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.317175 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAAAT ATCCTTTAAG TTTACTGAAG GATAAAAATA TTGTGACTTT CTTTGATTTC TGGGGAAAAA CCCGACGTGG CGAGAAAGAG GGTGGCGACG GCTATCACCT TCTTTGCTGG CATTCGCTGG ATGTGGCCGC AATGGGCTAT TTAATGGTTA AAAGAAATTG CTTCGGGCTG GCTGATTACT TTCGTCAATT AGGGATTTCT GACAAGGAAC AGGCGGCTCA ATTTTTCGCT TGGTTGCTGT GCTGGCACGA TATTGGAAAA TTTGCCCGCT CTTTTCAGCA ACTTTACCTG GCCCCTGAAC TCAAGATTCC GGAAGGTTCC AGAAAGAATT ACGAAAAGAT CTCTCATTCA ACGCTGGGTT ACTGGCTGTG GAATTATTAT TTAAGTGAAT GTGAGGAGTT GCTTCCTTCA TCTTCACTCT CTTCTCGTAA ACTTACACGT GTAATAGAGA TGTGGATGTC CATAACTACC GGGCATCATG GTCGACCACC TGACCGTATT GATGAGCTGG ATAATTTTCT GCCTGAAGAC AAAGCTGCCG CGCGAGATTT TCTCCTTGAA ATCAAGGCAC TGTTTCCGCT CATAGAGATT CCCACATTCT GGGATGATGA CGAGGGCGTT GAACTTTTAA AACAACTTTC CTGGTATATC TCTGCAACAG TCGTACTCGC AGACTGGACG GGTTCGTCAA CGCGATTTTT TCCACGCGTC GCACACCCAA TGGATATTAA AGATTACTGG CAGAAAACTT TAGTTCAGGC TCAAAACGCC TTAACCGTCT TTCCTCCAAA AGCAGAAACC GCACCTTTCA CCGGAATTAA TACGCTGTTT CCTTTTATTG AGCACCCGAC ACCATTACAG CAAAAGGTAC TGGATCTGGA TATCAGCCAG CCAGGGCCAC AGTTATTTAT TCTGGAAGAC GTGACTGGCG CAGGTAAAAC AGAAGCGGCG CTTATCCTGG CGCACAGGTT GATGGCTGCG AGGAAAGCAC AGGGTTTGTT TTTTGGCCTG CCAACAATGG CAACGGCCAA TGCCATGTAC GATCGGCTGG TCAAAACCTG GCTTGCTTTC TATTCGCCAG AGTCCCGCCC CAGCTTGGTG CTGGCACACA GTGCCCGCAC ATTAATGGAC CGCTTCAATG AATCACTCTG GTCCGGTGAT TTAGTCGGGT CAGAAGAACC GGATGAACAA ACATTCAGTC AGGGATGTGC GGCCTGGTTT GCCAACAGTA ACAAGAAGGC GCTACTGGCT GAAATTGGCG TCGGCACGCT GGATCAGGCG ATGATGGCAG TGATGCCGTT TAAACATAAT AATCTGCGGC TTCTGGGGTT GAGTAACAAA ATCCTGCTGG CTGATGAGAT CCATGCCTGT GATGCTTACA TGTCGTGCAT TCTTGAAGGG CTGATCGAGC GGCAGGCGCG TGGCGGAAAC AGCGTCATTT TGCTTTCTGC TACGTTATCC CAACAGCAGC GCGACAAACT CGTCGCCGCC TTTGCGCGTG GCACAGAGGG CCAGCAAGAA GCTCCGTTCC TTGAAAAGGA TGATTACCCC TGGCTGACGC ATGTCACGAA ATCCGATGTG AACTCACACC GGGTAGCGAC GCGCAAAGAC GTTGAGCGTA GCGTCAGCGT GGGTTGGCTT CATAGTGAAC AAGAGAGTAT TGCGCGTATC GAATCGGCGG TAAGTCAGGG AAAATGCATC GCCTGGATCC GGAATTCTGT CGATGACGCT ATTAAGGTTC ATCGTCAGCT GCTTGCCCGC GGCGTCATTC CCGCTTCCAG CCTTTCACTC TTTCATAGCC GCTTTGCTTT TAGCGATCGC CAGCGAATTG AAATGGAGAC GCTGGCACGC TTTGGTAAAG AAGACGGTTC ACAGCGTGCC GGAAAAGTCC TCATTTGTAC TCAGGTCTTA GAGCAGAGCG TTGATTGTGA CCTGGACGAA ATGATCTCCG ACCTGGCCCC TGTTGATTTG CTGATTCAGC GAGCGGGGCG ATTACAGCGG CATATCCGCG ATATTAATGG TCAGTTAAAG CGTGACGGAA AAGACGAGCG TTCCCCTCCT GAATTGCTGA TTCTGGCCCC CGTCTGGGAC GACGCTCCTG GTGACGAATG GTTCGGCAGT GCCATGCGTA ACAGTGCATA TGTCTATCCC GATCATGGAC GAATCTGGCT GACGCAGCGT GTACTGCGTG AGCAAGGCGC TATTCAAATG CCACACGCAG CCCGCCTTCT TATTGAATCA GTCTACGGTG AGGACGTGGT AATGCCGGAA GGATTTGCCC GCAGCGAGCA GGAGCAAGTG GGCAAATATT ACTGCGATCG CGCAATGGCT AAAAAGTTTG TCCTGAACTT CAAGCCTGGC TATGCCGCCA ATATCAACGA TTACCTTCCG GAAAAGCTGT CGACACGTCT GGCTGAGGAA TCTGTTTCCC TGTGGCTGGC TACCTGTATT GCCGGTGTGG TGAAGCCTTA TGCCACCGGT GCTCACGCAT GGGAAATGAG CGTTGTCAGA GTGCGTCGAA GCTGGTGGAA AAAACATCGG GATGAGTTTT CTTTACTGGA AGGGGAAGCG TTCAGGCAGT GGTGCATTGA ACAGCGGCAA GATCCGGAAA TGGCAAACGT GATTTTAGTC ACTGATGACG AAAGTTGCGG GTATTCGGCC AGGGAGGGAT TGATTGGCAA GGTTGATTGA
|
Protein sequence | MRKYPLSLLK DKNIVTFFDF WGKTRRGEKE GGDGYHLLCW HSLDVAAMGY LMVKRNCFGL ADYFRQLGIS DKEQAAQFFA WLLCWHDIGK FARSFQQLYL APELKIPEGS RKNYEKISHS TLGYWLWNYY LSECEELLPS SSLSSRKLTR VIEMWMSITT GHHGRPPDRI DELDNFLPED KAAARDFLLE IKALFPLIEI PTFWDDDEGV ELLKQLSWYI SATVVLADWT GSSTRFFPRV AHPMDIKDYW QKTLVQAQNA LTVFPPKAET APFTGINTLF PFIEHPTPLQ QKVLDLDISQ PGPQLFILED VTGAGKTEAA LILAHRLMAA RKAQGLFFGL PTMATANAMY DRLVKTWLAF YSPESRPSLV LAHSARTLMD RFNESLWSGD LVGSEEPDEQ TFSQGCAAWF ANSNKKALLA EIGVGTLDQA MMAVMPFKHN NLRLLGLSNK ILLADEIHAC DAYMSCILEG LIERQARGGN SVILLSATLS QQQRDKLVAA FARGTEGQQE APFLEKDDYP WLTHVTKSDV NSHRVATRKD VERSVSVGWL HSEQESIARI ESAVSQGKCI AWIRNSVDDA IKVHRQLLAR GVIPASSLSL FHSRFAFSDR QRIEMETLAR FGKEDGSQRA GKVLICTQVL EQSVDCDLDE MISDLAPVDL LIQRAGRLQR HIRDINGQLK RDGKDERSPP ELLILAPVWD DAPGDEWFGS AMRNSAYVYP DHGRIWLTQR VLREQGAIQM PHAARLLIES VYGEDVVMPE GFARSEQEQV GKYYCDRAMA KKFVLNFKPG YAANINDYLP EKLSTRLAEE SVSLWLATCI AGVVKPYATG AHAWEMSVVR VRRSWWKKHR DEFSLLEGEA FRQWCIEQRQ DPEMANVILV TDDESCGYSA REGLIGKVD
|
| |