Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2889 |
Symbol | cas3 |
ID | 6147428 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2958168 |
End bp | 2960867 |
Gene Length | 2700 bp |
Protein Length | 899 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641617758 |
Product | CRISPR-associated helicase Cas3 |
Protein accession | YP_001744913 |
Protein GI | 170680605 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR01587] CRISPR-associated helicase Cas3 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0793506 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAAAT ATCCTTTAAG TTTACTGAAG GATAAAAATA TTGTGACTTT TTTTGATTTC TGGGGAAAAA CCCGACGCGG CGAGAAAGAC GGGGGCGACG ACTATCACCT TCTTTGCTGG CATTCGCTGG ATGTAGCCGC GATGGGCTAT TTAATGGTTA AAAGCAATTG CTTCGGACTG GCTGACTATT TTCGTCAATT AGGTTTCGCT GACACGGAGC AGGCAGCACA ATTTTTCGCC TGGCTGCTGT GCTGGCACGA TATTGGAAAA TTTGCCCGCT CTTTTCAGCA ACTTTACCTG CACCCACAAC TCAAGGTCCC GGAAGGCGCA AGAAAGAATT ACGAAAAGAT CTCTCATTCA ACGCTGGGCT ACTGGTTGTG GAACCATTAT TTAAGTGAAT GTCAGGAGTT GCTTCCTTCA TCTTCACTCT CTCCGCGCAA ACTTAAACGG GTGATGGAGA TGTGGATGCC CATGACTACC GGGCATCATG GTCGACCCCC TGAGCGTATG GATGAACTGG ATAATTTTCT GCCTGAAGAC AAAGGTGCCG CACGAGATTT TCTCCTTGCA ATCAAGGTAC TGTTTCCGCT CATAGAGATC CCCACATTCT GGGATGATGA CGAGGGCGTT GAACTTATAA AACAACTTTC CTGGTATATC TCTGCAACCG TCGTACTTGC AGACTGGACG GGATCGTCAA CGCGATATTT TCCACGCGTC GCACAGACAA TGGATATTGA AGATTACTGG CAGAAAACTT TAGTTCAGGC TCAAAACGCC TTAACCGTCT TTCCTCCAAA AGCAGAAAGC GCGCCTTTCA CCGGAATTAA TACGCTGTTT CCTTTTATTG AGAATCCGAC ACCGCTACAG CAAAAGGTGC TGGATCTGGA TATCAGCCAG CCGGGACCAC AGTTATTTAT TCTGGAGGAT GTAACCGGCG CAGGTAAAAC AGAAGCGGCG CTTATCCTGG CGCACAGGTT GATGGCTGCG GGGAAAGCGC AGGGTTTGTT TTTTGGCCTG CCAACAATGG CAACGGCCAA TGCCATGTAC GATCGACTGG TCAAAACCTG GCTTGCTTTC TATTCGCCAG AGTCCCGCCC CAGCCTGGTG CTGGCACACA GTGCCCGAAC ATTAATGGAC CGCTTTAATG AATCACTCTG GTCCGGTGAT TTAGTCGGGT CAGAAGAACC GGATGAACAA ACGTTCAGTC AGGGATGTGC GGCCTGGTTT GCCGACAGTA ACAAGAAGGC GCTACTGGCT GAAATTGGCG TCGGCACGCT GGATCAGGCG ATGATGGCTG TGATGCCGTT CAAACATAAT AATCTGCGGC TTCTGGGACT GAGTAACAAA ATCCTGCTGG CTGATGAGAT CCATGCCTGT GACGCTTACA TGTCATGCAT TCTTGAAGGG CTGATCGAGC GGCAGGCGCG TGGCGGAAAC AGCGTCATTT TGCTTTCTGC CACGTTATCT CAACAGCAGC GCGACAAACT CGTTGCTGCC TTTGCGCGTG GTATAGAGGG CCAGCAAGAA GCTCCGTTCC TGGAAAAGGA TGATTACCCC TGGCTGACGC ACGTTACGAA ATCCGATGTG CATTCACACC GGGTAGCGAC GCGCAAAGAC GTTGAGCGCA GCGTGAGTGT GGGTTGGCTT CATAGCGAAC AAGAGTGTAT TGCGCGTATC GCATCGGCGG TAAGTCAGGG GAAATGTATC GCCTGGATCC GAAATTCTGT CGATGACGCC ATCCAGGTTC ATCGTCAGTT ACTTGCCCGC GGCGTCATTC CTGCTTCCAG CCTTTCACTT TTTCATAGCC GCTTTGCTTT TAGCGATCGC CAGCGAATTG AAACGGAGAC GCTGGCACGC TTTGGTAAAG AAGACTGTTC ACAGCGTGCC GGAAAAGTCC TCATTTGTAC TCAAGTCCTG GAGCAGAGCG TTGACTGCGA CCTGGACGAA ATGATTTCCG ACCTTGCCCC TGTTGATTTA CTGATCCAGC GAGCAGGGCG ATTACAGCGA CATATCCGCG ATATGAATGG CCAGTTAAAG CGTGACGGTA AAGACGAGCG TCACCCTCCT GAATTGCTGA TTCTGGCCCC CGTCTGGGAC GACTCTCCGG GTGACGAATG GTTTGGCAGT GCCATGCGTA ACAGTGCATT TGTCTATCCC GATCATGGGC GAATCTGGCT GACGCAGCGT GTGCTGCGTG AGCAAGGTGC TATTCAGATG CCACACTCAG CCCGGCTTCT GATTGAATCA GTCTACGGTG AGGACGTAGT AATGCCGGAA GGGTTTGCCC GCAGCGAGCA GGAGCAAGTG GGCAAATATT ACTGCGATCG CGCAATGGCT AAAAAGTTTG TACTGAACTT CAGGCCCGGC TATGCCGCCA ATATCAACGA TTACCTCCCG GAAAAACTGT CGACACGTCT GGCTGAAGAG TCTGTTTCCC TGTGGCTGGC TACCTGTATT GACGGTGTGG TGAAGCCTTA TGCCACAGGT GCACACGCAT GGGAAATGAG CGTTGTCAGA GTGCGCCGAA GCTGGTGGAA AAAACATCGG GATGAGTTTT CTTTACTGGA AGGGGATGCG TTCAGGCAGT GGTGCGTTGA ACAGCGGCAT GATCCGGAGA TGGCAAACGT GATTTTAGTC ACCGATGACG AAAGTTGTGG GTATTCGGCT ACGGAGGGAT TGATTGGCAA GGTTGGTTGA
|
Protein sequence | MRKYPLSLLK DKNIVTFFDF WGKTRRGEKD GGDDYHLLCW HSLDVAAMGY LMVKSNCFGL ADYFRQLGFA DTEQAAQFFA WLLCWHDIGK FARSFQQLYL HPQLKVPEGA RKNYEKISHS TLGYWLWNHY LSECQELLPS SSLSPRKLKR VMEMWMPMTT GHHGRPPERM DELDNFLPED KGAARDFLLA IKVLFPLIEI PTFWDDDEGV ELIKQLSWYI SATVVLADWT GSSTRYFPRV AQTMDIEDYW QKTLVQAQNA LTVFPPKAES APFTGINTLF PFIENPTPLQ QKVLDLDISQ PGPQLFILED VTGAGKTEAA LILAHRLMAA GKAQGLFFGL PTMATANAMY DRLVKTWLAF YSPESRPSLV LAHSARTLMD RFNESLWSGD LVGSEEPDEQ TFSQGCAAWF ADSNKKALLA EIGVGTLDQA MMAVMPFKHN NLRLLGLSNK ILLADEIHAC DAYMSCILEG LIERQARGGN SVILLSATLS QQQRDKLVAA FARGIEGQQE APFLEKDDYP WLTHVTKSDV HSHRVATRKD VERSVSVGWL HSEQECIARI ASAVSQGKCI AWIRNSVDDA IQVHRQLLAR GVIPASSLSL FHSRFAFSDR QRIETETLAR FGKEDCSQRA GKVLICTQVL EQSVDCDLDE MISDLAPVDL LIQRAGRLQR HIRDMNGQLK RDGKDERHPP ELLILAPVWD DSPGDEWFGS AMRNSAFVYP DHGRIWLTQR VLREQGAIQM PHSARLLIES VYGEDVVMPE GFARSEQEQV GKYYCDRAMA KKFVLNFRPG YAANINDYLP EKLSTRLAEE SVSLWLATCI DGVVKPYATG AHAWEMSVVR VRRSWWKKHR DEFSLLEGDA FRQWCVEQRH DPEMANVILV TDDESCGYSA TEGLIGKVG
|
| |