Gene Information |
Locus tag | Cthe_2299 |
Symbol | |
ID | 4809888 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2743056 |
End bp | 2745308 |
Gene Length | 2253 bp |
Protein Length | 750 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640107705 |
Product | CRISPR-associated helicase Cas3 |
Protein accession | YP_001038694 |
Protein GI | 125974784 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR01587] CRISPR-associated helicase Cas3 [TIGR01596] CRISPR-associated endonuclease Cas3-HD |
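The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only and is not part of the source database. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon. The function names are hypothetical.

```python
# Consistency check for the coordinate and length fields of the record.
# Assumptions (not stated by the source): coordinates are 1-based and
# inclusive, and the gene length counts the terminal stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Number of bases spanned by an inclusive coordinate pair."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # the stop codon encodes no residue


length = gene_length(2743056, 2745308)
print(length)                           # 2253, matching "Gene Length"
print(expected_protein_length(length))  # 750, matching "Protein Length"
```

Under these assumptions the four fields agree: 2253 bp is 751 codons, and removing the stop codon leaves 750 residues.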
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATTTTG CAAAGCCTGT TGACAATTAT GAAGGAACAT ACGAGTATCA TGTAAAAAGG TGCGCTGAAA TACTGGAGTT TGAAATTAGG TCAAAGTATC CGGCCTTTAA ACGAATTCTC TCGAGTTACG GATTTGATGT TGAGGATTTT ATTGAAAAAA TGAAAACAGC CGTAGTTTTT CATGATTTTG GGAAGCTTAA TCCGTATTTT CAAGAATATA TGAAAAGAAA AATCGAGAAA AAAAAGCTTT CAGGTATAAA GCATTTTAGG CACGAAGTTC TTTCGTGCCT TTTTCTTATG TCAAATGAGA AGAGTAAAGA AGCCTACTTC CCATATCATA TATTGGCGGT GCTTGGACAT CACAAAATGC TTTCTTCGGA TTTGAAAAGT TTTGAAAGAG AGAGGGTTTG GCAGGACAGG TGGCCTGAGA TTTCAGAAGA TGCCGTAAAA CATGCGATGG AAATTGCTTC TGAGTTTGGG ATAAAAGTTG ACGGAAGAAG CGGACCGCAA GGGAAGAGTG CCAACAATTA TCTGAATGCT TTGTTAAAGT TGGCTTTGAT GTTGTATGAG AAAGACAGGG AAAAACTAAA AGTGGTTTAT TCTGTATCAA AGGGGCTTTT GCATAACTGT GACTGGATAG CTTCTTCAAA TCTTGACTAC AATAGGGTAT GCCTTATTGG TGTATCTGCC AATGATATTG AAAAGAAGCT TAAGGAAAAG TTGGAGAAAG AGAACAAAAA ATATGTAAGA AGAGAGTTTC ACTCAATATG TGCCAATGCA GTCGGGGATG TGGTGGCGAT TGCACCCACG GGGTCCGGAA AGACTGAGGC GGCATTGATG TGGGCTTTGA ATTCCGAAAC AACCAAAATT ATTTTTCTCA TGCCGACAAT GGTTACTTCC AACAGCCTTT ATGAGAGACT TTCAACCCAT TATTTCCCCA AAGAGAGCTG TGGGCTGTCC CATTCAGGGG CGGAAACGTA TTTTTATAAA AAGTCTAAAG AAGATGATAC GGAAAATGAT TATGACTGGT CTCAGATTCT ACATCAGAAG GCTTTTATTC CGGCGGTTAT GGTTTCGACA GTGGACCAGG TTTTGTCCAC GGAATTTCAT ACAGGATTAT GGAATCAGAA GGAATATGCA CTGGTCGGAA GTTCGGTGAT TTTTGACGAA ATACATGCTT ATGATGGCTA TACCATAGGC CTTATAACCG GGGTTATAAA AAAGATAAAA AAGTATGGCG GCAGGGTCAT GCTGATGAGT GCGACCATGC CGAAGTTTTT AAGAAATCAT TTTTTAGACC TTCTCGATTC AAAATGTTTG GTTGTTGCAG AGGAGCTTAT GGAAAGGGCA AGCAATGAAT GGGCATACTT GGATACCGAT CTTGAGGGTA TAAGGGAAAA GGTACTGAAG GAAGTATCAG AGGGGAAGAA GGTAGCACTT ATTGTAAATG ATGTGGAAAC GGCCAAAAAG GAGTATAAAT ACTACTCTAA AAAAGGATTT GATGTATTAT GTCTTCATTC TGAATTTACA ATGAAAGACC GGCAGGAAAA AGAGGGCCGG TTGACATCGA AAGAGGGTAA TCCTTACCGG CTTGTGATAT CCACCCAAGT TATAGAGGTG TCTCTGGATG TAAGTTTTGA TGTTATGTTC TCGGAGTGCG CGCCTATAGA CAGCCTGGTG CAGAGGGCGG GAAGGTGTAA TCGCCACGGA CTCATAAACT TTGGAAGGTT TTATGTTTTT AATCCTTCGG ATACTGCTGT AAAATACGTC TATCGAAAAC AGAAGGATAT TCTTGACAAA 
ACTGTTGAGG TAATAAGGAA AAATTCAGGC AGATTAACTG AGAAGCAAAT TATGGCTATG GTTGAGGAAG TTTACGAGGG ATTTAATCTG TATGATGAAG ATTATAAACT GGGAATTGAC ATTGTACGGG ACATTGAACA GAGATACAAC TTTTTTGATG TAAATATTTT CCAAGAAGAC GAGAATCTTT CCACACGAAA ATTTGATGTT GTTAAAGTTC CAGTTATTCC TGCGGATGAG TATATGGAGA TTGTCGAAAA GCTGTTTGAA AGCAAGGATT ATAAAATGAT ATCTCTTTAT GAAATACCTG TTTCAATAAG CAAGTTTAAA AAGTATATAA GAAGGCTGGA AATAAGCAAT AAATACAATT TGCCTATATT TAAAATAAAG TATAGCAGTG AGTATGGCAT TGATTATGAG GAGGATGATT CGAATACCTG TTATTTATAT TAG
|
Protein sequence | MYFAKPVDNY EGTYEYHVKR CAEILEFEIR SKYPAFKRIL SSYGFDVEDF IEKMKTAVVF HDFGKLNPYF QEYMKRKIEK KKLSGIKHFR HEVLSCLFLM SNEKSKEAYF PYHILAVLGH HKMLSSDLKS FERERVWQDR WPEISEDAVK HAMEIASEFG IKVDGRSGPQ GKSANNYLNA LLKLALMLYE KDREKLKVVY SVSKGLLHNC DWIASSNLDY NRVCLIGVSA NDIEKKLKEK LEKENKKYVR REFHSICANA VGDVVAIAPT GSGKTEAALM WALNSETTKI IFLMPTMVTS NSLYERLSTH YFPKESCGLS HSGAETYFYK KSKEDDTEND YDWSQILHQK AFIPAVMVST VDQVLSTEFH TGLWNQKEYA LVGSSVIFDE IHAYDGYTIG LITGVIKKIK KYGGRVMLMS ATMPKFLRNH FLDLLDSKCL VVAEELMERA SNEWAYLDTD LEGIREKVLK EVSEGKKVAL IVNDVETAKK EYKYYSKKGF DVLCLHSEFT MKDRQEKEGR LTSKEGNPYR LVISTQVIEV SLDVSFDVMF SECAPIDSLV QRAGRCNRHG LINFGRFYVF NPSDTAVKYV YRKQKDILDK TVEVIRKNSG RLTEKQIMAM VEEVYEGFNL YDEDYKLGID IVRDIEQRYN FFDVNIFQED ENLSTRKFDV VKVPVIPADE YMEIVEKLFE SKDYKMISLY EIPVSISKFK KYIRRLEISN KYNLPIFKIK YSSEYGIDYE EDDSNTCYLY
|
| |
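The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline. It assumes the sequence is given 5' to 3' on the coding strand, grouped in whitespace-separated blocks as displayed above. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the permitted start codons), so a standard codon table is used here. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Translate a block-formatted DNA sequence and compute its GC fraction.
# The codon table is built from the conventional TCAG ordering.

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence in the record.
prefix = "ATGTATTTTG CAAAGCCTGT TGACAATTAT"
print(translate(prefix))  # MYFAKPVDNY, the first ten residues listed
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the result to be compared with the Protein sequence and GC content fields of the record.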