Gene Cthe_2299 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2299 
Symbol 
ID4809888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2743056 
End bp2745308 
Gene Length2253 bp 
Protein Length750 aa 
Translation table11 
GC content38% 
IMG OID640107705 
ProductCRISPR-associated helicase Cas3 
Protein accessionYP_001038694 
Protein GI125974784 
COG category[R] General function prediction only 
COG ID[COG1203] Predicted helicases 
TIGRFAM ID[TIGR01587] CRISPR-associated helicase Cas3
[TIGR01596] CRISPR-associated endonuclease Cas3-HD 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATTTTG CAAAGCCTGT TGACAATTAT GAAGGAACAT ACGAGTATCA TGTAAAAAGG 
TGCGCTGAAA TACTGGAGTT TGAAATTAGG TCAAAGTATC CGGCCTTTAA ACGAATTCTC
TCGAGTTACG GATTTGATGT TGAGGATTTT ATTGAAAAAA TGAAAACAGC CGTAGTTTTT
CATGATTTTG GGAAGCTTAA TCCGTATTTT CAAGAATATA TGAAAAGAAA AATCGAGAAA
AAAAAGCTTT CAGGTATAAA GCATTTTAGG CACGAAGTTC TTTCGTGCCT TTTTCTTATG
TCAAATGAGA AGAGTAAAGA AGCCTACTTC CCATATCATA TATTGGCGGT GCTTGGACAT
CACAAAATGC TTTCTTCGGA TTTGAAAAGT TTTGAAAGAG AGAGGGTTTG GCAGGACAGG
TGGCCTGAGA TTTCAGAAGA TGCCGTAAAA CATGCGATGG AAATTGCTTC TGAGTTTGGG
ATAAAAGTTG ACGGAAGAAG CGGACCGCAA GGGAAGAGTG CCAACAATTA TCTGAATGCT
TTGTTAAAGT TGGCTTTGAT GTTGTATGAG AAAGACAGGG AAAAACTAAA AGTGGTTTAT
TCTGTATCAA AGGGGCTTTT GCATAACTGT GACTGGATAG CTTCTTCAAA TCTTGACTAC
AATAGGGTAT GCCTTATTGG TGTATCTGCC AATGATATTG AAAAGAAGCT TAAGGAAAAG
TTGGAGAAAG AGAACAAAAA ATATGTAAGA AGAGAGTTTC ACTCAATATG TGCCAATGCA
GTCGGGGATG TGGTGGCGAT TGCACCCACG GGGTCCGGAA AGACTGAGGC GGCATTGATG
TGGGCTTTGA ATTCCGAAAC AACCAAAATT ATTTTTCTCA TGCCGACAAT GGTTACTTCC
AACAGCCTTT ATGAGAGACT TTCAACCCAT TATTTCCCCA AAGAGAGCTG TGGGCTGTCC
CATTCAGGGG CGGAAACGTA TTTTTATAAA AAGTCTAAAG AAGATGATAC GGAAAATGAT
TATGACTGGT CTCAGATTCT ACATCAGAAG GCTTTTATTC CGGCGGTTAT GGTTTCGACA
GTGGACCAGG TTTTGTCCAC GGAATTTCAT ACAGGATTAT GGAATCAGAA GGAATATGCA
CTGGTCGGAA GTTCGGTGAT TTTTGACGAA ATACATGCTT ATGATGGCTA TACCATAGGC
CTTATAACCG GGGTTATAAA AAAGATAAAA AAGTATGGCG GCAGGGTCAT GCTGATGAGT
GCGACCATGC CGAAGTTTTT AAGAAATCAT TTTTTAGACC TTCTCGATTC AAAATGTTTG
GTTGTTGCAG AGGAGCTTAT GGAAAGGGCA AGCAATGAAT GGGCATACTT GGATACCGAT
CTTGAGGGTA TAAGGGAAAA GGTACTGAAG GAAGTATCAG AGGGGAAGAA GGTAGCACTT
ATTGTAAATG ATGTGGAAAC GGCCAAAAAG GAGTATAAAT ACTACTCTAA AAAAGGATTT
GATGTATTAT GTCTTCATTC TGAATTTACA ATGAAAGACC GGCAGGAAAA AGAGGGCCGG
TTGACATCGA AAGAGGGTAA TCCTTACCGG CTTGTGATAT CCACCCAAGT TATAGAGGTG
TCTCTGGATG TAAGTTTTGA TGTTATGTTC TCGGAGTGCG CGCCTATAGA CAGCCTGGTG
CAGAGGGCGG GAAGGTGTAA TCGCCACGGA CTCATAAACT TTGGAAGGTT TTATGTTTTT
AATCCTTCGG ATACTGCTGT AAAATACGTC TATCGAAAAC AGAAGGATAT TCTTGACAAA
ACTGTTGAGG TAATAAGGAA AAATTCAGGC AGATTAACTG AGAAGCAAAT TATGGCTATG
GTTGAGGAAG TTTACGAGGG ATTTAATCTG TATGATGAAG ATTATAAACT GGGAATTGAC
ATTGTACGGG ACATTGAACA GAGATACAAC TTTTTTGATG TAAATATTTT CCAAGAAGAC
GAGAATCTTT CCACACGAAA ATTTGATGTT GTTAAAGTTC CAGTTATTCC TGCGGATGAG
TATATGGAGA TTGTCGAAAA GCTGTTTGAA AGCAAGGATT ATAAAATGAT ATCTCTTTAT
GAAATACCTG TTTCAATAAG CAAGTTTAAA AAGTATATAA GAAGGCTGGA AATAAGCAAT
AAATACAATT TGCCTATATT TAAAATAAAG TATAGCAGTG AGTATGGCAT TGATTATGAG
GAGGATGATT CGAATACCTG TTATTTATAT TAG
 
Protein sequence
MYFAKPVDNY EGTYEYHVKR CAEILEFEIR SKYPAFKRIL SSYGFDVEDF IEKMKTAVVF 
HDFGKLNPYF QEYMKRKIEK KKLSGIKHFR HEVLSCLFLM SNEKSKEAYF PYHILAVLGH
HKMLSSDLKS FERERVWQDR WPEISEDAVK HAMEIASEFG IKVDGRSGPQ GKSANNYLNA
LLKLALMLYE KDREKLKVVY SVSKGLLHNC DWIASSNLDY NRVCLIGVSA NDIEKKLKEK
LEKENKKYVR REFHSICANA VGDVVAIAPT GSGKTEAALM WALNSETTKI IFLMPTMVTS
NSLYERLSTH YFPKESCGLS HSGAETYFYK KSKEDDTEND YDWSQILHQK AFIPAVMVST
VDQVLSTEFH TGLWNQKEYA LVGSSVIFDE IHAYDGYTIG LITGVIKKIK KYGGRVMLMS
ATMPKFLRNH FLDLLDSKCL VVAEELMERA SNEWAYLDTD LEGIREKVLK EVSEGKKVAL
IVNDVETAKK EYKYYSKKGF DVLCLHSEFT MKDRQEKEGR LTSKEGNPYR LVISTQVIEV
SLDVSFDVMF SECAPIDSLV QRAGRCNRHG LINFGRFYVF NPSDTAVKYV YRKQKDILDK
TVEVIRKNSG RLTEKQIMAM VEEVYEGFNL YDEDYKLGID IVRDIEQRYN FFDVNIFQED
ENLSTRKFDV VKVPVIPADE YMEIVEKLFE SKDYKMISLY EIPVSISKFK KYIRRLEISN
KYNLPIFKIK YSSEYGIDYE EDDSNTCYLY