Gene Cthe_3204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3204 
Symbol 
ID4809506 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3794968 
End bp3797370 
Gene Length2403 bp 
Protein Length800 aa 
Translation table11 
GC content34% 
IMG OID640108638 
ProductCRISPR-associated helicase Cas3 
Protein accessionYP_001039592 
Protein GI125975682 
COG category[R] General function prediction only 
COG ID[COG1203] Predicted helicases 
TIGRFAM ID[TIGR01587] CRISPR-associated helicase Cas3
[TIGR01596] CRISPR-associated endonuclease Cas3-HD 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGCTTT CGCACCCGGG AAAGCTGCTT TTGGACCATT TGAAAAATGT GTTTTTAATC 
GGAGACTGTA TTTTAATGCA GAAAAAGACC GAATTTGAGA GTTTTTCAGA ACATGATATT
CGCCAATTAA ACAAATTGAA TTTACTTACT CATGATCTTG GCAAGGCTAC ATCGTATTTT
CAGGATTACA TAAGAAATTT GGATGATAAT ACACAAAAGA ATGATGAAAG GAAAAGGCAC
GGTCTTTTGT CAGGGGTTTT GTCCTTTAAA ATTGTAAATG CAGTTATGAA GAATGAAATA
CTTGCTTTTT TATCTTATAT GGTGGTTTCA AAACATCATG GTGAGCTTGA CGATTTTACA
AATTTTATAT CTGTAATAAG CGGTGACGAA AAGAATAAAA CTCTTTTAAA GCTGCAATTT
GAAAGTATAG ATAAAGGTAA GTTGCAGGAA GTCATTACAA GGCTTGGAAT TGATTTTGAC
ATATTGAGTT ATACGGTTGA TGAGTTTGAA AACGACATTG ATTATATTAC TTCGAGAAAG
GTCAGAAAGA AAGTAAAGGA GCTAATGGGA CTTGAAATAT TCCTGCTAAT TGATTATCTT
TTTTCTTTAT TAATCTTTTC TGATAAACTT GAGGCGATAT ACAATAGTGA AAATATGAAT
ATTGAAGAAT TTATTGAAAA AAACACAAAT AGGCCCAGGA TTTCATCGGA TTGTGTTGAT
AAATTCAAAA AAAGCCTTGA AATAAAAAAT ATTCGGATGG CCGAGTGGCG GAACAGGGCA
TATCAGGATG TTTTGGGCAG TGTTGAAAAT CTTTCGCTTG AAGAAAAAAT ATTGTCCATT
AACCTTCCCA CCGGTTCCGG AAAAACACTT ACAGCTCTTA AAGCTGCTTT GCGGTTGAAG
GAAAGACTGA TTGAAGAAAA GGGATATAAT CCAAGAATTA TTTATGTACT TCCCTTTACT
TCGATTATTG AGCAAAATTT TGATGTTTTT CAGAAAGTTT TAGGCACTAC CGAGTCAAAT
GTACTTTTAA AGCATCATTA TTTATCCCAG AGGGTCTACC AATGGGAAAA GGAAGGAGAA
ATAGAGAGCT TAAGTGATAC TGTTTCCGAA CATCTTGTGG AGTCGTGGGA TTCGGAAATA
GTTGTAAGTA CCTTTGTGCA GTTATTGCAT TCTATTTTTA CAAACAGAAA CAGAAAGCTT
AAAAAGTTTC ACAATATTGT TAATTCAATA ATAATATTGG ATGAGGTGCA GAGTATACCC
CACAGGTATT GGAATCTTGT GCGGGAAACT TTTAGTGCCA TGGCAAAGTA TCTTAACTGT
CATTTCATTT TTATGACTGC AACCATGCCA TTGATTTTTT CTGAAGAAAA CAAGGAAATA
TATGAATTGG TAAAGGATAA GAGGAAATAT TTTGAAGAGT TTAGCCGTAT AACCATTGAT
GCGAAAAGAT TGACGGACAA AACTACACTG GATGAATACA AAACATTGTT ATTGGATGAT
ATTCTCCGAT ATAAAAAGGA CGACTTTCTG ATAGTTATGA ATACAATTAG AACTTCCATT
GAGATATATT CATTTATAAA AGAGGAATTA AAGGATGAGG CAGAGATATA TTATCTTTCG
ACCAATATCA TTCCGAAGGA GAGGCTTGAC AGAATAAAAA AGATAAAGGA AAGCAAGAAT
CGCAAGATAA TTGTATCAAC CCAGATGATT GAAGCCGGTG TGGATATTGA TATTGACCGG
GTCTACAGGG ATTTTGGCCC AATGGACAGT ATAAACCAGA CGGCGGGAAG ATGCAACCGT
GAATGGGGAG ATAAAAAGGG ACTTGTTACG CTGGTGAATT TGGTAAATGA AAATCATCAT
AGCAGGCCTT ATGCAACATA TATCTATGAC AATGTATTGA TAGAGGAGAC GAAAAAGGCA
CTTTCAGGGC TTGAAATGAT TGAGGAAAAA GAGATTTTTC ATCTTGCAGA AAAGTATTAT
GTGGGATTGA AAAGCCATGG AAGTGATGAA AGTCAAAAAC TTTTGGATTG CATAAAGGAA
CTTCGTTACA GGGAAGCTTT TGAATGTGGA AAGGATGATA AAAATTCCGG AGTATTTGAA
CTTATACGAC AGGACTTTAA TACGGTGGAC GTGTTTATCG AAATAGATGA TGACGCAACC
GAAGTATGGC AGGAGTATCA GAGTATAAAA AAGATAAAAG ACAGGTTTGA GCGAAAAAGG
AAATTTAATC AGTTCAAAAA AGACTTGTAT ATGTATGTGC TCAGTCTTCC CGAATTTGCT
GTCAGAAAGC AGGTTGATAT TGATGAAAAG GATATAACTT TCATAAACCG GGAAATGGTT
TTTAATACAT ATGATAAAGA TACGGGTTTT ATGAGGGATT TGGAAAAAGA CTACTTTTTT
TAA
 
Protein sequence
MLLSHPGKLL LDHLKNVFLI GDCILMQKKT EFESFSEHDI RQLNKLNLLT HDLGKATSYF 
QDYIRNLDDN TQKNDERKRH GLLSGVLSFK IVNAVMKNEI LAFLSYMVVS KHHGELDDFT
NFISVISGDE KNKTLLKLQF ESIDKGKLQE VITRLGIDFD ILSYTVDEFE NDIDYITSRK
VRKKVKELMG LEIFLLIDYL FSLLIFSDKL EAIYNSENMN IEEFIEKNTN RPRISSDCVD
KFKKSLEIKN IRMAEWRNRA YQDVLGSVEN LSLEEKILSI NLPTGSGKTL TALKAALRLK
ERLIEEKGYN PRIIYVLPFT SIIEQNFDVF QKVLGTTESN VLLKHHYLSQ RVYQWEKEGE
IESLSDTVSE HLVESWDSEI VVSTFVQLLH SIFTNRNRKL KKFHNIVNSI IILDEVQSIP
HRYWNLVRET FSAMAKYLNC HFIFMTATMP LIFSEENKEI YELVKDKRKY FEEFSRITID
AKRLTDKTTL DEYKTLLLDD ILRYKKDDFL IVMNTIRTSI EIYSFIKEEL KDEAEIYYLS
TNIIPKERLD RIKKIKESKN RKIIVSTQMI EAGVDIDIDR VYRDFGPMDS INQTAGRCNR
EWGDKKGLVT LVNLVNENHH SRPYATYIYD NVLIEETKKA LSGLEMIEEK EIFHLAEKYY
VGLKSHGSDE SQKLLDCIKE LRYREAFECG KDDKNSGVFE LIRQDFNTVD VFIEIDDDAT
EVWQEYQSIK KIKDRFERKR KFNQFKKDLY MYVLSLPEFA VRKQVDIDEK DITFINREMV
FNTYDKDTGF MRDLEKDYFF