Gene Cthe_2297 details

Gene Information

Locus tag: Cthe_2297
Symbol:
ID: 4809886
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2741525
End bp: 2742517
Gene Length: 993 bp
Protein Length: 330 aa
Translation table: 11
GC content: 35%
IMG OID: 640107703
Product: CRISPR-associated Cas1 family protein
Protein accession: YP_001038692
Protein GI: 125974782
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
            [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype
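
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the last codon of the gene is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the record above.
start_bp = 2741525
end_bp = 2742517
gene_length_bp = 993
protein_length_aa = 330

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and does not contribute a residue.
codon_count = span_bp // 3
expected_residues = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("residue count matches protein length:", expected_residues == protein_length_aa)
```

Under these assumptions the three fields agree: 993 bp is 331 codons, that is 330 residues plus one stop codon.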


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.150876
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA GTGCTTTTAT TTTTAGTGAC GGAGAGTTGA AAAGGAAGGA CAGTACAGTA 
CTTTTCGAGA GCGAAGATTC AAAAAATTAT CTTCCGATTG AGGATATCAG CGATATTTAT
ATTTTCGGTG AGGTTACCGT TACAAAGAAG TTTCTGGAGC TGGCAACTCA GAAGGAAATA
CTTCTTCATT TTTATAACTA CAATGAATAT TATGTTGGAA CTTATTATCC CAGAGAGCAT
TATAATTCCG GTTTTATGAT ATTAAAACAG GCGGAGCATT ATCTTGATGA GGAGAAGAGA
ATGGCAATTG CTAAAAAGTT CATACATGGC AGTGTGAAAA ACATGCTTGC GGTTTTAAAA
TACTACAATA ACCGGGAAAA AGATTTGGAC AGACAGATAA CGGCTATTAG TGATTTGGCA
GAAAAAATAG ATGAGATGGA CGAAATCAAC AAATTAATGG CTATTGAGGG AAATATAAGA
GAGATTTACT ATGGTTCCTT CGACATTATA GTAGATGATG AGTATTTTGA GTTTGGTAAA
AGAACAAAAC AACCACCGAA AAATCGAATG AATTCTTTGA TCAGCTTTGG AAACAGCATA
CTTTATACGA CTGTTTTAAG CGAGATATAT AAAACTCATT TGGATCCGAG GATAGGATAC
CTTCATTCTA CAAACCACAG AAGATTCACT TTAAATCTTG ATGTTGCTGA AATTTTTAAG
CCAATCATTG TTGACAGAGT CATTTTTACG TTGATTGGCA AAAAAATGCT GGGAGAAAAG
CATTTTGAAG AAAAAGCCGG TGGAATTGTC TTAAACGACA AGGGGCGAAA GCAGTTTGTA
GCGCAAATGC TTGAGAAGTT AAATGCAACT TTAATGTATA AGCCCTTGGG GAGAGAAGTA
TCTTACAGGA GACTTATCAG GTTGGAATTG TATAAGCTTG AGAAACATCT TATGGGAGAG
CAGGAATATA AACCATACGT TGCTTCATGG TAA
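
The GC content reported in the gene information is the fraction of G and C bases in a sequence like the one above. A minimal sketch of that calculation follows. The function name gc_fraction is hypothetical, and the example runs only on the first 60-base row of the listing, so its result describes that row rather than the whole gene. How the database rounds the full-gene value is not stated in the record.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First row of the gene sequence as printed above (six blocks of ten bases).
first_row = "ATGAAAAAAA GTGCTTTTAT TTTTAGTGAC GGAGAGTTGA AAAGGAAGGA CAGTACAGTA"

print(f"GC fraction of the first row: {gc_fraction(first_row):.1%}")
```

Passing the full 993-base sequence to the same function would give the gene-wide value to compare against the reported figure.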
 
Protein sequence
MKKSAFIFSD GELKRKDSTV LFESEDSKNY LPIEDISDIY IFGEVTVTKK FLELATQKEI 
LLHFYNYNEY YVGTYYPREH YNSGFMILKQ AEHYLDEEKR MAIAKKFIHG SVKNMLAVLK
YYNNREKDLD RQITAISDLA EKIDEMDEIN KLMAIEGNIR EIYYGSFDII VDDEYFEFGK
RTKQPPKNRM NSLISFGNSI LYTTVLSEIY KTHLDPRIGY LHSTNHRRFT LNLDVAEIFK
PIIVDRVIFT LIGKKMLGEK HFEEKAGGIV LNDKGRKQFV AQMLEKLNAT LMYKPLGREV
SYRRLIRLEL YKLEKHLMGE QEYKPYVASW
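
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below translates the first and last rows of the gene listing and compares them with the two ends of the protein listing. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits, and this gene begins with ATG). The helper name translate is hypothetical and the code is a plain-Python illustration, not the database's own pipeline.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    cleaned = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE[cleaned[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 33 bases of the gene sequence listed above.
first_30 = "ATGAAAAAAA GTGCTTTTAT TTTTAGTGAC"
last_33 = "CAGGAATATA AACCATACGT TGCTTCATGG TAA"

print(translate(first_30))  # compare with the start of the protein: MKKSAFIFSD
print(translate(last_33))   # compare with the end of the protein: QEYKPYVASW
```

The final codon TAA is a stop codon, which is why the 993-base gene yields 330 residues rather than 331.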