Gene Cthe_3202 details

Gene Information

Locus tag: Cthe_3202
Symbol:
ID: 4809504
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3793309
End bp: 3794226
Gene Length: 918 bp
Protein Length: 305 aa
Translation table: 11
GC content: 36%
IMG OID: 640108636
Product: CRISPR-associated Csh2 family protein
Protein accession: YP_001039590
Protein GI: 125975680
COG category: [L] Replication, recombination and repair
COG ID: [COG3649] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR01595] CRISPR-associated protein, CT1132 family
            [TIGR02590] CRISPR-associated protein, Csh2 family
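
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 3793309
END_BP = 3794226
GENE_LENGTH_BP = 918
PROTEIN_LENGTH_AA = 305


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, ASSUMING 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, ASSUMING a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3794226 - 3793309 + 1 gives 918 bp, and 918 / 3 gives 306 codons, which is 305 residues plus one stop codon.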


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTAAAA ACAGGCAGGA GATATTATTT TTATATGATG TTACGGATGC AAATCCCAAC 
GGTGATCCTT TGGATGAGAA CAAACCCCGA ATTGACGAGG AAACGGGGAT TAATATTGTA
ACGGATGTAA GACTGAAAAG AACCATAAGG GATTATTTGT ATGACTATAA AGGATTTGAT
GGTTCTAACG GGAAGGATAT ATTTGTAAGA GAAATTGAAT CGGAAAAAGG CGGAATTAAG
GACGGTAAAG CAAGGGCAAA GGACTTTAAT GAGAATGTCG ATGAAATTTT GCAAAAGGCC
ATAGATATAA GGTTGTTTGG AGGAGTAATT CCTTTGGACA AGGCATCGAT AACATTTACC
GGGCCGGTGC AGTTTAACAT GGGAAGGTCA TTGAATAAAG TAAATTTAAA GCATATAAAA
GGTACCGGAG CTTTTGCCTC AGGAGAGGGA AAAGCGCAGA AGACATTTAG GGAAGAATAC
ATTGTGCCGT ATTCCATAAT TGCTTTTCAC GGGATAATAA ACGAAAATGC GGCAAAAAGA
ACCGGGCTCA CTGATGAAGA TGTGGATTTG CTGGACGATG CAATGTGGAA CGGTACAAAA
AATCTTATAA CCCGCTCGAA AATGGGACAT ATGCCAAGAC TGATGCTTAG GGTGGTATAT
AAACCAGGAG AGAATTTCTT TATAGGAGAT TTGCAAAACA GAATATCTCT TAATTTTGAC
GTTGAAGAAG AAAAAATCAG ATCAATTAAA GATTTTTCAA TTAAATTGGA TGAGCTTATA
GATGAGTTGG CAAATTATGG TGATAAAATA GAAAAAGTTG TGTTTGTTGC GGATAAGAAT
TTGAGACTTA GTTATAAAGG GCGCGAAATC AATTTAAAAG ATATAAAAGA TATACGGTTT
GAGGAAAAAA CTTTTTAG
 
Protein sequence
MIKNRQEILF LYDVTDANPN GDPLDENKPR IDEETGINIV TDVRLKRTIR DYLYDYKGFD 
GSNGKDIFVR EIESEKGGIK DGKARAKDFN ENVDEILQKA IDIRLFGGVI PLDKASITFT
GPVQFNMGRS LNKVNLKHIK GTGAFASGEG KAQKTFREEY IVPYSIIAFH GIINENAAKR
TGLTDEDVDL LDDAMWNGTK NLITRSKMGH MPRLMLRVVY KPGENFFIGD LQNRISLNFD
VEEEKIRSIK DFSIKLDELI DELANYGDKI EKVVFVADKN LRLSYKGREI NLKDIKDIRF
EEKTF
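
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce this record. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (table 11 differs in which start codons it permits, which this sketch does not handle), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; ASSUMED to apply to table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First 21 bases and last 18 bases of the gene sequence above.
    print(translate("ATGATTAAAA ACAGGCAGGA G"))
    print(translate("GAGGAAAAAA CTTTTTAG"))
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the protein sequence and the GC content listed in the record.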