Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3202 |
Symbol | |
ID | 4809504 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3793309 |
End bp | 3794226 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640108636 |
Product | CRISPR-associated Csh2 family protein |
Protein accession | YP_001039590 |
Protein GI | 125975680 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3649] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR01595] CRISPR-associated protein, CT1132 family [TIGR02590] CRISPR-associated protein, Csh2 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTAAAA ACAGGCAGGA GATATTATTT TTATATGATG TTACGGATGC AAATCCCAAC GGTGATCCTT TGGATGAGAA CAAACCCCGA ATTGACGAGG AAACGGGGAT TAATATTGTA ACGGATGTAA GACTGAAAAG AACCATAAGG GATTATTTGT ATGACTATAA AGGATTTGAT GGTTCTAACG GGAAGGATAT ATTTGTAAGA GAAATTGAAT CGGAAAAAGG CGGAATTAAG GACGGTAAAG CAAGGGCAAA GGACTTTAAT GAGAATGTCG ATGAAATTTT GCAAAAGGCC ATAGATATAA GGTTGTTTGG AGGAGTAATT CCTTTGGACA AGGCATCGAT AACATTTACC GGGCCGGTGC AGTTTAACAT GGGAAGGTCA TTGAATAAAG TAAATTTAAA GCATATAAAA GGTACCGGAG CTTTTGCCTC AGGAGAGGGA AAAGCGCAGA AGACATTTAG GGAAGAATAC ATTGTGCCGT ATTCCATAAT TGCTTTTCAC GGGATAATAA ACGAAAATGC GGCAAAAAGA ACCGGGCTCA CTGATGAAGA TGTGGATTTG CTGGACGATG CAATGTGGAA CGGTACAAAA AATCTTATAA CCCGCTCGAA AATGGGACAT ATGCCAAGAC TGATGCTTAG GGTGGTATAT AAACCAGGAG AGAATTTCTT TATAGGAGAT TTGCAAAACA GAATATCTCT TAATTTTGAC GTTGAAGAAG AAAAAATCAG ATCAATTAAA GATTTTTCAA TTAAATTGGA TGAGCTTATA GATGAGTTGG CAAATTATGG TGATAAAATA GAAAAAGTTG TGTTTGTTGC GGATAAGAAT TTGAGACTTA GTTATAAAGG GCGCGAAATC AATTTAAAAG ATATAAAAGA TATACGGTTT GAGGAAAAAA CTTTTTAG
|
Protein sequence | MIKNRQEILF LYDVTDANPN GDPLDENKPR IDEETGINIV TDVRLKRTIR DYLYDYKGFD GSNGKDIFVR EIESEKGGIK DGKARAKDFN ENVDEILQKA IDIRLFGGVI PLDKASITFT GPVQFNMGRS LNKVNLKHIK GTGAFASGEG KAQKTFREEY IVPYSIIAFH GIINENAAKR TGLTDEDVDL LDDAMWNGTK NLITRSKMGH MPRLMLRVVY KPGENFFIGD LQNRISLNFD VEEEKIRSIK DFSIKLDELI DELANYGDKI EKVVFVADKN LRLSYKGREI NLKDIKDIRF EEKTF
|
| |