Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2051 |
Symbol | |
ID | 4811020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2441017 |
End bp | 2441799 |
Gene Length | 783 bp |
Protein Length | 260 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640107456 |
Product | CRISPR-associated Csm3 family protein |
Protein accession | YP_001038451 |
Protein GI | 125974541 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1337] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) |
TIGRFAM ID | [TIGR02581] CRISPR-associated RAMP protein, SSO1426 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGATGG ATAAATTTCA AAACAGATAT GTTGTCAGAG GAATTATTGT GGCCGAAACG CCCATACATA TAGGAGCGGG AAATGAAAGT ATGAATCCCG TCGAACCAGA CAATTCGGTT ATAAAGGACA AGGACGGAAA GCCGTATATT CCTGGAAGTT CTCTCAAAGG AGCTCTCAGA AGCTGGCTGG AGTCTTTTTT AAGAGGCGGC GGAAATGAAA TTACAGGTGG AAATGCTCCC TGTCTTTGTG TAAATGAACC TTGCCTTGGT GATAATCCGG AAAACAAAGA ATGGCTCAAA GAGATAAAAA AGAAGTATAA GAACAATAAA GATGCGGACA GGTTGGTTGC TGAAGAAATA TACAGGAAAT TGTGCCCGGT TTGCAAAGTG TTTGGTTCTC AGCATTTTGC GTCTAAAGTA ACAATAAATG ACAGCAAACT TAAAAGTGAA AGGGCCTATA TTGAAAAAAG AGACGGAGTT GCAATTGACA GGGACACCGG TACTTCAGCG AAGAATAAAA AGTATGATTT TGAACAGGTG GCGGCGGGAA CAGAATTTGA TTTCCATATG ACTGCGGACA ACCTGGATGA AGAGAATGAA AAAATTCTGA AAATAATTGT AAAGATGCTG GAAAGCGGGG ATTTTGTTGT GGGCGGAAAA AGATCGGTCG GACTTGGAAG GATAAGACTT TATAACACCA AAATTTACAA GATAGACGAA AAGAGTCTCG AAAATTATTT ATTCAATGGT TTAAGTGAGG AAATGAGGTG GCAGTATGTT TAG
|
Protein sequence | MLMDKFQNRY VVRGIIVAET PIHIGAGNES MNPVEPDNSV IKDKDGKPYI PGSSLKGALR SWLESFLRGG GNEITGGNAP CLCVNEPCLG DNPENKEWLK EIKKKYKNNK DADRLVAEEI YRKLCPVCKV FGSQHFASKV TINDSKLKSE RAYIEKRDGV AIDRDTGTSA KNKKYDFEQV AAGTEFDFHM TADNLDEENE KILKIIVKML ESGDFVVGGK RSVGLGRIRL YNTKIYKIDE KSLENYLFNG LSEEMRWQYV
|
| |