Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3215 |
Symbol | |
ID | 4809517 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3808630 |
End bp | 3809880 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 640108649 |
Product | CRISPR-associated TM1812 family protein |
Protein accession | YP_001039603 |
Protein GI | 125975693 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02221] CRISPR-associated protein, TM1812 family [TIGR02549] CRISPR-associated DxTHG motif protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAAAA AACTTTTTGC GTTTTTGGGA ATTGGAGATT ATAAAAGCGT AGAGTATTAT TTTCAAAATA AAAAAGAAGG ATATAAAACA GAGTATATAC AAGAGGCTAT TACTAAATTG CTAAATGAAC AAGATTTAAA TGTTACAGTA TTTGTGACTG CAGAAGCACG GAAAAAGCAC TGGGAACCGG AAAATAATAA AGGGCTTGAA AGCAGATTGA AAAAATTGAA CATTAATTGC AAAGCAGTTA ATATACCTGA TGGAAAAGCA AACGACGATG TATGGAAAAT ATTCACAAGT GTATATAGTG AAATTGAATT TAATGATGAA ATATATGTTG ATGTGACTCA CTCTCTTAGA AATATACCAA TAATTTTTAT GTCTGTTTTG AATTATGCAA AGGTTACAAA AAATTGCACT ATCAAAGGAA TATTTTATGG AGCGTATGAA GCTAAAGAAA ATGAGAGAGC ACCTATATAT GATTTGACGT TGTTTGATCA GATTGGCGAA TGGAGTTCGG GAGTTGAACA ATTGCTTACA ACAGGTGAAT GTGAGATGTT TTGTTCTACT GTGGAAAAAA CTTTGGATCC TTTATTAAGG GAAGCAAAAG GAAAAGATGA ATTAATTAAA CTTGTAAAAA AGTGTTCAAA ACTTATCAAA GAATTTTATA CAGATTTGAA ACTTGTAAGA GGAAAATCAG TTTTAGAAGA TGGTAGAAAG TTGTATCAAG TTTTGTGTGA AATTAAGGCG TTAAATACGG AAAAACATAT TACTATGCAA CCATTTTTTC ATATACTTGA ACGGGTTGAG AATCAGGTCG CTTTCTTTCA AAATGAAAAT TTAATAGAAA ATATATTGGA GTGTGTTAAA TTATGTAAAA AGTTTGGACA GTATCAACAG GCCTATACAT TTTTAAGAGA GAATATAATA AATTATGTTT GTATAAATAC AGGGCTTGAT TGGAAAAAGG AAGATCCGGA TAGATTAAAA GCAGAAGAAT TAATTGGTAA ATTGTATATG AGAAAGATAA AGAAAGTTCA AATAGAAGTT AGTGAAGACA TAAAGTCTAT ACTCGAAAAT GGAGAGGATT TTATTTGTGA TGATGCAATT GAGTTATTTG GAGAGTTAAT TGAATTTAGG AATGATCTTG ATCATGCACA ATTTAGAATG ATTAACCCTT CGAAAGATAA AATTACATGC AAACTTGATT CGTTTATTGA GAGGTTTGAA AAATATTATA TTTCTAAATA A
|
Protein sequence | MAKKLFAFLG IGDYKSVEYY FQNKKEGYKT EYIQEAITKL LNEQDLNVTV FVTAEARKKH WEPENNKGLE SRLKKLNINC KAVNIPDGKA NDDVWKIFTS VYSEIEFNDE IYVDVTHSLR NIPIIFMSVL NYAKVTKNCT IKGIFYGAYE AKENERAPIY DLTLFDQIGE WSSGVEQLLT TGECEMFCST VEKTLDPLLR EAKGKDELIK LVKKCSKLIK EFYTDLKLVR GKSVLEDGRK LYQVLCEIKA LNTEKHITMQ PFFHILERVE NQVAFFQNEN LIENILECVK LCKKFGQYQQ AYTFLRENII NYVCINTGLD WKKEDPDRLK AEELIGKLYM RKIKKVQIEV SEDIKSILEN GEDFICDDAI ELFGELIEFR NDLDHAQFRM INPSKDKITC KLDSFIERFE KYYISK
|
| |