Gene Information |
Locus tag | Cthe_2057 |
Symbol | |
ID | 4810653 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2446982 |
End bp | 2448814 |
Gene Length | 1833 bp |
Protein Length | 610 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640107462 |
Product | CRISPR-associated TM1812 family protein |
Protein accession | YP_001038457 |
Protein GI | 125974547 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02221] CRISPR-associated protein, TM1812 family; [TIGR02549] CRISPR-associated DxTHG motif protein |
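The coordinate and length fields in this table can be cross-checked against each other. The sketch below is only a sanity check, and it assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2446982
end_bp = 2448814
gene_length_bp = 1833
protein_length_aa = 610

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon, so the protein
# has one residue fewer than the number of codons.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates agree with Gene Length
print(remainder == 0)                                # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # agrees with Protein Length
```

Under those assumptions the three fields are mutually consistent: 1833 bp corresponds to 611 codons, that is, 610 residues plus a stop codon.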
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAAAAT TCATTTCTCT TCTTGGTACG TCGAAATACG TTCCTTGTAA TTATTTTATT AAAAACAGGG AAGAACTCAA AATTAATGAT TGCTGTTATG TTCAGAAAGC AATTTTGGAT ATTTTGAGAC AGGAAAATGT TATACCTGAT AAAATAATTA TTTTTACTAC TGATGAAGCA CATGTAAGTA ATTGGGAAAA TAACAAATGG AATAACAAAG GAGACAAAAA TGATGTTCAA GAGATTGAGG ATGAACTGCA AAAACGGCCG GGATTAAAAG GTGAGCTTGA AAAATACAAA GAGTTAACAG GTGCTGATTT TAAAAGTGTA AGAATACCTA ATGGAATTAG AGAAGAAGAT TTATGGGAAA TATTTAGAAC GATATTTGAT GAAATAGATG AAAATGATGA AATAATATTT GATGTTACCC ATTCATTCAG ATATTTGCCA ATGTTGGTTT TTATAGTGAT AAATTACGCC CGGGTAGTAA AAAAATGCAA GCTTAAGAGT ATTTATTATG GAGCGTTTGA AGTATTGGGC ACTCCCGAAA CAGTGGCAAA AATTCCATTG GAGGAAAGAA ATGCTCCCAT TTTTGATCTC ACATCTTTTG TGGATTTGTT TGACTGGACG ATGGGAATTG ACAGATATTT GAATACCGGC GATGTATCAG TAGTGCATGA ACTTACAGAT ATACAAATAA AAAGAGTAAA TAAGGAAAAA CTTAAATTTA TTTCAAAATC AGAACAAGGA GTTGATCCAA AAACTTTATT TATGGATTCG AGGCAGTTAA GAGCACTATC TGAATCAATG AAAAATTTTA GCGACGTAGT GTTTACATGT AGAGGTTTGG AATTGACACG GGTTGCATGT GAATTAAAAG AAAAACTGAG AGAAGTTACT GAAAGTGCTT CAAGACAGCA TATAATACCA TTGATGCCGG TTCTTGAAAT GATGAAAGAA AGATTTGATA AATTTAGCAA AGATAACGAT TACATAAATA TTATTGAAAC TGCGAGATGG TGTGCTGACA ATAAAATGTA CCAGCAAGGA CTGACAATTT TGGAAGAAGG ACTGATTAGT TTTGGCTGTG AAAAGTTGGG TTATGAAAAT TTAAGCGATA AAATAGACCT TGATAAGAGA AGAAAGATAG GTTCATATGC TTTTGTTGTT AGTAATGATT TTGGAAGAAA AAGCGACAAT AAAAAAGAGA AGATACCTTT ATTAAATATA AAATCAGATG TTATAACTGA TCTGTTGATA CTAATAGATG AAATAAGCAG TATCAGAAAT GACATTAATC ATGCCGGTTG GAGGAAAGAT CCTTCTGAAG CCGGTGCTTT TGGGGAACAA CTTAAGAATT TTATTTCACG TGCGGAAAAA ATTATTTCTC CAGAAAACTT TTCCACAGAA TCAGATAAAT GTATTAATAA TGACAGTGTA GACAAGGAAA AGAAAATGCT TTTGATATTG TCCCATAAAC TTGTGCCCAA ACAGGAAGAA GAAGCAAGAC AACGCTTTGG CATATCAAAG TTTTTGCCAA TGCCCAATGA GTTACATAAT AAATGGTCAA ATATACCTCC TGAGCTGGAG GATTTACGAG ACTATCTCAA CGACATTTTG GAATGGATTG ATTTAAATGC GCAAGAAGGA GATTATGCTC TGATACAGGG AGATTACGGA GCTACATTTA TTGCTGTCAA CCATTGCCTG GCCAAAGGAA TTATTCCTGT TTATTCAACA ACTCACAGAA TTGTGCGAGA GGAAAAAAAT GAAGACAAGG TTATAAGTGT AAGAGAATTT 
GAACATGTAA TTTTAAGAAA ATATGAAAAT TAA
|
Protein sequence | MLKFISLLGT SKYVPCNYFI KNREELKIND CCYVQKAILD ILRQENVIPD KIIIFTTDEA HVSNWENNKW NNKGDKNDVQ EIEDELQKRP GLKGELEKYK ELTGADFKSV RIPNGIREED LWEIFRTIFD EIDENDEIIF DVTHSFRYLP MLVFIVINYA RVVKKCKLKS IYYGAFEVLG TPETVAKIPL EERNAPIFDL TSFVDLFDWT MGIDRYLNTG DVSVVHELTD IQIKRVNKEK LKFISKSEQG VDPKTLFMDS RQLRALSESM KNFSDVVFTC RGLELTRVAC ELKEKLREVT ESASRQHIIP LMPVLEMMKE RFDKFSKDND YINIIETARW CADNKMYQQG LTILEEGLIS FGCEKLGYEN LSDKIDLDKR RKIGSYAFVV SNDFGRKSDN KKEKIPLLNI KSDVITDLLI LIDEISSIRN DINHAGWRKD PSEAGAFGEQ LKNFISRAEK IISPENFSTE SDKCINNDSV DKEKKMLLIL SHKLVPKQEE EARQRFGISK FLPMPNELHN KWSNIPPELE DLRDYLNDIL EWIDLNAQEG DYALIQGDYG ATFIAVNHCL AKGIIPVYST THRIVREEKN EDKVISVREF EHVILRKYEN
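The gene and protein sequences can be compared by translating the nucleotide sequence with the codon assignments of translation table 11. The listed gene sequence begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The sketch below is a minimal illustration, not the database's own method: `translate` is a hypothetical helper, the codon table is built by hand in the usual TCAG ordering, and only the first and last blocks of the sequence are used to keep the example short.

```python
from itertools import product

# Codon assignments for translation table 11, listed in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and last blocks of the gene sequence from the record.
first_blocks = "ATGCTAAAAT TCATTTCTCT TCTTGGTACG"
last_blocks = "GAACATGTAA TTTTAAGAAA ATATGAAAAT TAA"

print(translate(first_blocks))  # compare with the start of the protein sequence
print(translate(last_blocks))   # compare with the end of the protein sequence
```

Passing the full gene sequence to `translate` in the same way gives a string that can be compared residue by residue with the protein sequence after removing the spaces between its blocks.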