Gene Information |
Locus tag | Cthe_3201 |
Symbol | |
ID | 4809503 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3791468 |
End bp | 3793309 |
Gene Length | 1842 bp |
Protein Length | 613 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640108635 |
Product | CRISPR-associated Csh1 family protein |
Protein accession | YP_001039589 |
Protein GI | 125975679 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02556] CRISPR-associated protein, TM1802 family; [TIGR02591] CRISPR-associated protein, Csh1 family |
|
|
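The start and end coordinates, gene length, and protein length listed above agree with one another, and the check is simple arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and a single terminal stop codon, and the function names are hypothetical rather than part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the Cthe_3201 record.
length_bp = gene_length(3791468, 3793309)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1842 613
```

Under these assumptions the computed values match the Gene Length (1842 bp) and Protein Length (613 aa) fields.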
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.573889 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCTTGCAG GTGTATTGCA GCTGGGACAG TACGCACTCA ATAAAAAGAG TACTGACACC GAGGAATATC TTCAAGTCAT TGAAAACCCT AATGATAAAG GCAATTACAA TCATGTCTTA AAAATTGCAT TTGAATCTAC CGAAGGAAAC ATAGTCTATC GTGGGGTTGA ATACGAAGAG TTCTCCTCGA AAAAAATAAA TAACTATGCA TACAAAAAGG GAAGTGCAAG GGGTGGCGAT GTAACACCTA CTTCAAAGTA TACAGATTCA ATGAAGACTT TAAACAAGAT AATGATATCT TTTAATGATA TTTTAAAGAG TGCAAATCAA AACAATGACG AACAAAAAAT ATTTAAAGGC ATTTATGAAT ACATGGTTTC AAATCAAGAA AACATTGCAA ACGATATTAC GGAAAAAATC AAATCCATCT CGTTAAAAAA AGGAGAATCT TTCATTATAA CTGTTACTTT GTTTGATAAC AATACTGAAA AATATATGGG CAATTTTCAA TTAATAAGAA ATCATCTTGC CAGAATTTTG AATGAACAGT ATTACAATAA ATACGGTAAA ACATCAAAAG GTAAAGGTAT CTGCTATTAT TGTAAAAATG AAGGTGAAGT TTTCGGGTTT GTCAATACAT ATAATTCCTA CACGGTTGAT AAAATAGGAT TTGTAACCGG TGGTTTCAAG CAGGAAAACG CGTGGAAAAA CTATCCTGTC TGTTCTTGTT GCGCACAGAA GCTGGAGCAA GGTAAGAAAT ATATCAGAGA AAATCTTACA TCTAAATTTT CAGGTTTCGA CTATTTCGTA ATTCCTAAAG CAGTAATAAG TGATGAACAT GATGAAGCGG AATTTATAGA GACCCTCGAA GAATTTGAAA AAAATACGAA CTTTTCCACC CAAGAATCCA CAAAGCAAAA CTTGCTTGGC AGTGAAAAAG ATTTTCTGGA GATAATGAAG GACAGCAAAA ATTATCTTAA TTACAATATG CTTGTTTTTA AAGAAGAGCA GTCCGGAAGC GTTTTTAGGA TACTTCTTTA TATAGAGGAT ATTGTGCCAA GCAGGGTAAA AAACATATTG AGAGTAAAAG ACAGAGTAGA TGAAACAGTG CTGTTTAAAA ATCTTCCGGG TAAAGATAAT GCAACTTATG ATTTAAAGTT TGGATTCGAT AAAATAAGGA CCTTTTTCCC GAATAACAAG ACAGAAGGAA ATTTTGACAA AAGTTTTCTT GAGATTTTAA ACAATGTGTT TACATATAAA AAGATAAGTT ACAAATTTCT TTTAGGGAGA ATGATTTCAA AAATAAGGAG TGATTTTGCA AGGGAGGAAT ATGTAAAAAA CCTTGTACTT CAGGCGCTGA TGTGCATTAT GTTTATTGAT AAGCTTAATT TGCTCAGTGG TAAAGGGAAA GAGGTGCAAA AAATAATGAT TGAGAAGACC GAGAAAAACA AAAAATATCT TGATTTTTTT GAAAACGAAA GTTATAAGGA CGTTTTCAAT TCAGATTATA AGAGAGCAGT TTTTTTGACA GGAGTGCTGA CGGAAAAGTT ATTAAACATC CAATACAAAG AAAGAGGAAG CAAACCTTTC TACAGCAGGC TCAATGGCTT GAAACTGAAT AAAAACATAG TAAAAAGAAT ATATACCGAG GCCATTAATA AATTAAATGA ATATAACAAG AACTATTATA AAGAATTGGA ATATCTTATT GGCATGTACA TGTTGTCGGA AGAATCCCAA AAGAATGTTT CCGATGATGA AATCAGTTTT TATTTTGTGC TTGGAATGTC ATTGGCAAGG 
TTCTTTAATG AAGAAAAGAA AGACGGAGAG GATGAGGAAT AA
|
Protein sequence | MLAGVLQLGQ YALNKKSTDT EEYLQVIENP NDKGNYNHVL KIAFESTEGN IVYRGVEYEE FSSKKINNYA YKKGSARGGD VTPTSKYTDS MKTLNKIMIS FNDILKSANQ NNDEQKIFKG IYEYMVSNQE NIANDITEKI KSISLKKGES FIITVTLFDN NTEKYMGNFQ LIRNHLARIL NEQYYNKYGK TSKGKGICYY CKNEGEVFGF VNTYNSYTVD KIGFVTGGFK QENAWKNYPV CSCCAQKLEQ GKKYIRENLT SKFSGFDYFV IPKAVISDEH DEAEFIETLE EFEKNTNFST QESTKQNLLG SEKDFLEIMK DSKNYLNYNM LVFKEEQSGS VFRILLYIED IVPSRVKNIL RVKDRVDETV LFKNLPGKDN ATYDLKFGFD KIRTFFPNNK TEGNFDKSFL EILNNVFTYK KISYKFLLGR MISKIRSDFA REEYVKNLVL QALMCIMFID KLNLLSGKGK EVQKIMIEKT EKNKKYLDFF ENESYKDVFN SDYKRAVFLT GVLTEKLLNI QYKERGSKPF YSRLNGLKLN KNIVKRIYTE AINKLNEYNK NYYKELEYLI GMYMLSEESQ KNVSDDEISF YFVLGMSLAR FFNEEKKDGE DEE
|
| |
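The sequences are printed in space-separated blocks of ten characters, so any calculation on them needs the whitespace removed first. As a rough sketch, GC content can be computed as the percentage of G and C bases in the cleaned sequence. The helper name `gc_percent` is hypothetical, and how the source database rounds its reported GC value is not stated here, so that part is left as an assumption.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First ten-base block of the gene sequence, used here only as a small demo.
first_block = "TTGCTTGCAG"
print(gc_percent(first_block))  # 50.0
```

Passing the full gene sequence, spaces and line breaks included, to the same function gives the value to compare against the GC content field.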