Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1955 |
Symbol | |
ID | 4810738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2327899 |
End bp | 2330055 |
Gene Length | 2157 bp |
Protein Length | 718 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640107371 |
Product | RNA binding S1 |
Protein accession | YP_001038366 |
Protein GI | 125974456 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGATA TTATTTCCAC ACTTGTAAAA GAGTTTAATC TCAAGCCTTT CCAGGTGGAA AACACCGTAA AACTTATTGA CAGCGGCAAT ACCATTCCCT TTATTGCAAG GTACAGGAAA GAAATAACGG GAGAATTGAA CGATCAGGTG CTAAGGCAGC TTCATGAAAG ACTGATTTAC CTGAGGAACC TTGAGGCAAG AAAAGAAGAA GTTCGGCGCC TGATTGACGA GCAGGGAAAG CTTACCGCGG AAATTACGGC ATCTCTTGAA AAAGCGACCA CCCTCCGGGA GGTTGAAGAT ATATACAGGC CTTTCAGGCC AAAAAGAAGA ACCAGGGCGA CTGTTGCAAA GGAAAAAGGA CTTGAACCTT TGGCCGAAAT TATTATGGCC CAGGAACTTA AAACCGGCAG TATTGAAGAC ATAGCCAAGC CTTTTATAAA TCCTGAGAAG GAAGTCAATA CCGTCGAGGA TGCCTTAAAC GGAGCAATGG ACATAATAGC CGAAGACATT TCGGACAATC CGCATATCAG AAGTATTGTG CGGGACGTGT TTATGAAACA AGGAATGATT GTGTCCAAAA AGAAGAAGGA TGAAGATTCG GTATACAGAA TGTACTATGA TTTCTCGGAA CCGGTGGCAA AAATAGCCGG TCACAGGGTT CTTGCGATAA ACAGGGGAGA AAAGGAAGAG TTTTTGCAAG TCAAAATTGA AGTTCCTGAA GAAACGCTTA TGGAGCAGCT TAAGGCGAAA CTTGTAAAGA GGCCTCCTTC CATAACGTCG GAATATGTAG AAAAGGCGTT GGCGGACTCT TATGAGCGCC TTATTTTTCC TTCGGTTGAG AGGGAAGTAA GGAATGAACT TACGGAGAAT GCCGAGGAAC AGGCGATAAA GGTCTTTGCG ACCAATCTTA AAAATCTTTT GCTCCAGCCT CCTGTGAAAG GAAAAACCGT TTTGGGGCTT GACCCTGCAT ACAGGACGGG CTGCAAAATT GCAGTGGTGG ATGAGACGGG AAAAGTACTT GACACTGCCG TAATATATCC GACACCTCCC CAGAACAAGG TTGAGGAAGC AAAAGAGATT ATGAAGCGGC TTATTGAGAA ACACGGTGTT GATATAATAT CAATAGGCAA CGGGACTGCT TCGAGGGAGT CTGAAATATT TGTCGCCGAG CTTTTGAAGG AGATAGACAG AAAAGTTTAC TATATGGTGG TAAGCGAAGC GGGAGCTTCG GTTTATTCCG CTTCGAAGCT TGGGGCGGAG GAATTTCCCG ACTTTGACGT GGCTTTAAGA AGTGCTGTGT CCATAGCCAG AAGGCTTCAG GACCCATTGG CGGAGCTGGT TAAAATAGAT CCCAAATCCA TAGGCGTGGG CCAGTACCAG CACGACATGA ATCAAAAGCG GCTGAGTGAG ACTTTGCAGG GCGTGGTTGA AGATTGTGTA AACAGCGTGG GCGTTGACCT GAATACGGCC TCACCGTCTC TTTTGTCTTA CATCTCGGGA ATAAACTCCG TAATTGCAAA AAATATTGTG GAATACAGGG AAACCAACGG AAAGTTTAAA AGAAGAGAAG AACTCAAAAA AGTTAAGAAA CTAGGTGACA AAACTTTCGA GCAATGTGCC GGCTTTCTTA GGATACCTGA CGGAGACAAT GTTCTTGACA ATACTTCCGT ACATCCGGAG TCTTATGAGG CGGCCAAAAA GCTTCTTGAT ATTATGGGAT ACAGCCTTGA AGATGTGAAG AACAGAAAAC TTGATGGACT TGTGGAAAAA GTGGAGAAAA TGGGTATGGA AAAAGTTGCC AGGGAGATTG GTGTCGGAGT GCCGACTTTG AAAGATATTA TAAAAGAGCT TTTAAAGCCT GGACGCGACC CCAGGGATGA GCTTCCGAAA CCGATGCTTC TTACCGACGT GCTGCATTTG GAGGATTTGA GGCCGGGCAT GATATTGACC GGAACCGTAA GGAATGTTGC CGACTTTGGT GCCTTTGTGG ATGTGGGAGT GCACCAGGAC GGGCTGGTTC ACATATCCGA GCTTAGCGAC AAGTATGTAA AAAGTCCCAT GGATGTGGTG TCGGTGGGGG ATATAGTGAA GGTCAGAATT TTGGATGTTG ATGTTGAAAG AAAAAGAATA TCCATGAGCA TGAAGGGTGT CAATTAA
|
Protein sequence | MSDIISTLVK EFNLKPFQVE NTVKLIDSGN TIPFIARYRK EITGELNDQV LRQLHERLIY LRNLEARKEE VRRLIDEQGK LTAEITASLE KATTLREVED IYRPFRPKRR TRATVAKEKG LEPLAEIIMA QELKTGSIED IAKPFINPEK EVNTVEDALN GAMDIIAEDI SDNPHIRSIV RDVFMKQGMI VSKKKKDEDS VYRMYYDFSE PVAKIAGHRV LAINRGEKEE FLQVKIEVPE ETLMEQLKAK LVKRPPSITS EYVEKALADS YERLIFPSVE REVRNELTEN AEEQAIKVFA TNLKNLLLQP PVKGKTVLGL DPAYRTGCKI AVVDETGKVL DTAVIYPTPP QNKVEEAKEI MKRLIEKHGV DIISIGNGTA SRESEIFVAE LLKEIDRKVY YMVVSEAGAS VYSASKLGAE EFPDFDVALR SAVSIARRLQ DPLAELVKID PKSIGVGQYQ HDMNQKRLSE TLQGVVEDCV NSVGVDLNTA SPSLLSYISG INSVIAKNIV EYRETNGKFK RREELKKVKK LGDKTFEQCA GFLRIPDGDN VLDNTSVHPE SYEAAKKLLD IMGYSLEDVK NRKLDGLVEK VEKMGMEKVA REIGVGVPTL KDIIKELLKP GRDPRDELPK PMLLTDVLHL EDLRPGMILT GTVRNVADFG AFVDVGVHQD GLVHISELSD KYVKSPMDVV SVGDIVKVRI LDVDVERKRI SMSMKGVN
|
| |