Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0407 |
Symbol | |
ID | 4808410 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 508249 |
End bp | 509556 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640105821 |
Product | radical SAM family protein |
Protein accession | YP_001036838 |
Protein GI | 125972928 |
COG category | [R] General function prediction only |
COG ID | [COG4277] Predicted DNA-binding protein with the Helix-hairpin-helix motif |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00162852 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAGCTTC AAAAAAAGCT TGAAATATTA TCCGCCGCCG CAAAATATGA TGTTTCGTGT TCCTCCAGCG GCAGCAACAG AAAAAATACC AAAGGAGGCC TGGGAAACGC AGCATCTTTC GGTATATGCC ATAGCTGGTC TGATGACGGA AGGTGTATTT CTCTTTTAAA AATCCTTCTT ACCAATTACT GCGTGTATGA TTGCGCCTAC TGTGTCAACA GAGTAAGCAA TGATATCCCA AGAGCTGCTT TTACACCCCA AGAAGTGGCA AATCTTACAA TAAACTTTTA CAGACGAAAC TATATTGAAG GTTTGTTCCT AAGCTCTGCA GTGGTAAAAA ACCCCAACCA TACAATGGAG CTCCTTTATG AATCGCTAAG GATTTTAAGG AAAGAATACA GATTCAACGG CTACATACAT GTCAAGGCTA TCCCGGGTGC TGACCTTGGC CTCATTGAAG CTGTCGGAAA ATTGGCTGAC AGAATGAGTG TCAACATCGA GCTCCCTTCC GAAAACGGAC TTAAGCTTCT GGCCCCTCAA AAAAATAAAC AGGCGATACT TAAGCCAATG AATTTTATAG CATCCAAGAT AACCGAAAAA AGGGATGAGA GAAAGGTATT TAAAAATGCA CCTTTATTTG TTCCCGGAGG TCAGAGTACT CAACTTATCG TGGGAGCCAC CCAGGACCAC GATATCAACA TTCTGAGGCT TTCTGAAAAC CTGTACAAAA AATACAAACT TAAAAGAGTT TATTATTCGG CATATGTACC TGTATCCAAA AATCCGCTTC TGCCCGATTT AAAGACTCCT CCGCTTCTTA GAGAACACAG GCTTTATCAG GCTGACTGGC TTTTGAGATT TTACGGTTTT TCAGCGGACG AGCTCCTTGA TGAGCGCAAT CCCGACTTTG ACCCCAAACT TGACCCAAAA ACAAACTGGG CAATTAACAA TATGTCCCTC TTTCCTGTTG AAATAAACCG CGCAGACTAC GAAATGCTGC TTAGAGTTCC GGGCATCGGA GTTCGCTCTG CCAAAAAAAT CATTATGGCA AGAAAAGTTA AATCATTATC CTTCGAAGAC TTAAAAAAAC TTGGTGTCGT TCTAAAACGT GCAAAATTCT TCATTACCTG TAATGGCAAG TATTTTTTCA ACTGTAACTT GGATCAGAAT TTAATAAGGC AAAACCTGAT TAATGGTTTT GAAGATAATG AAAAAAGGCA GGAATGGGAG CAAATTTCGA TTTTTTCTCT AATACCGGAA AAACCGACTC TTCAAGACCA AATAATGAGC ATAACCGGAG AAATATAA
|
Protein sequence | MELQKKLEIL SAAAKYDVSC SSSGSNRKNT KGGLGNAASF GICHSWSDDG RCISLLKILL TNYCVYDCAY CVNRVSNDIP RAAFTPQEVA NLTINFYRRN YIEGLFLSSA VVKNPNHTME LLYESLRILR KEYRFNGYIH VKAIPGADLG LIEAVGKLAD RMSVNIELPS ENGLKLLAPQ KNKQAILKPM NFIASKITEK RDERKVFKNA PLFVPGGQST QLIVGATQDH DINILRLSEN LYKKYKLKRV YYSAYVPVSK NPLLPDLKTP PLLREHRLYQ ADWLLRFYGF SADELLDERN PDFDPKLDPK TNWAINNMSL FPVEINRADY EMLLRVPGIG VRSAKKIIMA RKVKSLSFED LKKLGVVLKR AKFFITCNGK YFFNCNLDQN LIRQNLINGF EDNEKRQEWE QISIFSLIPE KPTLQDQIMS ITGEI
|
| |