Gene Cthe_0407 details


Gene Information

Locus tag: Cthe_0407
Symbol:
ID: 4808410
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 508249
End bp: 509556
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 41%
IMG OID: 640105821
Product: radical SAM family protein
Protein accession: YP_001036838
Protein GI: 125972928
COG category: [R] General function prediction only
COG ID: [COG4277] Predicted DNA-binding protein with the Helix-hairpin-helix motif
TIGRFAM ID:
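
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 508249
end_bp = 509556
protein_length_aa = 435

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus one terminal stop codon.
expected_gene_length_bp = (protein_length_aa + 1) * 3

print(gene_length_bp)           # 1308
print(expected_gene_length_bp)  # 1308
```

Both values agree with the listed gene length of 1308 bp.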


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00162852
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGAGCTTC AAAAAAAGCT TGAAATATTA TCCGCCGCCG CAAAATATGA TGTTTCGTGT 
TCCTCCAGCG GCAGCAACAG AAAAAATACC AAAGGAGGCC TGGGAAACGC AGCATCTTTC
GGTATATGCC ATAGCTGGTC TGATGACGGA AGGTGTATTT CTCTTTTAAA AATCCTTCTT
ACCAATTACT GCGTGTATGA TTGCGCCTAC TGTGTCAACA GAGTAAGCAA TGATATCCCA
AGAGCTGCTT TTACACCCCA AGAAGTGGCA AATCTTACAA TAAACTTTTA CAGACGAAAC
TATATTGAAG GTTTGTTCCT AAGCTCTGCA GTGGTAAAAA ACCCCAACCA TACAATGGAG
CTCCTTTATG AATCGCTAAG GATTTTAAGG AAAGAATACA GATTCAACGG CTACATACAT
GTCAAGGCTA TCCCGGGTGC TGACCTTGGC CTCATTGAAG CTGTCGGAAA ATTGGCTGAC
AGAATGAGTG TCAACATCGA GCTCCCTTCC GAAAACGGAC TTAAGCTTCT GGCCCCTCAA
AAAAATAAAC AGGCGATACT TAAGCCAATG AATTTTATAG CATCCAAGAT AACCGAAAAA
AGGGATGAGA GAAAGGTATT TAAAAATGCA CCTTTATTTG TTCCCGGAGG TCAGAGTACT
CAACTTATCG TGGGAGCCAC CCAGGACCAC GATATCAACA TTCTGAGGCT TTCTGAAAAC
CTGTACAAAA AATACAAACT TAAAAGAGTT TATTATTCGG CATATGTACC TGTATCCAAA
AATCCGCTTC TGCCCGATTT AAAGACTCCT CCGCTTCTTA GAGAACACAG GCTTTATCAG
GCTGACTGGC TTTTGAGATT TTACGGTTTT TCAGCGGACG AGCTCCTTGA TGAGCGCAAT
CCCGACTTTG ACCCCAAACT TGACCCAAAA ACAAACTGGG CAATTAACAA TATGTCCCTC
TTTCCTGTTG AAATAAACCG CGCAGACTAC GAAATGCTGC TTAGAGTTCC GGGCATCGGA
GTTCGCTCTG CCAAAAAAAT CATTATGGCA AGAAAAGTTA AATCATTATC CTTCGAAGAC
TTAAAAAAAC TTGGTGTCGT TCTAAAACGT GCAAAATTCT TCATTACCTG TAATGGCAAG
TATTTTTTCA ACTGTAACTT GGATCAGAAT TTAATAAGGC AAAACCTGAT TAATGGTTTT
GAAGATAATG AAAAAAGGCA GGAATGGGAG CAAATTTCGA TTTTTTCTCT AATACCGGAA
AAACCGACTC TTCAAGACCA AATAATGAGC ATAACCGGAG AAATATAA
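
The gene sequence is printed in blocks of ten bases separated by spaces, so it has to be joined before its length or GC content can be compared with the values in the record. The following is a minimal sketch of that clean-up; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tool, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a non-empty sequence."""
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, used only as sample input.
sample = "GTGGAGCTTC AAAAAAAGCT TGAAATATTA TCCGCCGCCG CAAAATATGA TGTTTCGTGT"
joined = clean_sequence(sample)

print(len(joined))  # 60
print(f"{gc_fraction(joined):.0%}")
```

Applied to the full sequence, the same two calls would give the figures to compare against the listed gene length (1308 bp) and GC content (41%).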
 
Protein sequence
MELQKKLEIL SAAAKYDVSC SSSGSNRKNT KGGLGNAASF GICHSWSDDG RCISLLKILL 
TNYCVYDCAY CVNRVSNDIP RAAFTPQEVA NLTINFYRRN YIEGLFLSSA VVKNPNHTME
LLYESLRILR KEYRFNGYIH VKAIPGADLG LIEAVGKLAD RMSVNIELPS ENGLKLLAPQ
KNKQAILKPM NFIASKITEK RDERKVFKNA PLFVPGGQST QLIVGATQDH DINILRLSEN
LYKKYKLKRV YYSAYVPVSK NPLLPDLKTP PLLREHRLYQ ADWLLRFYGF SADELLDERN
PDFDPKLDPK TNWAINNMSL FPVEINRADY EMLLRVPGIG VRSAKKIIMA RKVKSLSFED
LKKLGVVLKR AKFFITCNGK YFFNCNLDQN LIRQNLINGF EDNEKRQEWE QISIFSLIPE
KPTLQDQIMS ITGEI
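
The protein sequence starts with M even though the gene sequence starts with GTG rather than ATG. This is consistent with translation table 11, where GTG can serve as a start codon and the initiating residue is methionine. The sketch below illustrates that relationship with a hand-built codon table; it assumes the first codon of the input is the start codon, and `translate_cds` is a hypothetical helper, not the tool used to produce this record.

```python
from itertools import product

# Codon assignments in TCAG order. Table 11 uses the same assignments as the
# standard code; it differs in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading the first codon as Met."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    residues = []
    for index in range(0, usable, 3):
        aa = CODON_TABLE[dna[index:index + 3]]
        if aa == "*":  # stop codon ends translation
            break
        # Assumption: the first codon is the start codon, so it encodes Met.
        residues.append("M" if index == 0 else aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGAGCTTC AAAAAAAGCT TGAAATATTA"))  # MELQKKLEIL
```

The output matches the first ten residues of the protein sequence listed above.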