Gene Cthe_1538 details

Gene Information

Locus tag: Cthe_1538
Symbol:
ID: 4810045
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1863582
End bp: 1864706
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 41%
IMG OID: 640106957
Product: XRE family transcriptional regulator
Protein accession: YP_001037958
Protein GI: 125974048
COG category: [K] Transcription
COG ID: [COG1813] Predicted transcription factor, homolog of eukaryotic MBF1
TIGRFAM ID:
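
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon is a stop codon that does not appear in the protein. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a complete CDS: one residue per codon, minus the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(1863582, 1864706)
print(length)                     # 1125
print(protein_length_aa(length))  # 374
```

Both printed values agree with the Gene Length and Protein Length fields in the record.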


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGAAA TCAATATAGC CAGAACCATC GTTAAAATGC GGCGTGAGAA AGGACTGACG 
CAGGAAGACA TTGCAAATTA CATTGGCGTG TCGAAGGCTT CGGTTTCTAA ATGGGAAACC
GGTCAGAGTT ATCCTGACAT TACTTTTCTG CCGCAGCTTG CGACACTTTT TAATATAAGC
ATTGATGAGC TCATGGGTTA TGAACCTCAA ATGAGTAAAG AGGATATCCG TAAACTGTAC
GTGAAATTAT CTGCCGATTT TGCTTCCAAA CCTTTTGATG AAGTATTGAA TTCTTGCCGC
GAAATTGCTA AAAAGTATTT CTCCTGCTTT CACCTGTTAT TCCACATTGG ATTGCTGCTT
GTAAACAACA GCACGGAATC GGGAGACAAG GAAAAAACCC TTTCTGTGCT TTCGGAAGCC
AAAGAGCTGT TTGTTCGGGT AAAAACAGAA AGTGATGATG CCGAGCTTGT GCAACTTTCC
TTATGTATGG AGGCATGCTG CGCGTTGATG ATGGGAAATC CGAACGAAGT AATTGAGCTT
TTGGAGGGAA CAAGAAAAAA AATCATTTCC AGTGAAACGA TTCTTGCTTC GGCCTATCAA
ATGATTGGTA AATCGAAAGA AGCCAAAATG ACATTACAAG CTGCTATATA TCAGCATATG
TGTAATCTCT TTAGTGCATT AACCGATTAT CTTTTGCTTT GTACGGACAC TCCCGAACAG
TTTGATAAAA CGCTGAAGCG TGCAAATGAC ATTGCTGAAG CTTTTGACTT GAAAAAGCTT
CATCCGTCGT TGCTCATGAA GCTCTACATC ATTGCTGCCC AGGGATACAT GATGCTTGGG
AGTAAAGAAA AGTCTCTGGA AATTCTTGAA AAATACACGG AACTTGTCAC CGGTGATATT
TACCCATTGC AGCTAAAAGG AGACGAATAT TTTAATCTGA TAGATCAGTG GATTGAAGAG
CTGGACTTGG GGAATGCTCT TCCAAGAGAT GAAAAAATTA TACGCAAGAG CATGGCTGAC
GGAGTCATCA ATAATCCTGC GTTTACAATA TTGGCTGATG AAATCCGGTT TAGGAGAATT
GCAGAAAAAC TGAAGAATAA CTGTTATCAA CAAGACGCAC CATGA
 
Protein sequence
MKEINIARTI VKMRREKGLT QEDIANYIGV SKASVSKWET GQSYPDITFL PQLATLFNIS 
IDELMGYEPQ MSKEDIRKLY VKLSADFASK PFDEVLNSCR EIAKKYFSCF HLLFHIGLLL
VNNSTESGDK EKTLSVLSEA KELFVRVKTE SDDAELVQLS LCMEACCALM MGNPNEVIEL
LEGTRKKIIS SETILASAYQ MIGKSKEAKM TLQAAIYQHM CNLFSALTDY LLLCTDTPEQ
FDKTLKRAND IAEAFDLKKL HPSLLMKLYI IAAQGYMMLG SKEKSLEILE KYTELVTGDI
YPLQLKGDEY FNLIDQWIEE LDLGNALPRD EKIIRKSMAD GVINNPAFTI LADEIRFRRI
AEKLKNNCYQ QDAP
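
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a rough illustration, the sketch below translates short excerpts of the gene sequence using the standard codon assignments, which translation table 11 shares for all sense and stop codons (the tables differ in permitted start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example.

```python
from itertools import product

# Codon assignments in NCBI order: each position cycles through T, C, A, G,
# with the third base varying fastest. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGAAAGAAA TCAATATAGC CAGAACCATC"))  # MKEINIARTI
# Last 15 bases of the gene sequence, ending in the TGA stop codon.
print(translate("CAAGACGCAC CATGA"))                  # QDAP
```

The two outputs match the first ten and last four residues of the protein sequence listed above.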