Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1538 |
Symbol | |
ID | 4810045 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1863582 |
End bp | 1864706 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106957 |
Product | XRE family transcriptional regulator |
Protein accession | YP_001037958 |
Protein GI | 125974048 |
COG category | [K] Transcription |
COG ID | [COG1813] Predicted transcription factor, homolog of eukaryotic MBF1 |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGAAA TCAATATAGC CAGAACCATC GTTAAAATGC GGCGTGAGAA AGGACTGACG CAGGAAGACA TTGCAAATTA CATTGGCGTG TCGAAGGCTT CGGTTTCTAA ATGGGAAACC GGTCAGAGTT ATCCTGACAT TACTTTTCTG CCGCAGCTTG CGACACTTTT TAATATAAGC ATTGATGAGC TCATGGGTTA TGAACCTCAA ATGAGTAAAG AGGATATCCG TAAACTGTAC GTGAAATTAT CTGCCGATTT TGCTTCCAAA CCTTTTGATG AAGTATTGAA TTCTTGCCGC GAAATTGCTA AAAAGTATTT CTCCTGCTTT CACCTGTTAT TCCACATTGG ATTGCTGCTT GTAAACAACA GCACGGAATC GGGAGACAAG GAAAAAACCC TTTCTGTGCT TTCGGAAGCC AAAGAGCTGT TTGTTCGGGT AAAAACAGAA AGTGATGATG CCGAGCTTGT GCAACTTTCC TTATGTATGG AGGCATGCTG CGCGTTGATG ATGGGAAATC CGAACGAAGT AATTGAGCTT TTGGAGGGAA CAAGAAAAAA AATCATTTCC AGTGAAACGA TTCTTGCTTC GGCCTATCAA ATGATTGGTA AATCGAAAGA AGCCAAAATG ACATTACAAG CTGCTATATA TCAGCATATG TGTAATCTCT TTAGTGCATT AACCGATTAT CTTTTGCTTT GTACGGACAC TCCCGAACAG TTTGATAAAA CGCTGAAGCG TGCAAATGAC ATTGCTGAAG CTTTTGACTT GAAAAAGCTT CATCCGTCGT TGCTCATGAA GCTCTACATC ATTGCTGCCC AGGGATACAT GATGCTTGGG AGTAAAGAAA AGTCTCTGGA AATTCTTGAA AAATACACGG AACTTGTCAC CGGTGATATT TACCCATTGC AGCTAAAAGG AGACGAATAT TTTAATCTGA TAGATCAGTG GATTGAAGAG CTGGACTTGG GGAATGCTCT TCCAAGAGAT GAAAAAATTA TACGCAAGAG CATGGCTGAC GGAGTCATCA ATAATCCTGC GTTTACAATA TTGGCTGATG AAATCCGGTT TAGGAGAATT GCAGAAAAAC TGAAGAATAA CTGTTATCAA CAAGACGCAC CATGA
|
Protein sequence | MKEINIARTI VKMRREKGLT QEDIANYIGV SKASVSKWET GQSYPDITFL PQLATLFNIS IDELMGYEPQ MSKEDIRKLY VKLSADFASK PFDEVLNSCR EIAKKYFSCF HLLFHIGLLL VNNSTESGDK EKTLSVLSEA KELFVRVKTE SDDAELVQLS LCMEACCALM MGNPNEVIEL LEGTRKKIIS SETILASAYQ MIGKSKEAKM TLQAAIYQHM CNLFSALTDY LLLCTDTPEQ FDKTLKRAND IAEAFDLKKL HPSLLMKLYI IAAQGYMMLG SKEKSLEILE KYTELVTGDI YPLQLKGDEY FNLIDQWIEE LDLGNALPRD EKIIRKSMAD GVINNPAFTI LADEIRFRRI AEKLKNNCYQ QDAP
|
| |