Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0120 |
Symbol | |
ID | 4808678 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 151334 |
End bp | 152074 |
Gene Length | 741 bp |
Protein Length | 246 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640105531 |
Product | RNA polymerase, sigma 28 subunit |
Protein accession | YP_001036554 |
Protein GI | 125972644 |
COG category | [K] Transcription |
COG ID | [COG1191] DNA-directed RNA polymerase specialized sigma subunit |
TIGRFAM ID | [TIGR02885] RNA polymerase sigma-F factor [TIGR02937] RNA polymerase sigma factor, sigma-70 family [TIGR02980] RNA polymerase sigma-70 factor, sigma-B/F/G subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGAAT TGCATGAAAA GGAAACTCTG GAATTAATAG CAGCTGCCAA GGCAGGAGAC AAAGATGCCC AATCCCTGTT GGTGGAAAAA AATGTTGGTC TTGTATGGAG TATCGTGCGA AGATTTCAAA ACAGAGGATA TGAAGCTGAA GATATTTTCC AAATCGGATG CATAGGACTT ATTAAAGCCA TAAACAAGTT TGACTGCTCA TACGATGTCA AATTTTCCAC CTATGCGGTT CCGATGATAA TTGGAGAAAT AAAACGCTTT ATTCGCGATG ACGGAATGAT AAAAGTAAGC CGTTCTCTTA AGGAGCTGGC AAATAAAGCC AGGATCACCA AGGAAATTAT GTCAAAGGAA TTGGGGCGGG AGCCGACTGT CGGTGAAATT TCGGAGCAGC TCGGCATTCC CGTTGAAGAA GTGGTTATGG CAATGGAGGC AAGTTGTACT CCTGAATCCT TATACAGTAC ACTGGGAGAG GGAGACAATT CCTCCACCCT TCTTATTGAT AAAATTGCAA ATGAAAGCGA AAACCAGGAA GTGGACATTG TAGACAGAAT CGACTTAAGA AAGGTATTGG ATACTTTAAA GCCACGGGAA AAGCAGATTA TTGTTTTGAG GTATTTTAAA GAAAAAACCC AGGTTCAAAT TGCGAAAATG CTGGGGATAT CCCAGGTACA GGTGTCAAGG ATTGAAAAGA AGATATTGGA AGAGATCCGA AAAAAAATAA AATACAATTA A
|
Protein sequence | MDELHEKETL ELIAAAKAGD KDAQSLLVEK NVGLVWSIVR RFQNRGYEAE DIFQIGCIGL IKAINKFDCS YDVKFSTYAV PMIIGEIKRF IRDDGMIKVS RSLKELANKA RITKEIMSKE LGREPTVGEI SEQLGIPVEE VVMAMEASCT PESLYSTLGE GDNSSTLLID KIANESENQE VDIVDRIDLR KVLDTLKPRE KQIIVLRYFK EKTQVQIAKM LGISQVQVSR IEKKILEEIR KKIKYN
|
| |