Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2932 |
Symbol | |
ID | 4810215 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3448005 |
End bp | 3448952 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640108355 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_001039323 |
Protein GI | 125975413 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAGAAA TTGAAAAGCC GAAGATAGAA TGTGTAGTAT GTAGTGAAGA CAACAGATAT GGAAAATTTG TAGTTGAACC GTTGGAGCGA GGATACGGCA TTACTCTTGG CAATTCTCTG CGAAGAATAT TGCTTTCATC TTTGCCCGGT GTCGCTGTGA CATCAATCAA AATAGACGGC ATACTGCACG AGTTTTCAAC AATACCCGGT GTAATTGAAG ATGTGACCGA AATAATCCTT AATATAAAAG AGCTGTCATT GAATTTCCAC GGAGAAGGAC CGAAAGTTAT ATATATTGAT GCCGAGGGAG AAGGAGAAGT TAAGGCAAAA GATATTAAGG CGGATGCCGA TGTTGAAATT CTCAACCCGG AACACAAAAT TGCGACATTG AGCGGTGACC ACAGACTTTA TATGGAAATG ACCATTGACA AGGGAAGAGG ATATGTATCT GCCGAAAAGA ACAAACATCC CGGCCAGCCG ATAGGGGTTA TACCTGTTGA TTCAATTTTC ACACCGGTAC ACAAGGTTAA CTATACGGTG GAAAACACAC GTGTCGGACA GGTTACCGAC TATGACAAGC TGACTTTGGA AGTTTGGACA AACGGAAGCA TCAAGCCTGA TGAAGCAATC AGTCTGGGTG CGAAAATATT AAGTGAGCAT CTCAACTTGT TTATCGATTT GTCTGATAAT GCGAAGAATG CTGAAATAAT GGTTGAAAAA GAAGAAACCA AGAAAGAAAA AGTTCTTGAA ATGACTATTG AAGAACTTGA TCTGTCGGTA AGATCTTACA ACTGCTTGAA GAGAGCGGGT ATAAACACGG TTGAGGATCT TATAAGCAGA ACCGAGGAAG ATATGATGAA GGTCAGAAAC CTTGGCAGAA AGTCTCTTGA AGAAGTCGTA AACAAGTTGA AAGCTTTGGG ATTGTCATTG GCACCAAGTG AAGACTAA
|
Protein sequence | MIEIEKPKIE CVVCSEDNRY GKFVVEPLER GYGITLGNSL RRILLSSLPG VAVTSIKIDG ILHEFSTIPG VIEDVTEIIL NIKELSLNFH GEGPKVIYID AEGEGEVKAK DIKADADVEI LNPEHKIATL SGDHRLYMEM TIDKGRGYVS AEKNKHPGQP IGVIPVDSIF TPVHKVNYTV ENTRVGQVTD YDKLTLEVWT NGSIKPDEAI SLGAKILSEH LNLFIDLSDN AKNAEIMVEK EETKKEKVLE MTIEELDLSV RSYNCLKRAG INTVEDLISR TEEDMMKVRN LGRKSLEEVV NKLKALGLSL APSED
|
| |