Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0930 |
Symbol | |
ID | 4811223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1118172 |
End bp | 1119188 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640106349 |
Product | radical SAM family protein |
Protein accession | YP_001037357 |
Protein GI | 125973447 |
COG category | [B] Chromatin structure and dynamics [K] Transcription |
COG ID | [COG1243] Histone acetyltransferase |
TIGRFAM ID | [TIGR01210] conserved hypothetical protein TIGR01210 [TIGR01212] radical SAM protein, TIGR01212 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000118017 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTCTA AGCATATTGT CATACCAATT TTTATTCCTC ACAAAGGATG TCCTTTTGAC TGTATATATT GCAATCAAAA ATATATAAGC GGTCAAAAAG ATGACATGAC CGAAGAAAAA ATGATATCGA TTATCGAGTC CCATATTGAT TCTGCCTGTG AAGATACGTA CATTGAGATA GGCTTTTACG GAGGCAGCTT TACAGGTATA GAAAGAGAAG AACAATATAG GTATCTTGAG ACGGCCAACA GGTATATAAA AGAGGGAAAG GTCAAAAGCA TACGGCTCTC AACCAGGCCG GACTATATAA ACGAAGAAAT TCTTGATTAC CTCGAAAAAT ATTCCGTAAA GACAATAGAG CTTGGGGTTC AAAGTCTTGA CAGGGAAGTT CTTGAGAAAA GCTGCAGGGG ACACAGTGTT GAGGATGTTT ACAATGCTTC GGCCCTTATT AAGAAAAGAG GCTTTGTACT TGGGATACAA ACAATGATAG GGCTTCCGGG AGACAGCAGA AAGAAGGCTC TTCATACTGC AGAGGAAGTT GTTAAAATAA AGCCTGATAT TTTAAGGATT TATCCCACAT TAGTGGTAAG GGGTACCTAT CTTGAAAAGA TGTATATAAA AGGTGAATAC ACCCCTTTGG AGCTTGAGGA AGCCGTTGAA CTTTGTGCCG AGCTTCTTTA TATTTATAAA AAGAACAATA TAAATGTGAT AAGAATCGGG CTTCAGCCCA CCGAGAGCAT AAACGAGGGT GGCGATGTTA TAGCAGGGCC TTTTCATCCT GCCTTCAGGC AGCTGGTGGA ATCAAAAATG GCACTTAGTG CTATTGAAAA GGCGATTGTG GAGAAAAATT TGTCGAAAAA AGACACCCTT GTAATTTGCA CTGATAAAAA AGAGATATCA AATGTTATAG GCCAAGGAAG GAAAAATGTA GAATATTTAC GAAAAAAGTA TGGCTTTGAT AAAATAATTG TCAGAGAATA TAATGTGGGA CATGAAATTT ATGATATAAA ATATTAA
|
Protein sequence | MASKHIVIPI FIPHKGCPFD CIYCNQKYIS GQKDDMTEEK MISIIESHID SACEDTYIEI GFYGGSFTGI EREEQYRYLE TANRYIKEGK VKSIRLSTRP DYINEEILDY LEKYSVKTIE LGVQSLDREV LEKSCRGHSV EDVYNASALI KKRGFVLGIQ TMIGLPGDSR KKALHTAEEV VKIKPDILRI YPTLVVRGTY LEKMYIKGEY TPLELEEAVE LCAELLYIYK KNNINVIRIG LQPTESINEG GDVIAGPFHP AFRQLVESKM ALSAIEKAIV EKNLSKKDTL VICTDKKEIS NVIGQGRKNV EYLRKKYGFD KIIVREYNVG HEIYDIKY
|
| |