Gene Information |
Locus tag | Cthe_0572 |
Symbol | |
ID | 4808247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 699532 |
End bp | 700581 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640105986 |
Product | ribosomal RNA large subunit methyltransferase N |
Protein accession | YP_001037001 |
Protein GI | 125973091 |
COG category | [R] General function prediction only |
COG ID | [COG0820] Predicted Fe-S-cluster redox enzyme |
TIGRFAM ID | [TIGR00048] radical SAM enzyme, Cfr family |
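The coordinates and lengths listed above are mutually consistent, which can be checked with a short calculation. The Python sketch below assumes 1-based inclusive coordinates and that the gene length includes the stop codon. The constant and function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 699532
END_BP = 700581
GENE_LENGTH_BP = 1050
PROTEIN_LENGTH_AA = 349


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval (assumed coordinate convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes the stop is counted in the gene length)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1050
print(expected_protein_length(GENE_LENGTH_BP))  # 349
```

Under these assumptions, both derived values agree with the Gene Length and Protein Length fields.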
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00705994 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGCAA AAGCAGACCT TCTGAGTATG ACGATAGAGG AACTTGAGAA TCTGATGGCT GAAATGGGAG AGCAAAAGTT TCGCGCAAAA CAGATTTTTC AGTGGACCAA TAAAGGAATT AAAGATATTG ATGCCATGAC GAATCTCTCA AAAGACTTAA GGGAAAAGTT AAAGGAAAGA GCATATATAA ACAGGCTTGA AGTCATAAAA AAGTTTGTTT CGAAAATAGA CGGCACAATT AAGTATTTGT TCAAATTAAA TGACGGCAAT ATAATTGAGA GTGTGCTTAT GCAATATTTG CATGGCTATA GTGCCTGTAT TTCTTCCCAG GTGGGCTGCA AAATGGGATG CAAATTTTGC GCATCCACAG GTGTCGGATT TGTAAGGAAT CTTACGCCGG GAGAGATGCT TGACCAGATT CTGACCATAC AGAATGATAC AAAAAACAGA ATTGGAAATG TAGTAATAAT GGGCATAGGA GAGCCTCTGG ACAACTATGA AAACGTGGTG AAGTTCTTAA GGCTTGTAAA TCACAAGGAC GGTATTAATT TAGGGGCGAG ACACATTTCG GTTTCAACCT GTGGGCTTGT TCCTGAGATT TTAAGGCTGG CAGAGGAAAA AATACCTGTT ACCCTGTCCA TTTCACTTCA TGCTCCAAAT GATGAAATCA GAGAAAAAAT TATGCCTATA AATAAAAGGT ATTCTATTGA CAAAATAATT GAAGCATGTA AGATATATAC TGAGACTACT AATAGAAGAA TTACCTTTGA GTATGCTATG ATTGACGGTT TGAATGATTC AAAGGAAAAT GCGCTGGAAC TTGCAAAAAG AATCCGGGGC ATGTTGTGTC ATGTCAATCT GATACCGGTA AATACCGTAT CAGATACCGG GTTTAAGAGA AGTTCGAGGG AAAAAATAAC GGCGTTCAAG GAAATCCTTG AAAGGTTTGG TGTTGAAACA ACTGTAAGAC GCGAGCTTGG AAGTGACATA AATGCGGCAT GCGGACAGCT TCGTAGAAAC CTTGTGGAAA ACGGACAATT GGTATATTAG
|
Protein sequence | MDAKADLLSM TIEELENLMA EMGEQKFRAK QIFQWTNKGI KDIDAMTNLS KDLREKLKER AYINRLEVIK KFVSKIDGTI KYLFKLNDGN IIESVLMQYL HGYSACISSQ VGCKMGCKFC ASTGVGFVRN LTPGEMLDQI LTIQNDTKNR IGNVVIMGIG EPLDNYENVV KFLRLVNHKD GINLGARHIS VSTCGLVPEI LRLAEEKIPV TLSISLHAPN DEIREKIMPI NKRYSIDKII EACKIYTETT NRRITFEYAM IDGLNDSKEN ALELAKRIRG MLCHVNLIPV NTVSDTGFKR SSREKITAFK EILERFGVET TVRRELGSDI NAACGQLRRN LVENGQLVY
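The gene and protein sequences above are displayed in blocks of ten characters. The following is a minimal Python sketch of how one might strip that spacing, translate the nucleotide sequence, and compute GC content. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in their permitted start codons). To stay short, it uses only the first three and last three blocks of the gene sequence. The function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three and last three blocks of the gene sequence above (both in frame).
gene_start = clean("ATGGACGCAA AAGCAGACCT TCTGAGTATG")
gene_end = clean("CTTGTGGAAA ACGGACAATT GGTATATTAG")

print(translate(gene_start))  # MDAKADLLSM
print(translate(gene_end))    # LVENGQLVY
```

The two translated fragments match the first and last blocks of the protein sequence in the record. Passing the full cleaned gene sequence to `gc_percent` would give the value to compare against the GC content field.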