Gene Information |
Locus tag | Cthe_2428 |
Symbol | |
ID | 4808144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2901065 |
End bp | 2901973 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640107842 |
Product | HemK family modification methylase |
Protein accession | YP_001038823 |
Protein GI | 125974913 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG2890] Methylase of polypeptide chain release factors |
TIGRFAM ID | [TIGR00536] HemK family putative methylases; [TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific |
|
|
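The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the listed values are consistent with that convention) and that the final codon of the CDS is a stop codon that does not appear in the protein. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record for Cthe_2428.
length = gene_length_bp(2901065, 2901973)
print(length, protein_length_aa(length))  # prints: 909 302
```

Both results agree with the Gene Length (909 bp) and Protein Length (302 aa) rows.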
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.185206 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATACTGA AAGATGCACT GTTGATGGGA ACAAAGCTTC TTAAGTCAGC GGATATTGAT ACCCCGGCGT TGGAGGCCGG GGTACTTTTG TGCCGTGTTT TGAATGTGGA CAGAAGTTAT TTGTATTCTC ATGATGATTA CAACATGACC GAAGAGGAGT ATAAAAAGTT TACCTTGTTT CTTGAGGAAA GAATCAAAGG AAAACCTCTT CAATACATAA CCGGGCACCA AGAATTTATG TCCCTTGATT TTATTGTAAC GCCGGACGTA TTGATACCGA GACAGGACAC AGAGACCCTT GTTGAGGCTG TGTTGACGCA TGTAAAAAGT ACCGGCCTTG AGAATGCAAG AATACTCGAT ATAGGCACCG GCTCGGGATG TATAGCCGTA AGCCTTGCAC ATTTTCTGAA AGACAGCAGG GTTCTTGCAT TGGATATTTC TGAGAAAGCG CTTGAAATTG CCGAAACAAA CGCAAAGAGA TGTGGTGTGT GGGATCGGAT GTTTTTTCTT AAAGGAGATG CGTTGGAAGG ACTTGCCGGC ATTATAGCCC AAAGTCCTTT TGCAAAAGAC TTTGAACGCA AGGGAGAAGG ATTTTTTGAC ATTATTGTTT CAAATCCTCC CTACATACCG TCGGAAGAAA TAAAGACCCT CCACAAACAG GTAAAGGATT ATGAGCCTCG CACGGCGCTG GACGGGGGTA TTGACGGCCT TGACTTTTAC AGGGCCATAA CCTGTGAAGC AGCAAAACTG TTAAGTACGG ATTCGTTGCT GGCTTTTGAG GTAGGCTATA ATCAGGCGGA AAATGTTTCA GAATTTATGA AAGAAAGCTT TTCTGCCATT AAAGTCGTAA AGGATTTGGC AGGAATTGAC CGGGTGGTGA TGGGCTGCAG GAAACAGCTG AAAGATTAA
|
Protein sequence | MILKDALLMG TKLLKSADID TPALEAGVLL CRVLNVDRSY LYSHDDYNMT EEEYKKFTLF LEERIKGKPL QYITGHQEFM SLDFIVTPDV LIPRQDTETL VEAVLTHVKS TGLENARILD IGTGSGCIAV SLAHFLKDSR VLALDISEKA LEIAETNAKR CGVWDRMFFL KGDALEGLAG IIAQSPFAKD FERKGEGFFD IIVSNPPYIP SEEIKTLHKQ VKDYEPRTAL DGGIDGLDFY RAITCEAAKL LSTDSLLAFE VGYNQAENVS EFMKESFSAI KVVKDLAGID RVVMGCRKQL KD
|
| |
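The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for working with them follows: it strips the spacing, computes a GC percentage, and checks for a terminal stop codon. The helper names are hypothetical, and the stop codon set (TAA, TAG, TGA) is the one used by translation table 11, which also permits GTG as an initiation codon read as methionine.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
TABLE_11_STOPS = {"TAA", "TAG", "TGA"}


def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a block-grouped sequence and upper-case it."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return len(seq) >= 3 and seq[-3:] in TABLE_11_STOPS


# Illustration on the first two blocks of the gene sequence only;
# paste the full "Gene sequence" value to check the whole record.
fragment = clean_sequence("GTGATACTGA AAGATGCACT")
print(len(fragment), gc_percent(fragment), ends_with_stop(fragment))
```

Applied to the complete gene sequence, the cleaned length and rounded GC percentage can be compared against the Gene Length and GC content rows of the record.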