Gene Information |
Locus tag | Moth_2396 |
Symbol | |
ID | 3830763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2516791 |
End bp | 2517642 |
Gene Length | 852 bp |
Protein Length | 283 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637830315 |
Product | HemK family modification methylase |
Protein accession | YP_431221 |
Protein GI | 83591212 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG2890] Methylase of polypeptide chain release factors |
TIGRFAM ID | [TIGR00536] HemK family putative methylases; [TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.299117 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCTGC GGCAAGCCCT GGGGGAGGCC GTCCGGCGCC TGGCGGCCGG GGGGGTTGAA CGCCCGCGGC TGGAGGCGGA AGTCCTTCTC GGGTGGGCCT GTAGTTTAAC CCGGCCCCGC CTCCTGGCCC GCCTGGAGGA GGAACTGGCA CCGGCAGCCG CAGGACGGTT CTGGCAGGCA ATTGACCGCC GGGCAGCCGG TTACCCCCTC CAGTACCTCA CCGGACACCA GGAATTTATG TCCCTGGACT TTAAAGTCAC TCCGGCGGTT TTAATCCCCC GCCAGGATAC CGAAGTGGTG GTGGAGGCTG TCCTTGAGCG TCTGGACCCC TGCGAGAGCT ATACCATCGC CGACTGCGGT ACGGGCAGCG GGGCCATTGC CCTGAGCCTG GCCCATTACC TGCCCCGGGC CCGGGTTTAC GCCACGGACA TCAGCCCGGC GGCCCTGACG GTGGCCCAGG AAAACGCCAG GAAACTGGGG CTGGCGGCCA GGGTAACCCT TCTCCAGGGT GATTTTTTGG CGCCCCTGCG GGGTTTAAAG CTCGACGCCC TTGTGGCCAA CCCCCCCTAC ATACCCACTG CCGCCCTGCC AGGGCTGCCC GCGGATGTCC GCTCTGAACC GCGCCTGGCC CTGGACGGCG GGCCCGACGG CCTGGATGCC TACCGGTTCC TCCTGCCGGG GGCGGCAGGA CTTTTGCGGC CCGGCGGTCT CCTGGCCCTG GAAATCGGCT CCGACCAGGG ACAGGCCGTA AAGGACCTGG CCCGGGCCGT GGGAGCCTAT CGCAACGAAC AGGTTTTACC AGATTATGCC GGCCGCGATC GTTGTTTCCT GGCTTATCGC CGGGAAGAAT AA
|
Protein sequence | MTLRQALGEA VRRLAAGGVE RPRLEAEVLL GWACSLTRPR LLARLEEELA PAAAGRFWQA IDRRAAGYPL QYLTGHQEFM SLDFKVTPAV LIPRQDTEVV VEAVLERLDP CESYTIADCG TGSGAIALSL AHYLPRARVY ATDISPAALT VAQENARKLG LAARVTLLQG DFLAPLRGLK LDALVANPPY IPTAALPGLP ADVRSEPRLA LDGGPDGLDA YRFLLPGAAG LLRPGGLLAL EIGSDQGQAV KDLARAVGAY RNEQVLPDYA GRDRCFLAYR REE
|
| |
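The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in Protein Length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; spaces and other characters are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above (Moth_2396).
length_bp = gene_length(2516791, 2517642)
print(length_bp)                  # 852
print(protein_length(length_bp))  # 283
```

Passing the Gene sequence field to `gc_percent` (the space-separated blocks of ten are skipped automatically) gives a value to compare with the reported GC content.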