Gene Information |
Locus tag | Moth_0046 |
Symbol | |
ID | 3830912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 45951 |
End bp | 46874 |
Gene Length | 924 bp |
Protein Length | 307 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637827978 |
Product | hypothetical protein |
Protein accession | YP_428928 |
Protein GI | 83588919 |
COG category | [R] General function prediction only |
COG ID | [COG0313] Predicted methyltransferases |
TIGRFAM ID | [TIGR00096] probable S-adenosylmethionine-dependent methyltransferase, YraL family |
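The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes the stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 45951
end_bp = 46874

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints: 924 307
```

Under these assumptions the result matches the Gene Length (924 bp) and Protein Length (307 aa) fields.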
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000000617643 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000342991 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTAGTC TCTACCTGGT GGGAACCCCC ATCGGCAACC TGGAGGATAT TACCTTCAGG GCTTTGCGGG TCCTGAAGGA AGTAGACCTT ATCGCCGCCG AAGATACCCG GCATACCCGG GAACTCCTGA CCCATTATGG TATTCACACC CCCCTGACCA GCTATCACCG TCACAACCTG GCCAGCAAGA CCCCTTACCT GTTGGGGCTG CTGCGGGAGG GTAAGGATAT CGCCCTGGTT TCCGACGCCG GCCTGCCGGG AATCAGCGAT CCCGGGGAGG AACTGGTCCG GGCCACGGTA GCCGCCGGAC TGCCGGTGGT ACCGGTACCC GGGGCGAATG CCGCCCTGAC TGCCCTGGTG GCCTCCGGTT TGCCCGCTGG TCGTTTTGCC TTTGAAGGCT TTTTGCCCCG GGCCGGGAAG GAGCGCCGGG AACGCCTGGC CGCCCTGGTG GGGGAAGAAC GGACCCTGAT TTTTTATGAG GCGCCCCACC GGCTAACTGC CACCCTGGAT GACCTGGCGG CAACCCTGGG ACCCAGGCAG GTGGCCATCG GTCGGGAGTT AACAAAAAAG TTTGAGACCA TATGGCGGGG AACACTGCCG GAGGCCAGGG AGTATTTCCG GGATAACCCA CCCCGGGGCG AACTTACCCT GGTGGTAGCC GGGGCGCCAC CAGCCCCCCG GCCGGCCTAT GATCCCGCCC GGGCGGCTGC CGAGGTGGCT GACCTGGAGG CCAGCGGGCT GGACCGTAAG GAAGCCATGG CCCGGGTAGC TCGCATCTAC GGCCAGTCCC GGCGGGAGAT CTACAGGGCC TGCCTGCAAG CCCGGGAAGG TGGGCAGGGA GGGTCCGGCG GGCTGGGGGA ACCTGCAACC GCAAGGGGTG GGAGCCTACC GGGAGACAAA GCTGTTAGCC CTTCAGGGGA TTAA
|
Protein sequence | MASLYLVGTP IGNLEDITFR ALRVLKEVDL IAAEDTRHTR ELLTHYGIHT PLTSYHRHNL ASKTPYLLGL LREGKDIALV SDAGLPGISD PGEELVRATV AAGLPVVPVP GANAALTALV ASGLPAGRFA FEGFLPRAGK ERRERLAALV GEERTLIFYE APHRLTATLD DLAATLGPRQ VAIGRELTKK FETIWRGTLP EAREYFRDNP PRGELTLVVA GAPPAPRPAY DPARAAAEVA DLEASGLDRK EAMARVARIY GQSRREIYRA CLQAREGGQG GSGGLGEPAT ARGGSLPGDK AVSPSGD
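Both sequences are displayed in space-separated blocks of ten characters, so the grouping has to be removed before any analysis. The sketch below shows one way to do that and to compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sample uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# Only the first two blocks of the gene sequence, used as a short sample.
sample = clean_sequence("ATGGCTAGTC TCTACCTGGT")
print(len(sample), gc_percent(sample))  # prints: 20 50.0
```

Applying the same helpers to the full Gene sequence field gives values that can be compared against the Gene Length and GC content fields of the record.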