Gene Moth_0046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0046 
Symbol 
ID3830912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp45951 
End bp46874 
Gene Length924 bp 
Protein Length307 aa 
Translation table11 
GC content65% 
IMG OID637827978 
Producthypothetical protein 
Protein accessionYP_428928 
Protein GI83588919 
COG category[R] General function prediction only 
COG ID[COG0313] Predicted methyltransferases 
TIGRFAM ID[TIGR00096] probable S-adenosylmethionine-dependent methyltransferase, YraL family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000617643 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000342991 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTAGTC TCTACCTGGT GGGAACCCCC ATCGGCAACC TGGAGGATAT TACCTTCAGG 
GCTTTGCGGG TCCTGAAGGA AGTAGACCTT ATCGCCGCCG AAGATACCCG GCATACCCGG
GAACTCCTGA CCCATTATGG TATTCACACC CCCCTGACCA GCTATCACCG TCACAACCTG
GCCAGCAAGA CCCCTTACCT GTTGGGGCTG CTGCGGGAGG GTAAGGATAT CGCCCTGGTT
TCCGACGCCG GCCTGCCGGG AATCAGCGAT CCCGGGGAGG AACTGGTCCG GGCCACGGTA
GCCGCCGGAC TGCCGGTGGT ACCGGTACCC GGGGCGAATG CCGCCCTGAC TGCCCTGGTG
GCCTCCGGTT TGCCCGCTGG TCGTTTTGCC TTTGAAGGCT TTTTGCCCCG GGCCGGGAAG
GAGCGCCGGG AACGCCTGGC CGCCCTGGTG GGGGAAGAAC GGACCCTGAT TTTTTATGAG
GCGCCCCACC GGCTAACTGC CACCCTGGAT GACCTGGCGG CAACCCTGGG ACCCAGGCAG
GTGGCCATCG GTCGGGAGTT AACAAAAAAG TTTGAGACCA TATGGCGGGG AACACTGCCG
GAGGCCAGGG AGTATTTCCG GGATAACCCA CCCCGGGGCG AACTTACCCT GGTGGTAGCC
GGGGCGCCAC CAGCCCCCCG GCCGGCCTAT GATCCCGCCC GGGCGGCTGC CGAGGTGGCT
GACCTGGAGG CCAGCGGGCT GGACCGTAAG GAAGCCATGG CCCGGGTAGC TCGCATCTAC
GGCCAGTCCC GGCGGGAGAT CTACAGGGCC TGCCTGCAAG CCCGGGAAGG TGGGCAGGGA
GGGTCCGGCG GGCTGGGGGA ACCTGCAACC GCAAGGGGTG GGAGCCTACC GGGAGACAAA
GCTGTTAGCC CTTCAGGGGA TTAA
 
Protein sequence
MASLYLVGTP IGNLEDITFR ALRVLKEVDL IAAEDTRHTR ELLTHYGIHT PLTSYHRHNL 
ASKTPYLLGL LREGKDIALV SDAGLPGISD PGEELVRATV AAGLPVVPVP GANAALTALV
ASGLPAGRFA FEGFLPRAGK ERRERLAALV GEERTLIFYE APHRLTATLD DLAATLGPRQ
VAIGRELTKK FETIWRGTLP EAREYFRDNP PRGELTLVVA GAPPAPRPAY DPARAAAEVA
DLEASGLDRK EAMARVARIY GQSRREIYRA CLQAREGGQG GSGGLGEPAT ARGGSLPGDK
AVSPSGD