Gene Moth_1912 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1912 
Symbol 
ID3830836 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1981394 
End bp1982815 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content63% 
IMG OID637829845 
ProductRNA methyltransferase 
Protein accessionYP_430755 
Protein GI83590746 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2265] SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase 
TIGRFAM ID[TIGR00479] 23S rRNA (uracil-5-)-methyltransferase RumA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000182205 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACTCGG CTATTACCAT TATCGGGTTA AACCATGAAG GCGCCGGGAT CGGTCATCTG 
CAGGACGGAC GGGTTATCTT TGTGCCCGGA GCCCTGCCGG GGGAACAGGT CCTGGTGGAA
GTAGTCAGCG TTAAAAGGAA TTACGCCAGG GGTCGGCTGG TAGAGGTCGT AGAGGCGTCG
CCGGACCGGG TGTTGCCCCC CTGCCCGGAA GCGGCCTCCT GTGGCGGGTG CGATCTGCAA
CACCTGGACT ATCGGGCCCA GTTGCATTGG AAGCGTCGTC TGGTAATCGA TGCCTTGCAA
CGCCTGGGAC ATCTGAGGGA TATCCCGGTG CTACCCGTTC TGGGTATGGC TAATCCCTGG
GGTTACCGCA ACAAGGTGCG GCTGCATGTC CGCCGGGGGC GGCTGGGCTT TTACCGCCCG
GGAAGCCACG AGTTAGCACC CTTCTCCTGC TGTCCATTGT TACCCCCCGG CCTTCTGAAG
GCGGCCCGGG CGATCGTGCG GTTGCTGCCG GAACTGCCAC CCGGCCTGCA GCATGTAACC
CTGCGCCAGG GCCTGGCTAC CGGGGAACTG CTGGTTGTCC TGGAGGCTTT ACCTGGATGG
CAGGGCGATA GGGAACTGGC GGAGAAACTG GCCGGCAGAT TCCCGGAACT GGTGGGGGTT
GTATCCCTGG CTGGCGGCGG CAGGAGCAGG AGCGGTCCAA AGGATTTTGC CGGGGAGCCT
GCTTTAGAGT GCAGCGGCTG GGTAAAAACA GGCGGGAAAG AAGCAAGGCG GGCCCGGTAC
CAGGAAGGAT TCGTGCCCCG CAGGCTAGGT GGCCGGCCCT TCACCCTCTA TGGCCGTGAT
TACCTGGAGG AACGCCTTGG TGACCTCCGC TTCTATATAT CAGCCACGAC CTTTTTCCAG
GTTAATTCGG CCCAGGCGGA AGTCCTCTAT AACAAGGCGG CAACCTTTGC CGGCCTGCAG
GGCGGGGAAG AGGTTCTGGA CGCCTACTGC GGCAGCGGTG CCATTGCCCT GTGGCTCAGC
CGCCAGGCCG GGCGGGTGGA GGGGGTGGAA GTAGTCCCGG AGGCCATTGT TGACGCCCGG
CGCAATTCAA TTTTAAACAA CCTGGCCAAT GTCCACTTTC GTACCGGCGC TGCCGAGGCG
GTCCTGCCCC GCCTGGCAGG AAAGGGTTAC CGGCCGGAGG TAATTATCCT GGATCCGCCC
CGGGCCGGAT GCGACCGCCG GGTGCTGGCG GCCGTGGCGA CTATGGAACC CCGGCGGGTG
GTCTACATCT CCTGCAACCC GTCAACTTTA GCCCGCGACC TGGCACACTT ACGGGAAGCT
GGCTTTAAGC CCGGCCCGGT GCAGCCGGTG GACATGTTCC CCCATACCCA CCATGTGGAG
TGTTGCTGCT TTCTTGTAAA GGAGAGAAAT AACAGCCGCT GA
 
Protein sequence
MDSAITIIGL NHEGAGIGHL QDGRVIFVPG ALPGEQVLVE VVSVKRNYAR GRLVEVVEAS 
PDRVLPPCPE AASCGGCDLQ HLDYRAQLHW KRRLVIDALQ RLGHLRDIPV LPVLGMANPW
GYRNKVRLHV RRGRLGFYRP GSHELAPFSC CPLLPPGLLK AARAIVRLLP ELPPGLQHVT
LRQGLATGEL LVVLEALPGW QGDRELAEKL AGRFPELVGV VSLAGGGRSR SGPKDFAGEP
ALECSGWVKT GGKEARRARY QEGFVPRRLG GRPFTLYGRD YLEERLGDLR FYISATTFFQ
VNSAQAEVLY NKAATFAGLQ GGEEVLDAYC GSGAIALWLS RQAGRVEGVE VVPEAIVDAR
RNSILNNLAN VHFRTGAAEA VLPRLAGKGY RPEVIILDPP RAGCDRRVLA AVATMEPRRV
VYISCNPSTL ARDLAHLREA GFKPGPVQPV DMFPHTHHVE CCCFLVKERN NSR