Gene Moth_1650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1650 
Symbol 
ID3830938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1684863 
End bp1685936 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content62% 
IMG OID637829575 
ProducttRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase 
Protein accessionYP_430495 
Protein GI83590486 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0482] Predicted tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain 
TIGRFAM ID[TIGR00420] tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0173174 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAGCTA TGAGTGGCGG CGTGGACAGT TCCACAGCCG CCGCTTTATT GAAGGAAGCC 
GGCTATGAGG TAATCGGGGT AACCCTGGCC CTCTGGCCCG AAGATACACC GCCCCCGCCG
GGAGAAACCG GGTGTTGCAG CCTCAAGGCT GTAGACGATG CCCGCCGGGT GGCCAACATC
CTGGATATAC CCTATTACGT CCTCAATTTT CGCGACCTCT TTGAGCGCGA GGTCATCGAT
TATTTTATAG CTTCCTACCT GGAGGGAGAG ACCCCCAACC CATGTATCGC CTGCAACCGG
CGGATCAAGT TCGGCGCCCT GCTGGCTAAA GCCCGGGCCC TGGGGATAGA CTATATCGCT
ACCGGCCACT ACGCCAGGCG CTGGTACGAC AAAGAAAAGG GGCGTTACCT CCTGGCCCGG
GGCCGGGATG CCGGCAAGGA CCAGAGCTAC GCCCTCTACA CCTTTACCCA GGAACAGCTC
GCCCATACCC TGTTGCCCCT GGGTGACTAC ACCAAGGTGG AGGTGCGGGA GATTGCCGCC
CGTTACGGGC TGCCGGTGGC CCGGAAGGCC GAGTCCCAGG AGATCTGCTT TGTTACTGAG
GGAGACTACC GGGATTATAT CCAGAGCCGG GCCAGGGAGA AGATTAAACC CGGGCCCATC
CTGGATACGA GGGGCCGGGT CCTGGGCCAG CACCGGGGTC TGCCCTTTTA TACCATCGGC
CAGCGTAAGG GTCTGGGCCT GGCCCTGGGC AAACCCTGCT TTGTAGTCGC CCTGGACCCG
GAGCGCAACG CCGTTATCGT CGGCGACAAA GAAGATCTGG AACGGCGGGT CCTCTACGCC
CGTGATAACA ACTATATTCT CTGGGGTGAA CTGCCTGGTA AAGCCCGGGT AACGGCCAGG
ATTCGCTACC GGGCGCCCGA AGCAGCCGCA ACCTGGCATC CCCTGGCGGG TGGCCGGGCC
AGGCTGGAGT TTGACGAACC CCAGCGGGCC ATCACCCCGG GCCAGGCTGT GGTCTACTAC
CAGGGCGACC TGGTGGTAGG CGGCGGGACC ATTGAAAGCG TAGCGCAAAT ATAG
 
Protein sequence
MVAMSGGVDS STAAALLKEA GYEVIGVTLA LWPEDTPPPP GETGCCSLKA VDDARRVANI 
LDIPYYVLNF RDLFEREVID YFIASYLEGE TPNPCIACNR RIKFGALLAK ARALGIDYIA
TGHYARRWYD KEKGRYLLAR GRDAGKDQSY ALYTFTQEQL AHTLLPLGDY TKVEVREIAA
RYGLPVARKA ESQEICFVTE GDYRDYIQSR AREKIKPGPI LDTRGRVLGQ HRGLPFYTIG
QRKGLGLALG KPCFVVALDP ERNAVIVGDK EDLERRVLYA RDNNYILWGE LPGKARVTAR
IRYRAPEAAA TWHPLAGGRA RLEFDEPQRA ITPGQAVVYY QGDLVVGGGT IESVAQI