Gene Moth_1112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1112 
Symbol 
ID3833244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1138402 
End bp1139736 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content55% 
IMG OID637829040 
ProducttRNA-i(6)A37 modification enzyme MiaB 
Protein accessionYP_429969 
Protein GI83589960 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0621] 2-methylthioadenine synthetase 
TIGRFAM ID[TIGR00089] RNA modification enzyme, MiaB family
[TIGR01574] tRNA-N(6)-(isopentenyl)adenosine-37 thiotransferase enzyme MiaB 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000242644 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTAAAAG CAAGAAAAAC CTTCAAGATT ATAACCTATG GCTGTCAGAT GAACCAGAGG 
GACAGCGAAA TGATGGCCGA TCTCCTTCAA GATGCTGGCT ATGAACCGGT GGCCCGGGAA
GAAGAAGCCG GCGTAATCAT CCTCGATACC TGTTGCGTCC GGGAGAAGGC TGAAAATAAG
GTTTACGGCA AACTGGGCCA GATCGAGAAG TTAAAGAGTG CCAACCCGGA CCTGGTAATT
GCTGTGGCCG GGTGTATGGT CCAGCAGCCG GGGGTCGCCG AGAAAATCCG CCAGCAGGCA
CCTTATGTCG ATCTCCTCCT GGGGACCGGT AACCTCCAGG AGCTGCCGCA ACTGATCGAG
GAAATCAAGG CCATGCACCG GCCCAGGATA GTTGTCGGCG AGCAGGAAGG ACCGGTAGTG
GAGGATCTCC CCCGGCGGCG GGCCAGGGGG GCCCAGGCCT TTGTAACCAT AACCTACGGT
TGCAATAACT TTTGCACCTA TTGTATTGTC CCCTATGTCC GCGGGCGGGA GAGAAGCCGC
CGGCCGGAGA ATATAATCAA AGAAGTAAAA GAACTGGTTG ACCAGGGAGT AATTGAAGTT
ACCCTCCTGG GCCAGAATGT CAACTCTTAT GGCCGCGACT TGAGAGACGG GATCAATTTT
GCCGGCCTGC TGGAGCGTGT TAATGCTGTT GAAGGTTTAA AGCGTATACG TTATGTAACT
TCCCACCCCA GGGATTTTAC CCCCGAACTG GTAACTACTA TTAGCCGGTT GGATAAAGTC
TGTGAACATG TTCATCTGCC TGTGCAGGCC GGCAGTAACC GGATCCTGGA ACTCATGCAC
CGGGGTTATA CCAGGGAACA CTACCTGGAA CTGGTTGCCG ACCTGCGGCG CCATATCCCG
GGGATCAGCC TGACGACGGA TCTCATTGTT GGTTTTCCCG GGGAGACAGA AGCCGACTTT
GAGGATACCC TGGACCTGGT TGCCAGGGTG CAGTTCGATA ATGCCTTTAC CTTTATGTAC
TCGCCGCGAC GGGGCACGGA GGCTGCCACC ATGCCGGGTC AGCTGCCCAC GGCCATCAAG
AAGGAGCGCC TGAAGCGGTT GATGGAACTC CAGAACAGCA TCAGCCTGGC GAAGAATGAA
GCCCTGGTAG GCCAGGAAGT GGAAGTTCTC GTGGAAGGCC CCAGTAAAAC CGATCCCGAC
CAGTTGAGCG GCCGTACCCG GACCAATAAG CTAATTATTT TCCCCGGGGA TCAATCCCTG
ACCGGCCGGC TGGTAAGGGT GCGGTTAACC CGAGCCCAGA CTTGGTTATT AAAAGGAGAA
ATGGTTGATG GCTAG
 
Protein sequence
MVKARKTFKI ITYGCQMNQR DSEMMADLLQ DAGYEPVARE EEAGVIILDT CCVREKAENK 
VYGKLGQIEK LKSANPDLVI AVAGCMVQQP GVAEKIRQQA PYVDLLLGTG NLQELPQLIE
EIKAMHRPRI VVGEQEGPVV EDLPRRRARG AQAFVTITYG CNNFCTYCIV PYVRGRERSR
RPENIIKEVK ELVDQGVIEV TLLGQNVNSY GRDLRDGINF AGLLERVNAV EGLKRIRYVT
SHPRDFTPEL VTTISRLDKV CEHVHLPVQA GSNRILELMH RGYTREHYLE LVADLRRHIP
GISLTTDLIV GFPGETEADF EDTLDLVARV QFDNAFTFMY SPRRGTEAAT MPGQLPTAIK
KERLKRLMEL QNSISLAKNE ALVGQEVEVL VEGPSKTDPD QLSGRTRTNK LIIFPGDQSL
TGRLVRVRLT RAQTWLLKGE MVDG