Gene Moth_1074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1074 
Symbol 
ID3833187 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1105321 
End bp1106619 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content56% 
IMG OID637829002 
Producthypothetical protein 
Protein accessionYP_429931 
Protein GI83589922 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0621] 2-methylthioadenine synthetase 
TIGRFAM ID[TIGR00089] RNA modification enzyme, MiaB family
[TIGR01125] MiaB-like tRNA modifying enzyme YliG, TIGR01125 


Plasmid Coverage information

Num covering plasmid clones51 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAGAG TTGCTGTTAT AACCCTCGGT TGTCCTAAAA ACCAGGTAGA AAGCGAATAT 
ATGCTGGGGA TCCTGGAAAA GAACCACCTG GAAGTGGTAA GCGATCCCCG GCAGGCGGAA
GTAGTAATCA TTAACACCTG CAGCTTTATT ACCGCGGCAC GGGAAGAGGC TTTAGATACG
ATCCTGGAGC TGGCCCGGGC TGCCAATCAC CCGCGGTTAA TTGTTGCCGG TTGCCTGGCC
CAGCAATACG CCTCCGAGTT GTGGCAGGAA TTGCCGGAGG CGGCAGCCTT TATCGGACCC
GGGGCCACAG GCCGCTTGCC GGAAATTATT AACCGGGTAT TAAAGGGTGA GAGGGTGCTG
GATGTACCCG GCCCGGAAAT GATTACCGGG GAATTGCCAC GCCTTATCGA AGATGGGAAG
CCCTTTGCCT ATTTAAAGAT TGCCGAGGGT TGCAATAACC GTTGTACTTA CTGTACTATC
CCTTCCATCA AGGGGCCCTA TCGCAGCCGG CCCCTGGAGA AAGTGGTAGC CGAGGCCGTT
TCTCTGGCGG CCAGGGGCAT AAAAGAGCTG GTCCTGGTAG CCCAGGATAC CACGGCGTAC
GGCCTGGATT GTTACGGAGA GTACCGCCTG CCGGAACTCC TGCGCCGCCT GGCCAGGATT
GAGGGGATAG AGTGGGTGCG TCTACTCTAC GCCTACCCGA CCAGGATCAC CCCGGAATTG
ATCGAGGTAA TGGCTACTGA GCCCGGGGTG GTACCTTACC TGGATCTACC CCTGCAGCAT
GCCAGTGAAG GCGTCTTGAG ACGAATGGGC CGTCCCGGGA CGGGAGCGGC GGGCCTGAGA
GCTATAGAAA GCCTGCGGCG GGCCATACCG GAGATAACCA TACGCTCTAC CTTTATCGTG
GGCTTTCCCG GAGAGGAAGA GGAGGATTTT CAAATCCTTC TTGACTTCCT TACTGACGCC
CGGTTGGACT GGGTGGGGGC TTTTAAATTC TCTCCCGAGG AAGGTACAAT AGCGGCGAGC
CTTCCAGGTC AGGTACCAGA AGAGGTGAAG GAAGAACGTT ACCAGAGGTT AATGCTCCAC
CAGCAATCCA TCACCAGGGC CTGCAATGAA GGCTGGCTGG GCCGGGAGGT CCAGGTTTTG
AAGGAAGGGC CGGAGGTAGG GCGCAGTATG CGCCAGGCCC CGGAAGTAGA CGGTGTGGTA
TATGTTAAGG GAGATCCCTC ACCAGCCGGT AGCATGGTTA CAGTGAAGCT GACCCAGCTT
TATAATATCT ATGACTTTCT GGGGGAGATT AAGTTATGA
 
Protein sequence
MIRVAVITLG CPKNQVESEY MLGILEKNHL EVVSDPRQAE VVIINTCSFI TAAREEALDT 
ILELARAANH PRLIVAGCLA QQYASELWQE LPEAAAFIGP GATGRLPEII NRVLKGERVL
DVPGPEMITG ELPRLIEDGK PFAYLKIAEG CNNRCTYCTI PSIKGPYRSR PLEKVVAEAV
SLAARGIKEL VLVAQDTTAY GLDCYGEYRL PELLRRLARI EGIEWVRLLY AYPTRITPEL
IEVMATEPGV VPYLDLPLQH ASEGVLRRMG RPGTGAAGLR AIESLRRAIP EITIRSTFIV
GFPGEEEEDF QILLDFLTDA RLDWVGAFKF SPEEGTIAAS LPGQVPEEVK EERYQRLMLH
QQSITRACNE GWLGREVQVL KEGPEVGRSM RQAPEVDGVV YVKGDPSPAG SMVTVKLTQL
YNIYDFLGEI KL