Gene Moth_0048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0048 
Symbol 
ID3830798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp47363 
End bp49318 
Gene Length1956 bp 
Protein Length651 aa 
Translation table11 
GC content58% 
IMG OID637827980 
Productmethionyl-tRNA synthetase 
Protein accessionYP_428930 
Protein GI83588921 
COG category[J] Translation, ribosomal structure and biogenesis
[R] General function prediction only 
COG ID[COG0073] EMAP domain
[COG0143] Methionyl-tRNA synthetase 
TIGRFAM ID[TIGR00398] methionyl-tRNA synthetase
[TIGR00399] methionyl-tRNA synthetase C-terminal region/beta chain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000300514 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000703201 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGGGGATA AGGTTTTCTA TGTAACTACC CCTATCTATT ACCCCAGCGA TAAGTTGCAT 
ATCGGCCATG CCCTGACCAC CACCATGGCC GATACCCTGG CGCGTTACAA GCGCCTGCGG
GGTTACGACG TCTACTTTCT TACCGGGTCG GACGAACACG GCCAGAAGAT CCAGCGCAAG
GCCCGGGAGG CCAACCTGGA ACCCATCCAG TATGTGGATC GGATTGTCGC CACTTTCCAG
GAACTCTGGC GGCGCCTAAA TATCTCTTAT AATGATTTTA TCCGTACCAC CGAACCGCGC
CATGCACGGG TCGTCCAGGC CCTGTTGCAA AAGATATATG ACCAGGGCGA TATCTACAAA
TCCACTTACG AGGGCTGGTA CTGTACCCCC TGCGAGACCT TCTGGACAGA ACGGCAGCTG
GTGGATGGCA ATTGCCCGGA TTGCGGCCGG CCGGTGGAAC TGGTCAAGGA AGAAAGTTAT
TTCTTCCGTA TGAGTAAATA CGCCGACCGT TTGCTGCAAT ATATCAAAGA TCATCCCGAT
TTTATCCAGC CTGTCACCCG GCGTAACGAA ATGATTAGTT TTATCGAGGG CGGCCTGGAG
GACCTGTGCA TCTCCCGGAC GACCTTTGAC TGGGGCATTC CGGTGCCTAT GGACCCCAAA
CATGTTATCT ACGTGTGGTT CGACGCCCTG ACCAACTATA TCTCGGCCCT GGGTTATGGC
ACCGCCGACG ACCATCTCTT CCGCAAATAC TGGCCGGCAG CCGTGCACCT GGTGGGCAAG
GACATTGTCC GCTTTCATAC TATTATCTGG CCCATTATCC TAATGGCTGC CGGTATTGAG
CCGCCCCGTC AGGTCTTCGG CCACGGTTGG CTACTGGTTG ACGGCGGTAA GATGTCTAAA
TCCAGGGGGA ATGTCGTTGA CCCCATGATC CTCATTGATC GCTACGGCTC CGACGCCATC
CGCTACTTCC TCCTCAGGGA GATGCCCTAT GGTGCCGACG GCTATTACAG CGAGGAGGCC
TTAATTAATC GCTACAATAC CGATCTGGCC AACGACTTTG GCAACCTCTT AAGCCGGACG
ACGGCCATGA TCGAGAAATT CAACGGGGGC GTTATTGACC CGCCGTCAGC CCCGGAACCC
TTGGACGAAG AGCTCAAAAA CCTGGCCGCC GGCATCCCGG ACGAGGTGGA CAACGCCCTG
AATCATTATG AGTTTGCCAG GGCCCTGGCG GCAATCTGGC GTCTGGTTAA CCGGGCCAAT
AAATACATTG AAGAAACCGC ACCATGGGCC CTGGCCAGGG ACCCCGGGCA AAAACAGCGC
CTGCAGACGG TTCTCTATAA CCTGGCCGAG GCCGTGCGCC AGGCGACGAT TATGGTCGGC
CCCTTTATGC CCGGCGTACC CGACCGGGTC TGGGACCAGC TGGGCCTTAA AGACGTTCCG
GCGGCCCTTA CCTGGGAGAG CCTGGCTACC TGGGGGGGCA TTCCCGCCGG TACCAGGGTG
AGAAGGGGCG AAGCCCTGTT CCCCCGGATT GATTTAAAAG AAGGAGAGAT ACCAGTGGCA
GAAAAAGCAA CGGAACCGGT TAAGGTGGCC GAGGCTGCTC CCGCCGGGGG GGCTGTTGCC
CGACCCGGTG AAGAGGAAAT CACTATAGAA GAATTCGCCC GGATTAAACT GCGGGTAGCC
CTGGTGCTGG AGGCCGAAAA GGTGGCCAAT GCCGACAAAC TCCTGAAATT GCGGGTCCGG
GTCGGCAACG AGGAACGCAC CATTGTGGCC GGGATTGCCC GCTACTACCA GCCGGAGGAA
CTCGTTGGGA AAAAGGTGGT CATCGTAGCC AACTTGAAAC CGGCCAGACT GCGGGGTATC
GTCTCCCAGG GCATGGTCCT GGCGGCCGTT GACGACGAAT CCTTAAGCCT GGTAACCCCG
GAGCGGGCCA TCAAAGACGG CGCCCAGGTG CGCTAA
 
Protein sequence
MGDKVFYVTT PIYYPSDKLH IGHALTTTMA DTLARYKRLR GYDVYFLTGS DEHGQKIQRK 
AREANLEPIQ YVDRIVATFQ ELWRRLNISY NDFIRTTEPR HARVVQALLQ KIYDQGDIYK
STYEGWYCTP CETFWTERQL VDGNCPDCGR PVELVKEESY FFRMSKYADR LLQYIKDHPD
FIQPVTRRNE MISFIEGGLE DLCISRTTFD WGIPVPMDPK HVIYVWFDAL TNYISALGYG
TADDHLFRKY WPAAVHLVGK DIVRFHTIIW PIILMAAGIE PPRQVFGHGW LLVDGGKMSK
SRGNVVDPMI LIDRYGSDAI RYFLLREMPY GADGYYSEEA LINRYNTDLA NDFGNLLSRT
TAMIEKFNGG VIDPPSAPEP LDEELKNLAA GIPDEVDNAL NHYEFARALA AIWRLVNRAN
KYIEETAPWA LARDPGQKQR LQTVLYNLAE AVRQATIMVG PFMPGVPDRV WDQLGLKDVP
AALTWESLAT WGGIPAGTRV RRGEALFPRI DLKEGEIPVA EKATEPVKVA EAAPAGGAVA
RPGEEEITIE EFARIKLRVA LVLEAEKVAN ADKLLKLRVR VGNEERTIVA GIARYYQPEE
LVGKKVVIVA NLKPARLRGI VSQGMVLAAV DDESLSLVTP ERAIKDGAQV R