Gene Moth_0084 details

Gene Information

Locus tag: Moth_0084
Symbol:
ID: 3832693
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 85566
End bp: 87038
Gene Length: 1473 bp
Protein Length: 490 aa
Translation table: 11
GC content: 60%
IMG OID: 637828016
Product: MazG family protein
Protein accession: YP_428966
Protein GI: 83588957
COG category: [R] General function prediction only
COG ID: [COG3956] Protein containing tetrapyrrole methyltransferase domain and MazG-like (predicted pyrophosphatase) domain
TIGRFAM ID: [TIGR00444] MazG family protein
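
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption; the record does not state its coordinate convention) and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 85566
end_bp = 87038

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)
```

Under these assumptions the computed gene length and protein length agree with the listed 1473 bp and 490 aa.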


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.302456
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGGCA AGGTGATAGT TACCGGCTTG GGGCCGGGGG ACCCGGCCCA GGTGCCAGAA 
GCTGTCCTTA AGGCCCTGGC CGGGGCGGAT AAGATTTACC TGAGAACCGC CCACCACCCG
GCGGTAGCCG CCCTGGAGGT ACGGGGCCTT AAGTGGGAAA CCTTTGATTC TTTTTATGAG
GAAGCGGCGG ACTTCGAGGA ACTCTACCAG CGCATAGCCA GTTTTCTCCT CACCGCGGTA
GCTAAGACAG ATAAGAAACT GGCCTACGCC GTACCGGGTC ATCCCCTGGT CGCGGAGCGA
AGCGTGGCCC TGCTGCTAGA AAAGGCCCCG GCAGCCGGTG TCGACCTGGA GATTATTCCG
GCAATGAGCT GCCTGGATGC CCTTTACGCC ACCCTCAAAA TAGACCCCGC CCTGGGGTTG
ACCGTAGCCG ACGCCCTGAC CTTCACGGTG GCTGGCCTGG ACCCGGCCCG GGGTTTAATC
CTTACCCAGG TATATAACCG CCGGGTAGCC GGGGAGATTA AATTAGACCT TATGACTGTC
TACCCGGATG AATACCCGGT AACCGTCGTC CGCGGCGCCG GTCTTCCTGA CGGCGAAAGG
GTAGCCACCG TGCCCCTCTA CACCATCGAC CGCCTGGAAT GGCTTGATCA CTTAACCAGT
CTTTACCTGG CACCCTACCC CGAGGGCCGG GATCGCACCC TGGCGGGCCT GGAAGCCATT
ATGGCTCGCT TACGAAGTCC GGAGGGGTGC CCCTGGGACC GGGAGCAAAC CCATATCACT
CTGAAACGTT ATCTGGTAGA AGAGACCTAC GAGGTCCTGG AGGCCATCGA CGCCGGGGAC
ATGAATAAAC TATGCGAAGA ATTGGGAGAC TTACTGCTAC AGGTGGTCTT CCACGCCAGG
CTGGCGGAGG AAGAGGGTGA TTTTACCCTG GCCGACTGCC TGGAGGGTAT CTGTGCCAAG
ATGCGCCGGC GCCACCCCCA CGTCTTCGGC CAGGCCGTTC TGAATACGGC CGGGGAAGTA
CTGGCGCGCT GGGACCAGAT CAAGGCTACC GAAAGGAGAG AAAAAGGCGA AGAAGCACCG
TCGGTATTGA GCGTGCCCCG GGGCTTGCCG GCCCTCTTAA AGGCCTTGAA GGTCCAGGAG
CAGGCCGCCC GGGTGGGTTT TGACTGGCCC CGGATAGAGG AAGTCTGGAC CAAGGTGGAA
GAAGAACTGG ACGAGTTGAA AAAAGCGGTT GCCGGCGCCG GGGTGGAGGA GCAGGCGGCT
GAGATGGGGG ATCTCCTCTT CTCCCTGGTC AACCTGGCCC GCTGGCTCCA GATTGAACCG
GAGGCGGCGC TTCAGGCAAC GGTTGCTAAA TTCGCTCGGC GCTTCAATTA TATAGAAAAG
GCCGCCCTGA AGGGAGGCAG GGATATCGAG GATCTCTCCC TGGCGGAGAT GGACGCCCTC
TGGGAAGAAG CGAAAAAAAT TAGGCCATCT TAG
 
Protein sequence
MAGKVIVTGL GPGDPAQVPE AVLKALAGAD KIYLRTAHHP AVAALEVRGL KWETFDSFYE 
EAADFEELYQ RIASFLLTAV AKTDKKLAYA VPGHPLVAER SVALLLEKAP AAGVDLEIIP
AMSCLDALYA TLKIDPALGL TVADALTFTV AGLDPARGLI LTQVYNRRVA GEIKLDLMTV
YPDEYPVTVV RGAGLPDGER VATVPLYTID RLEWLDHLTS LYLAPYPEGR DRTLAGLEAI
MARLRSPEGC PWDREQTHIT LKRYLVEETY EVLEAIDAGD MNKLCEELGD LLLQVVFHAR
LAEEEGDFTL ADCLEGICAK MRRRHPHVFG QAVLNTAGEV LARWDQIKAT ERREKGEEAP
SVLSVPRGLP ALLKALKVQE QAARVGFDWP RIEEVWTKVE EELDELKKAV AGAGVEEQAA
EMGDLLFSLV NLARWLQIEP EAALQATVAK FARRFNYIEK AALKGGRDIE DLSLAEMDAL
WEEAKKIRPS
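
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. The function names `translate` and `gc_fraction` are hypothetical helpers written for this example, not part of any database tool. The codon table used is the standard one; to the best of my knowledge translation table 11 assigns the same amino acids to codons and differs only in the permitted start codons, so the sketch treats the two as equivalent for translating this gene.

```python
# Build the standard codon table (bases ordered T, C, A, G).
# Assumption: translation table 11 uses the same codon-to-amino-acid
# assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last blocks of the gene sequence listed above.
first_block = "ATGGCAGGCA AGGTGATAGT TACCGGCTTG"
last_block = "TGGGAAGAAG CGAAAAAAAT TAGGCCATCT TAG"

print(translate(first_block))  # first ten residues of the protein
print(translate(last_block))   # last ten residues; the trailing TAG is the stop codon
```

Passing the full gene sequence to `translate` gives a protein that can be compared with the protein sequence listed above, and `gc_fraction` on the same input can be compared against the listed GC content.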