Gene Information |
Locus tag | Moth_0084 |
Symbol | |
ID | 3832693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 85566 |
End bp | 87038 |
Gene Length | 1473 bp |
Protein Length | 490 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637828016 |
Product | MazG family protein |
Protein accession | YP_428966 |
Protein GI | 83588957 |
COG category | [R] General function prediction only |
COG ID | [COG3956] Protein containing tetrapyrrole methyltransferase domain and MazG-like (predicted pyrophosphatase) domain |
TIGRFAM ID | [TIGR00444] MazG family protein |
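The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length counts the stop codon while the protein length does not. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 85566
END_BP = 87038
GENE_LENGTH_BP = 1473
PROTEIN_LENGTH_AA = 490


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))    # compare with Gene Length
    print(protein_length(GENE_LENGTH_BP))   # compare with Protein Length
```

Under those assumptions, 87038 - 85566 + 1 gives the listed 1473 bp, and 1473 / 3 - 1 gives the listed 490 aa.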
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.302456 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCAGGCA AGGTGATAGT TACCGGCTTG GGGCCGGGGG ACCCGGCCCA GGTGCCAGAA GCTGTCCTTA AGGCCCTGGC CGGGGCGGAT AAGATTTACC TGAGAACCGC CCACCACCCG GCGGTAGCCG CCCTGGAGGT ACGGGGCCTT AAGTGGGAAA CCTTTGATTC TTTTTATGAG GAAGCGGCGG ACTTCGAGGA ACTCTACCAG CGCATAGCCA GTTTTCTCCT CACCGCGGTA GCTAAGACAG ATAAGAAACT GGCCTACGCC GTACCGGGTC ATCCCCTGGT CGCGGAGCGA AGCGTGGCCC TGCTGCTAGA AAAGGCCCCG GCAGCCGGTG TCGACCTGGA GATTATTCCG GCAATGAGCT GCCTGGATGC CCTTTACGCC ACCCTCAAAA TAGACCCCGC CCTGGGGTTG ACCGTAGCCG ACGCCCTGAC CTTCACGGTG GCTGGCCTGG ACCCGGCCCG GGGTTTAATC CTTACCCAGG TATATAACCG CCGGGTAGCC GGGGAGATTA AATTAGACCT TATGACTGTC TACCCGGATG AATACCCGGT AACCGTCGTC CGCGGCGCCG GTCTTCCTGA CGGCGAAAGG GTAGCCACCG TGCCCCTCTA CACCATCGAC CGCCTGGAAT GGCTTGATCA CTTAACCAGT CTTTACCTGG CACCCTACCC CGAGGGCCGG GATCGCACCC TGGCGGGCCT GGAAGCCATT ATGGCTCGCT TACGAAGTCC GGAGGGGTGC CCCTGGGACC GGGAGCAAAC CCATATCACT CTGAAACGTT ATCTGGTAGA AGAGACCTAC GAGGTCCTGG AGGCCATCGA CGCCGGGGAC ATGAATAAAC TATGCGAAGA ATTGGGAGAC TTACTGCTAC AGGTGGTCTT CCACGCCAGG CTGGCGGAGG AAGAGGGTGA TTTTACCCTG GCCGACTGCC TGGAGGGTAT CTGTGCCAAG ATGCGCCGGC GCCACCCCCA CGTCTTCGGC CAGGCCGTTC TGAATACGGC CGGGGAAGTA CTGGCGCGCT GGGACCAGAT CAAGGCTACC GAAAGGAGAG AAAAAGGCGA AGAAGCACCG TCGGTATTGA GCGTGCCCCG GGGCTTGCCG GCCCTCTTAA AGGCCTTGAA GGTCCAGGAG CAGGCCGCCC GGGTGGGTTT TGACTGGCCC CGGATAGAGG AAGTCTGGAC CAAGGTGGAA GAAGAACTGG ACGAGTTGAA AAAAGCGGTT GCCGGCGCCG GGGTGGAGGA GCAGGCGGCT GAGATGGGGG ATCTCCTCTT CTCCCTGGTC AACCTGGCCC GCTGGCTCCA GATTGAACCG GAGGCGGCGC TTCAGGCAAC GGTTGCTAAA TTCGCTCGGC GCTTCAATTA TATAGAAAAG GCCGCCCTGA AGGGAGGCAG GGATATCGAG GATCTCTCCC TGGCGGAGAT GGACGCCCTC TGGGAAGAAG CGAAAAAAAT TAGGCCATCT TAG
Protein sequence | MAGKVIVTGL GPGDPAQVPE AVLKALAGAD KIYLRTAHHP AVAALEVRGL KWETFDSFYE EAADFEELYQ RIASFLLTAV AKTDKKLAYA VPGHPLVAER SVALLLEKAP AAGVDLEIIP AMSCLDALYA TLKIDPALGL TVADALTFTV AGLDPARGLI LTQVYNRRVA GEIKLDLMTV YPDEYPVTVV RGAGLPDGER VATVPLYTID RLEWLDHLTS LYLAPYPEGR DRTLAGLEAI MARLRSPEGC PWDREQTHIT LKRYLVEETY EVLEAIDAGD MNKLCEELGD LLLQVVFHAR LAEEEGDFTL ADCLEGICAK MRRRHPHVFG QAVLNTAGEV LARWDQIKAT ERREKGEEAP SVLSVPRGLP ALLKALKVQE QAARVGFDWP RIEEVWTKVE EELDELKKAV AGAGVEEQAA EMGDLLFSLV NLARWLQIEP EAALQATVAK FARRFNYIEK AALKGGRDIE DLSLAEMDAL WEEAKKIRPS
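Both sequences above are printed in space-separated blocks of 10 characters, so the spaces have to be removed before any downstream use. The following is a minimal sketch of that clean-up plus a simple GC-fraction calculation. The helper names are hypothetical, and the example uses only the first three blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence shown above.
    first_blocks = "ATGGCAGGCA AGGTGATAGT TACCGGCTTG"
    cleaned = clean_sequence(first_blocks)
    print(len(cleaned), round(gc_fraction(cleaned), 3))
```

Running the same two functions on the full gene sequence would allow a comparison with the Gene Length and GC content fields in the table. The exact rounding used for the listed percentage is not stated in the record.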