Gene Moth_2100 details


Gene Information

Locus tag: Moth_2100
Symbol:
ID: 3832466
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2192178
End bp: 2193230
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 59%
IMG OID: 637830025
Product: methyltransferase MtaA/CmuA
Protein accession: YP_430935
Protein GI: 83590926
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0407] Uroporphyrinogen-III decarboxylase
TIGRFAM ID: [TIGR01463] methyltransferase, MtaA/CmuA family
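
The coordinates and lengths listed above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it is consistent with the listed gene length) and that the protein length excludes the stop codon. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 2192178
END_BP = 2193230
GENE_LENGTH_BP = 1053
PROTEIN_LENGTH_AA = 350


def span_length(start, end):
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Here 2193230 - 2192178 + 1 = 1053 bp, and 1053 / 3 = 351 codons, one of which is the stop codon, leaving 350 residues.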


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCGCCA GCGAGCAAAT GGCAGGTAGT GAGAGGGTGA TAGCAGCCGT GCAGGGGCAG 
GAGGTTGACC GCTTCCCCCT GGTTACGCCG ACCTCGGTGG TGACGGTAGA AAGCATGACC
GTCACCGGTG TTTATTTCCC GGAGGCCCAC ACCGACCCCT ATAAAATGGC CGCCCTGGCT
GCGGCCGGCC ACGAATTACT GGGCTTTGAT ACCGTCACCC CTTATTTCAG CATCCTGCTT
GAGGCGGCGG CCCTTGGGTG CGAAGTGGAC TTGAACTCGG TGGACGCCAT GCCAGCCATT
AAAATTAACC CTCTGAAGAA CCTTTTGGAG AGGAAGTGGG ACTGGCGCCC GCCTGCCAAT
TTCCTGGATC GGCAACCGGT AAAAGCCCTC CTGGCTGCTA TCAGACTATT AAAAAAGCGC
TATGGCAGGC GCGTGGCCGT GGTGGGTAAG GTGATCGGCC CCTGGACCCT GGCTTACCAT
CTGTGCGGGG TTCAGGACTT CCTCCTAGGG CTGGTTCTGG AACCGGAAGC CGTCCGGGAA
CTCTTAGAGC GGTTGCTGGC CGTTCCTTTG CGTCTGGCAG TAGCTGAGAT TGAAGCCGGG
GTTGATGTCC TCACCTGGGC TGATCACGCT ACCAGCGACC TGGTCAGCGC TGCTGCTTAC
CGGGATTTTC TCCTGCCTCT CCACCAGAGG GCTATGGAGC AATTAGCCGG TAGTTGTCCG
GTGATTTTGC ATACCTGTGG CCGGGCTACC GACCGGGTGG CTTATTTCGC CCGGGCTGGG
TTTACCGCCT TTCATTTTGA CTCCCGCAAC CCGGTCGGCG ATCTTCTGTC CCTGGCCAAT
GGCCGGTTGA ATCTCATCGG TGGCATCAAC AACCCCCAGA CCTTGCTGAA CGGTAAAGTG
AAGGAAGTTA GAGCAACCAT CGAAGGCCTG TTACAGGCGG GTATCAAGAT GGTAGCCCCG
GAATGCGCCG TGCCCCTGCG GACACCCAAC CAGAACCTCC GGGCCATAGT TCAGGCGGTG
CGCGACTTCA GCCGCCGCCA CCGGAAGGTT TGA
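
The gene sequence is printed in space-separated groups of 10 bases, so it needs to be joined before any calculation. The following is a minimal sketch of how a GC percentage such as the one in the record could be computed from such a block. The helper names are hypothetical, and only the first row of the sequence is used as sample input, so the printed value applies to that row and not to the whole gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a formatted block and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(text):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence above (6 groups of 10 bases).
first_row = "GTGTCCGCCA GCGAGCAAAT GGCAGGTAGT GAGAGGGTGA TAGCAGCCGT GCAGGGGCAG"

print(len(clean_sequence(first_row)))
print(round(gc_percent(first_row), 1))
```

Passing the full 1053 bp block to `gc_percent` instead of `first_row` gives the value for the whole gene.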
 
Protein sequence
MSASEQMAGS ERVIAAVQGQ EVDRFPLVTP TSVVTVESMT VTGVYFPEAH TDPYKMAALA 
AAGHELLGFD TVTPYFSILL EAAALGCEVD LNSVDAMPAI KINPLKNLLE RKWDWRPPAN
FLDRQPVKAL LAAIRLLKKR YGRRVAVVGK VIGPWTLAYH LCGVQDFLLG LVLEPEAVRE
LLERLLAVPL RLAVAEIEAG VDVLTWADHA TSDLVSAAAY RDFLLPLHQR AMEQLAGSCP
VILHTCGRAT DRVAYFARAG FTAFHFDSRN PVGDLLSLAN GRLNLIGGIN NPQTLLNGKV
KEVRATIEGL LQAGIKMVAP ECAVPLRTPN QNLRAIVQAV RDFSRRHRKV
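
The gene sequence starts with GTG while the protein starts with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is an accepted alternative start codon and is read as methionine at the initiation position. The sketch below illustrates this on the first row of the gene sequence. It is a simplified translator written for this example, not the tool that produced the record, and the set of start codons is taken from the NCBI definition of table 11 as far as I know it.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons allowed by table 11 (assumed from the NCBI genetic code listing).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(text):
    """Translate a CDS (or its 5' fragment), reading a valid start codon as M."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


first_row = "GTGTCCGCCA GCGAGCAAAT GGCAGGTAGT GAGAGGGTGA TAGCAGCCGT GCAGGGGCAG"
print(translate_cds(first_row))  # MSASEQMAGSERVIAAVQGQ
```

The output matches the first 20 residues of the protein sequence above. Internal GTG codons in the same row are still translated as valine, which is why the start codon is handled as a special case.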