Gene Moth_2102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2102 
Symbol 
ID3832468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2194797 
End bp2195813 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content55% 
IMG OID637830027 
Productmethyltransferase MtaA/CmuA 
Protein accessionYP_430937 
Protein GI83590928 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0407] Uroporphyrinogen-III decarboxylase 
TIGRFAM ID[TIGR01463] methyltransferase, MtaA/CmuA family 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCTGCCA TGACCCCCAA AGAGCGTTTC TTGCGTGCCT TAAACCGCGA ACATGTAGAC 
CGCATCCCGG TAGGTAACCC GGTATCAGTA GCTACGGTGG AATCCATGGA AGCCTGCGGC
GCCTATTTCC CTGATGTCCA TTTGAACCCG GAGAAAATGG CCACCCTGGC AGCGACGGGG
TATGAAATCC TGGGCTTCGA CACCATTGCC CCTTATTTCA GCGTCCAGCA GGAAGCTGCG
GCCTTTGGCC TTAAAATGAA CTGGGGCACC GTCGACTCTA TGCCAGATGT CCTGGAGAAC
CCCTTTGAAG ACCCAGACGA CATTATCATC CCGGCTGACT TTTTAGAGCG ACCGCCCATT
CGTACCGTAC TTGAGGCACT GAAAATCCTC AAAAAAGAGT ACGGCGATCA CGTCTGCCTG
GTGGGTAAGG TCATGGGACC CTGGACCCTG TCCTACCATC TCCACGGCGT GCAAAAATTC
CTCATTAAGA CCATCCTGGA ACCGGATAAA GTGCGCCGTT TCCTCGATAA GCTAAAGGAC
CTGGCGGTGA AATTCGCCAA CGCCCAGTTT GCCGCCGGGG CCGACGTGGT GACGGTGGCC
GACCACGCCA CCGGCGACCT GGTAAGCGGT ACCTGCTACC GCGACTTTCT CCTGCCGATT
CACCAGGAGA TGACACGACA GCTCAATGGA CCGACTATCC TGCATATCTG CGGCAATACC
ACCGACCGTC TGGATTATAT CGCCCAGGCG GGGTTTAACT GCTTCCACTT TGACTCCAAG
GTCAATCCGC GACAGGCCCA CGAGATTGTC AATGGCCGTA TTGCTTTGAC GGGCAGTATC
AATAACCCGA AAACCCTCTT TAATGGCACG CCGGATGATG TCCGCCGGGA GGTCTTTGCC
AACTGCGAGG CAGGTATCGA GATTATCTCT CCCGAGTGTG CCGTGCCTTT GCGGACTCCC
AATGCCAACT TGAAGGCCAT TGTTACAGCA GTGGAGGAAT ATTGCCGGAA CCATTAA
 
Protein sequence
MPAMTPKERF LRALNREHVD RIPVGNPVSV ATVESMEACG AYFPDVHLNP EKMATLAATG 
YEILGFDTIA PYFSVQQEAA AFGLKMNWGT VDSMPDVLEN PFEDPDDIII PADFLERPPI
RTVLEALKIL KKEYGDHVCL VGKVMGPWTL SYHLHGVQKF LIKTILEPDK VRRFLDKLKD
LAVKFANAQF AAGADVVTVA DHATGDLVSG TCYRDFLLPI HQEMTRQLNG PTILHICGNT
TDRLDYIAQA GFNCFHFDSK VNPRQAHEIV NGRIALTGSI NNPKTLFNGT PDDVRREVFA
NCEAGIEIIS PECAVPLRTP NANLKAIVTA VEEYCRNH