Gene Moth_0840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0840 
Symbol 
ID3831537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp872515 
End bp873471 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content61% 
IMG OID637828770 
Productphospho-N-acetylmuramoyl-pentapeptide- transferase 
Protein accessionYP_429700 
Protein GI83589691 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0472] UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase 
TIGRFAM ID[TIGR00445] phospho-N-acetylmuramoyl-pentapeptide-transferase 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.723445 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGAAG CCTTGAAACC CCTAGTCCTG GCGGCGGTGG TTACCCTTAT CCTGGGCCCG 
CCGGTCCTCG CCTTTCTCAG GCGCCTGAAG GCCGGCCAGA CCGTCCGCAG CGACGGCCCC
CGGAGCCACT TGGCCAAAGC GGGTACCCCG ACCATGGGTG GGGTCCTGTT TCTCATTGGC
CTGACTGTGT CTACCCTGGT CCTGGCCCCA CCCTCCCCCT TAACCCTGTC TACCTTGATC
CTTACTTGGG GTTATGCCCT GATTGGGCTG GTCGACGACG GTTTAAAGGT AATTCTACAC
CGCCCCTTAG GCCTCATGGC CCGGCAAAAG CTCGGTGGCC AGGTTCTCCT GGGCCTGGTG
GCCGGAGTGG CGGCCATGCT CTGGCTGGGG CGGGGGAGCG TCATTCAGGT GCCCGTAACC
GGTTGGCACT GGGACCTGGG CTGGTATTAC CCCCTCCTGG CGGCCCTGCT CCTGGTGGCG
ACGACCAATG CCGTCAACCT TACCGACGGC CTGGATGGCC TGGCAGCGGG GATCACCCTG
TGGGTTGCCC TGGCCTACGG GATTCTAGCC CTGACCCTGG GTCAGGGGGA ACTGGTTACA
TTTGCCATGG CTCTGGCAGG AGGATGTCTG GGATTTTTAG TGTATAATTT TCATCCGGCG
AGGGTTTTTA TGGGAGATAC CGGCTCTCTG GCCCTGGGGG CGGCCATTGG CTTCCTGGCT
ATCATGACCA GGACCGAACT GGTCCTGCCA GTCCTGGGGG GCGTCTATGT CCTGGAGACC
CTTTCGGTAA TCCTGCAGGT GGTCTCCTTT CGCCTCACAG GCCGGCGCCT CTTTCGCATG
AGCCCCCTGC ACCACCATTT CGAGCTGGGG GGGTGGCCGG AGAGCAGGGT AGTGCTTTTT
TTCTGGGCCC TGGCTATAAT CATGGCCCTG GCCGGTCTTT ATCTTTTAAC TATTTAA
 
Protein sequence
MIEALKPLVL AAVVTLILGP PVLAFLRRLK AGQTVRSDGP RSHLAKAGTP TMGGVLFLIG 
LTVSTLVLAP PSPLTLSTLI LTWGYALIGL VDDGLKVILH RPLGLMARQK LGGQVLLGLV
AGVAAMLWLG RGSVIQVPVT GWHWDLGWYY PLLAALLLVA TTNAVNLTDG LDGLAAGITL
WVALAYGILA LTLGQGELVT FAMALAGGCL GFLVYNFHPA RVFMGDTGSL ALGAAIGFLA
IMTRTELVLP VLGGVYVLET LSVILQVVSF RLTGRRLFRM SPLHHHFELG GWPESRVVLF
FWALAIIMAL AGLYLLTI