Gene Moth_0517 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0517 
Symbol 
ID3831819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp536384 
End bp538357 
Gene Length1974 bp 
Protein Length657 aa 
Translation table11 
GC content59% 
IMG OID637828451 
ProductN-acetylmuramoyl-L-alanine amidase 
Protein accessionYP_429390 
Protein GI83589381 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0860] N-acetylmuramoyl-L-alanine amidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000337238 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTAGAC GCCTGACAAA TGGCCGGTGG GGCCTGACGC TGCTGATACT GGCGTTGATG 
TACATAATTG CGCTGGGGCT GATGGCCCGG CCGGCTGCAG CTGACCCCGG GATTACTCTA
GTCTTGAACG GCAGCAGGGT TAACCCCTCG GTTCCAGCCT ACACCGACAG CAACGGTCGT
ACCATGGTAC CTGTGCGTTT TGTCATGGAA CACATGGGGG GCAGGGTGGA GTGGCTGGAT
GCCGAGCAGG GGATAGTAGT CAGCCGGGGA GCGACAACTT TAAAAATGTG GATTGGCAAA
CGCCAGGCCC AGGTCAACGG CCAGGCTATT GACCTGGATA CAGTACCTGT CCTCCAGGAT
GGCACCAGCA TGGTACCGGT GCGTTTTGTC GCCCAGGCTT TCGGCGGGAA GGTTGAATGG
GATGATGCCT CCCGGACAGT TAGCATCTGG CTGGGTACGG CGTCGCCCGC CGGCCAGGTG
CGGATAACCG GCAGTTATGT CAATGTCCGG ACCGGGCCGG GGACTTCCTA TGGGGTAATT
GATGTCCTGC CCAGGGACAC GCTGGTGCAA TTGTTGGCTA CAGGCGATGG ATGGTACCAG
GTGCAGTTGC CGGATGGACG CCAGGGTTGG GTTTCGGCCA GTTATTCCGA AGTGCTCCAG
GGCAACAACC AACCCCAGGA TACCAATCCT CCCGGCAATA ATCAGCCCGG GAACGGGCAG
TCGCCAGGCA ACAACCCGTC ACCTGGCAAT AATCAGCCGG GGAATGAAGA ACCACCGTCC
GGACAACCCT TGGGCACAGC GATAATCGGT AACAAGCCGG TGGCCATTTT AGCCGGACCT
AACCCGGTGG AAAAACAAGT CGGTATGGCA CCGGCCGGCA GCCGGTTACC CATCTGGCAA
CAGCAGGGTG ACTGGTGGTT GGTGGAGCTG GATAATGGCC TGCGGGGCTG GCTGGCCAGT
TCCCTGGCAA CCTTTTCACC CGAAAAACCG GGCCAGGATA ATGGCGGGTC CGAAACGGGT
AACGGTGGGA CGGCACCTGG TGAAGGTAAT CAGGGAGTGG GCAACAGTGA TAGCAACAGC
CTCAAGATAA CCGGCGTCAC GGTAAATCCC GGGCCCGATT GGATTGAAGT AACGGTACAG
GGTACCCGGC CCTTTACCTT TAAAAGCTCC CGTTGGGCCG ACCACCTGAT TTTCGATATA
CCAGGAGCCA CCCTGGCGGT AGCACCGGGG CAGGACAAGG TGGAAGTGAA CCGGCAGCCG
CTGGCCCGGG TGCGCCTGGG ACAGTATGAT GCCAACACCG TGCGAGTGGT ATGCGATCTT
AATGGGGCAG CCAATTTTAC CACAACGACA GCCGGATCTA CTATAACCAT CAGGCTGCAA
AAACCCTCTG TCCGGGGGGC TAAAATTGTC ATTGATCCCG GCCATGGTAC CGACCCGCAA
GGTTCTGACC CCGGGGCTAT CGGTCCCAGC GGCGTTCAGG AGAAGGACGT CAACCTGGCC
ATCTCCCGGA AATTGGCGGA ACTCTTGCGC GCCGCCGGGG CGACGGTTTA TATGACCCGT
GATGGGGAAA CAACTCCGTA TACCCTATCC GGTAGGGCCT ATTACGCCAA CGAAGTCGGC
GCCGACCTTT TCATCTGCAT TCACTCCAAC GCGTCCCTGA GCCCTTCAGC CTCGGGTACA
TCAACCTATT TCTATGCGCC GCCGGGGACG GCCCTGGGAG AACAGCGGGA TGCACGCCAG
CGCCTGGCCA CCCTTATCCA GAGGGATCTG GTAGCTGCTA TCGGCCGGCG CGACCTGGGG
GTTAAAGAGG CCAATTTCGC AGTCCTGCGC AATACCAAAA TGCCCTCGGT GCTTGTAGAG
ACGGCCTTTA TCTCGAATCC TACGGAGGAG CAGCTCCTGG CCAGTCCTGA TTTCCAGGCC
CTGGTGGCCC AGGGGATCTT TAACGGCATC AGTGACTACC TCTCCGGCCA GTAG
 
Protein sequence
MVRRLTNGRW GLTLLILALM YIIALGLMAR PAAADPGITL VLNGSRVNPS VPAYTDSNGR 
TMVPVRFVME HMGGRVEWLD AEQGIVVSRG ATTLKMWIGK RQAQVNGQAI DLDTVPVLQD
GTSMVPVRFV AQAFGGKVEW DDASRTVSIW LGTASPAGQV RITGSYVNVR TGPGTSYGVI
DVLPRDTLVQ LLATGDGWYQ VQLPDGRQGW VSASYSEVLQ GNNQPQDTNP PGNNQPGNGQ
SPGNNPSPGN NQPGNEEPPS GQPLGTAIIG NKPVAILAGP NPVEKQVGMA PAGSRLPIWQ
QQGDWWLVEL DNGLRGWLAS SLATFSPEKP GQDNGGSETG NGGTAPGEGN QGVGNSDSNS
LKITGVTVNP GPDWIEVTVQ GTRPFTFKSS RWADHLIFDI PGATLAVAPG QDKVEVNRQP
LARVRLGQYD ANTVRVVCDL NGAANFTTTT AGSTITIRLQ KPSVRGAKIV IDPGHGTDPQ
GSDPGAIGPS GVQEKDVNLA ISRKLAELLR AAGATVYMTR DGETTPYTLS GRAYYANEVG
ADLFICIHSN ASLSPSASGT STYFYAPPGT ALGEQRDARQ RLATLIQRDL VAAIGRRDLG
VKEANFAVLR NTKMPSVLVE TAFISNPTEE QLLASPDFQA LVAQGIFNGI SDYLSGQ