Gene Moth_0844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0844 
Symbol 
ID3831541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp877214 
End bp878620 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content59% 
IMG OID637828774 
ProductUDP-N-acetylmuramate--L-alanine ligase 
Protein accessionYP_429704 
Protein GI83589695 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0773] UDP-N-acetylmuramate-alanine ligase 
TIGRFAM ID[TIGR01082] UDP-N-acetylmuramate--alanine ligase 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.952175 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGATT TGGAAACAGG GGGTTGGACC CATTTTGTCG GCATCGGTGG TGTGGGTATG 
AGCGCCCTGG CACGCATCTT GTTGGCCCAG GGTTACCGGG TCTCAGGATC GGACCCGAAG
GAGAACCAGT TTACCCGGAG CCTGGAGGCA GCCGGGGCCA TCATTTACCA CCAGCATGAT
GCCGCCAATC TGGCCCCTGG AGTCCAGGAA GTAGTAATTT CTTCGGCAGT ACCGTCGTCC
AATCCCGAAG TGGTGGCTGC CCGGCAGCGT TCGCTGCCGG TGGTTAAACG TGGGGAGCTG
CTGGCCCGGC TCTTTAACGC CCGCCGGGGT ATTGCCGTAG CCGGCGCCCA CGGTAAAACG
ACAACCTCGG CCCTGGTTGC CCTGGTAATG AAGGAAGGCG GTTTAGAACC GGCGGCGGTC
ATCGGCGGTT ATGTCCGGGA GTTTGCCAGT AATGCCTACC CCGGCCGGGG GGATTTTCTG
GTGGCGGAGG CTGATGAAAG CGACGGTTCT TTCCTCTGGT TAAAGCCGGA GATAGCCCTC
ATAACCAATA TTGAAGCCGA CCATCTGGAA CATTACGGGA GCCTGGACCG GATTGTCGCT
GCCTTTAAAG ACTTTATCGA TCAGATCCGG CCCGGCGGCA AGGCCATCCT GTGTGCTGAA
GATCCCCGAG TTGCCGGGCT GGTTGCCTGT AGTCCCAGAC AGGTAATTAC TTACGGCCTC
AATGGCAGGC CGGATTACAG GGCGACGGGG GTGCAAATGG CCGGAATGGG CGGGCGGGCC
GCTATTTATT ACCGGGAACA GTATCTGGGG CAACTCACTA TGGCGGTACC CGGACGCCAC
AATATCTTGA ATGCCCTGGG GGCCATTGCC GCAGGTCACC AGCTGGGGAT ACCCTTTGCC
GTTATGGCCC GCGCCCTGGG TCAGTTCCGG GGAGTGGGGC GGCGTTTCGA AATCCTCTGG
GATGACGGTA CTACCAGGGT GGTGGATGAC TATGCCCATC ACCCGACGGA AATCAGGGCG
ACCCTGGCGG CCGCCAGCCA GGTGGGAGCG AAACGGGTGG TGGCTGTTTT TCAACCCCAT
CGCTATACCA GGACCCACCA CCTGTACCGC GAGTTCGGGC AGGCCTTCAG GCAGGCTGAT
GTAGTAATCG TTAATGATAT TTACCCGGCC GGCGAAGCCC CCCTGCCGGG GGTTAATTCC
CAATTAATAA CCGGAGAAAT CAAAGGTAGT GGCCATCAGC AGGTGTACTA CCTGCCCACC
CTGGAAGAAA CCCTGGCTTT TTTAAAGAAA TCCTGCCGTC CCGGGGATCT GGTTTTAACC
CTGGGAGCGG GGGACGTCTG GCGGGTGGGG ATGGGCCTGG CGCAGTACCT GGAGGCCAAG
CAAATTTTGC CCGGAGTAGG AGCGTAG
 
Protein sequence
MADLETGGWT HFVGIGGVGM SALARILLAQ GYRVSGSDPK ENQFTRSLEA AGAIIYHQHD 
AANLAPGVQE VVISSAVPSS NPEVVAARQR SLPVVKRGEL LARLFNARRG IAVAGAHGKT
TTSALVALVM KEGGLEPAAV IGGYVREFAS NAYPGRGDFL VAEADESDGS FLWLKPEIAL
ITNIEADHLE HYGSLDRIVA AFKDFIDQIR PGGKAILCAE DPRVAGLVAC SPRQVITYGL
NGRPDYRATG VQMAGMGGRA AIYYREQYLG QLTMAVPGRH NILNALGAIA AGHQLGIPFA
VMARALGQFR GVGRRFEILW DDGTTRVVDD YAHHPTEIRA TLAAASQVGA KRVVAVFQPH
RYTRTHHLYR EFGQAFRQAD VVIVNDIYPA GEAPLPGVNS QLITGEIKGS GHQQVYYLPT
LEETLAFLKK SCRPGDLVLT LGAGDVWRVG MGLAQYLEAK QILPGVGA