Gene Moth_0841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0841 
Symbol 
ID3831538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp873497 
End bp874864 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content60% 
IMG OID637828771 
ProductUDP-N-acetylmuramoylalanine--D-glutamate ligase 
Protein accessionYP_429701 
Protein GI83589692 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0771] UDP-N-acetylmuramoylalanine-D-glutamate ligase 
TIGRFAM ID[TIGR01087] UDP-N-acetylmuramoylalanine--D-glutamate ligase 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.991231 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTGGC AGGGAAAGAG CGTTCTTGTA GTCGGCCTGG GCCGGAGCGG CCGGGCGGCG 
GCTACTGAAC TTGCCCGCCT GGGGGCAAAG GTGACAGCCT GTGACCGCCA GGCTCTGGCG
GAGGAAGAAC TGGAAAACCT GCGTAAGGAA GGTGTCCACC TAATCCTGGG TGGCTATCCT
GAGGTAAACG AATTACAACC CGATCTAGTT ATCACCAGCC CGGGGGTTCC GTCCGGGGAG
CCCCCCCTGG CCCGGGCCAG GGCCCGGGGA ATCCCCGTCT GGAGTGAACT GGAACTGGCT
TACCGATTGT TGCCTCCGGG GGTAAAGGTG GTGGCTATCA CCGGAACCAA CGGCAAGACC
ACAACAACTT CCCTGTGTGG CCGGATCCTC CAGGAAGCTG GTTGGCCGGC AGTAGTCGGG
GGCAATATTG GTATCCCCCT GGTGAAGGAA CTGCAGGAGA TAGCCCCCGG GAGCTATGTT
GTCTGCGAAG TGAGCAGCTT CCAGCTGGAA GCCATAACCT CCTTCCATCC GCAGGTGGCG
GCCATCTTAA ATATTACTCC GGACCACCTG GACCGTCATG GCAGCCTGGA GAACTACATT
GCTGCCAAGG CGCGGGTTAT GGCATACCAG GAGGCCAGGG ACTTCGCAAT CTTAAACTAC
GACGATCCCC ATACCAGGAG CCTGGCGGGC GGGGCCCGGT CCCGGGTGTT GTTCTTCAGT
CGCCGGGAAC GGCCACCCCT GGGTGCCTGG CTGGAGGACG GGGTGATTTG CTGCGACCTG
GGTGCCGGCG GGACTGTTAA ACTCTGCCAC TGTGAAGAGC TTTCCCTGAA GGGAAGCCAT
AACCTGGAGA ATTCCATGGC CGCAGCCCTG GTAGCCCTGG CCCTGGGGGT GGACCCGGAG
CAGCTGACCC GGACCCTGAA AACCTTTCCT GCCGTCCCCC ATCGCTTAGA ACCGGTGGCG
GAAATCAACG GGGTATGCTA TATCAATGAT TCCAAGGGAA CCAATCCCGA AGCGACCATG
AAGGCTATTA ATGCCTACTC CAATCCCCTG GTACTTATTG CCGGGGGTAG GAATAAGGGC
AGCGACTTTA CCCTGCTGGC CCAACAGATG GCCGGCCGGG TGAAGCACCT GGTGCTGGTG
GGGGAGGCAG CCCGGGAACT GGAGCAGGCC GCCAGGAAGG CGGGAATCGA CTCCATTTAC
CTGGCGCCGG ACTTTGCCAG TGCCGTCCGG GAAGCCGCCG GCGCCGCCCG TCCCGGGGAT
ATCGTCATGC TCTCTCCGGC CTGTGCTAGC TGGGATATGT TTAAAAACTA CGAAGAACGG
GGCGATGTTT TTAAGTCTTT AGTTCTGCAG ATGAAGGATG ACGGTTAG
 
Protein sequence
MSWQGKSVLV VGLGRSGRAA ATELARLGAK VTACDRQALA EEELENLRKE GVHLILGGYP 
EVNELQPDLV ITSPGVPSGE PPLARARARG IPVWSELELA YRLLPPGVKV VAITGTNGKT
TTTSLCGRIL QEAGWPAVVG GNIGIPLVKE LQEIAPGSYV VCEVSSFQLE AITSFHPQVA
AILNITPDHL DRHGSLENYI AAKARVMAYQ EARDFAILNY DDPHTRSLAG GARSRVLFFS
RRERPPLGAW LEDGVICCDL GAGGTVKLCH CEELSLKGSH NLENSMAAAL VALALGVDPE
QLTRTLKTFP AVPHRLEPVA EINGVCYIND SKGTNPEATM KAINAYSNPL VLIAGGRNKG
SDFTLLAQQM AGRVKHLVLV GEAARELEQA ARKAGIDSIY LAPDFASAVR EAAGAARPGD
IVMLSPACAS WDMFKNYEER GDVFKSLVLQ MKDDG