Gene Moth_0839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0839 
Symbol 
ID3831536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp871125 
End bp872522 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content62% 
IMG OID637828769 
ProductUDP-N-acetylmuramoyl-tripeptide--D-alanyl-D- alanine ligase 
Protein accessionYP_429699 
Protein GI83589690 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0770] UDP-N-acetylmuramyl pentapeptide synthase 
TIGRFAM ID[TIGR01143] UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D-alanine ligase
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.58965 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGACAG CTACGGTCAG GGAGATCTGT GACGCCATCG GCGGCTACCT GGTAGCCGGG 
GATCCGGCAG TTGTGGTAAA GGGCCTCAGT ACCGACAGCC GGGAGATCCA GCCGGGTATG
GCCTTTGTTG CCTTGAAGGG AGAACGCTTT GACGGCCATG ATTTTATTGG TGCGGCCCTA
TCCGGCGGCG CGGCGGCGGC CATAGGAACC AGTTTCCCCG GCCATCAGGT TGTCGGGCTG
CGCCCCGGCC AGGCTTTGAT CCAGGTTGAC GATACCCTTC TGGCCCTGCA AAAACTGGCA
GCCTACCACC GGCAGAAAGT TTTAAAGGGT CCCCTGATCG GGGTAACCGG TTCCAGCGGC
AAAACAACGA CCAAAAACCT CATTGCCGCC GTTTTGAGCC CGGTCCTGAA GGTCCTAAAA
ACGCCGGGGA ATTTTAATAA CGAGATCGGC CTGCCCATGA CCCTTTTACG CCTGGCCCCC
TGGCACCAGG CAGCCGTGGT GGAGATGGCC ATGCGGGGCA AAGGAGAGAT AGCCTCCCTG
GCGGCCATAG CCCGACCGAC CATTGGCGTC ATTACCAACA TCGGTACCAC CCACCTGGGA
CTCCTCGGTT CGATAGCCAA TATTGCCGCT GCCAAAGGGG AGTTGCTGGC AGCTTTGCCC
GCAGAAGGAC TGGCGGTTTT GAATGGCGAT AATGAGTGGT GCCGCCGGCT GGCGGCCACC
TGTCCTTGCC GGGTGGTCTT CTTCAGTACC CGGGGCCAGG GGGAGATTTA CGCCCGGGAT
ATTGGCGATA GAGGCCTGGA GGGGACTTTT TTTACGGCCG TTTTTCCCGG TAAGGAAGTC
CGGGTGCAGC TACCGGTCCC GGGGCGGCAC AATGTGGAAG ATGCCCTGGC AGCCCTGGCT
GTAGGCTTTT CCCTGGGCGT GGAACCGGAA ACTATGGCCC GGAGCCTGGC TGACTGGCCG
GCCGAGGATT TGCGCCAGGA CCTGCGTCCG GGCCCGGGCG GTTCCCTGGT TTTTAACGAC
GCATATAACG CCAACCCTGA ATCCATGGAG GCCGCCCTCC AGGCCCTGGG CGCTTTACCC
GGCCGCCGGG TGGCCGTCCT GGGGGCCATG CTAGAACTTG GACCGGCAGA AGTAGAACTG
CACCGGCGGG TGGGACGATT CGCCGCTGCC CGGGGCCTTT ATCGCCTGAT AACTGTGGGC
GAACTGGCCG GGGAGATAGC CGCCGGCGCT CTGGAAGCCG GGATGCCGGC GGATCAGGTC
TTTGCCTGCC TTACCCACGA GGAGGCGGCC GGGCACCTAC GGGGACTGGG CCATGGAGAT
GTAGTCCTTT TTAAAGGATC GCGTCTTACC GGCATGGAAA AGGTGCTGGC CATCTGGGAG
GGCTCCGAAC ATGATTGA
 
Protein sequence
MLTATVREIC DAIGGYLVAG DPAVVVKGLS TDSREIQPGM AFVALKGERF DGHDFIGAAL 
SGGAAAAIGT SFPGHQVVGL RPGQALIQVD DTLLALQKLA AYHRQKVLKG PLIGVTGSSG
KTTTKNLIAA VLSPVLKVLK TPGNFNNEIG LPMTLLRLAP WHQAAVVEMA MRGKGEIASL
AAIARPTIGV ITNIGTTHLG LLGSIANIAA AKGELLAALP AEGLAVLNGD NEWCRRLAAT
CPCRVVFFST RGQGEIYARD IGDRGLEGTF FTAVFPGKEV RVQLPVPGRH NVEDALAALA
VGFSLGVEPE TMARSLADWP AEDLRQDLRP GPGGSLVFND AYNANPESME AALQALGALP
GRRVAVLGAM LELGPAEVEL HRRVGRFAAA RGLYRLITVG ELAGEIAAGA LEAGMPADQV
FACLTHEEAA GHLRGLGHGD VVLFKGSRLT GMEKVLAIWE GSEHD