Gene Smed_2084 details


Gene Information

Locus tag: Smed_2084
Symbol: murD
ID: 5322943
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 2141181
End bp: 2142572
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 64%
IMG OID: 640791021
Product: UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase
Protein accession: YP_001327752
Protein GI: 150397285
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0771] UDP-N-acetylmuramoylalanine-D-glutamate ligase
TIGRFAM ID: [TIGR01087] UDP-N-acetylmuramoylalanine--D-glutamate ligase
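
The start, end, gene length and protein length fields of this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that the coordinates are 1-based and inclusive and that the protein length excludes the stop codon (the record does not state either convention); the variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 2141181
end_bp = 2142572
gene_length_bp = 1392
protein_length_aa = 463

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length, computed_protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.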


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0517095
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.000454332
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGATCCCGG TCACTTCATT CAAGGGTAGG AAGGTCGCAC TCTTCGGGCT GGGCGGCTCC 
GGACTGGCGA CCGCCCAGGC GCTCGTTGCA GGCGGAGCCG ATGTGGTGGC TTGGGACGAC
AACCCCGACA GCGTCGCCAA GGCGGATCAG GCCGGGATCG CGACGGCCGA TCTGCGGGGC
GAGGAATGGC ATGCCTTTTC CGCCTTGGTC CTTTCGCCCG GCGTGCCGCT GACCCATCCA
AAGCCGCATT GGAGCGCCGA CCTCGCGCAT CATGCCGGCG TCGAGATCAT CGGCGATGTC
GAGCTGTTCG TGCGCGAGCG GCGCAAGCAC GCGCCTGACT GCCCTTTCAT TGCCATCACC
GGCACCAACG GCAAATCCAC GACGACGGCG CTGATCGCCC ATATCCTGCG CGCAAGTGGG
CGGGACACAC AGCTCGGCGG CAATATAGGC ACAGCGGTGC TGACGCTGGA GCCGCCGCAG
GCGGACCGCT TCTATGTCGT CGAATGCTCA TCCTACCAGA TCGACCTGGC ACCCACGCTC
GATCCCACCG CCGGGATACT CCTCAACCTC ACGCCGGATC ATCTGGATCG CCATGGTACG
ATGCAGCACT ATGCCGACAT CAAGGAGCGC CTGGTGGCGG GGAGCGGAAC GGCGATTGTC
GGTGTCGACG ACAGCCTTTC GAGTCTGATT GCCGACCGGG TGGAGCGAGC AGGTACCAAG
GTCGTGCGTA TCTCGCGCCG TCATCCGCTT GCCGAAGGTG TCTATGCCGA AGGTACGGCG
CTGATGCGTG CGACTGGCGG GGCATCGTCG CTCTTTACCG ACCTTGCCGG GATCCAGACG
CTGCGTGGCG GTCACAATGC CCAGAATGCC GCGGCCGCGA TCGCCGCGTG CCTGGCGGTC
GGCATTTCCG AAAAGGACAT AGTGGACGGC CTCAGAAGCT TTCCGGGGCT CAAGCACCGG
ATGCAGCCGG TTGCGAAGAA GGGCGAGACC ATCTTCGTCA ACGATAGCAA GGCGACCAAC
GCCGAGGCCG CAGCACCGGC GCTGTCGAGT TACGACCGTA TCTACTGGAT CGCCGGCGGT
CTGCCGAAGG AGGGCGGCAT CACCTCGTTG ACGCCATTCT TTCCGAAAAT CGTCAAAGCC
TATCTGATCG GAGAGGCGGC GCCGTCTTTC GCGGCGACCC TCGGCGAGGC AGTGCCCTAC
GAAATCTCGG GGACATTGGA AAAAGCGGTT GCGCATGCGG CATCGGACGC GGCGCGCGAT
GCCGGGGCGC CGGCGACCGT GATGCTTTCC CCGGCTTGCG CAAGCTTCGA CCAGTATAAG
AACTTCGAAC TGCGCGGAGA TGCCTTCGTC GAGCACGTGA AGGCGCTCGA GGGCGTGATC
ATGCTCATCT GA
 
Protein sequence
MIPVTSFKGR KVALFGLGGS GLATAQALVA GGADVVAWDD NPDSVAKADQ AGIATADLRG 
EEWHAFSALV LSPGVPLTHP KPHWSADLAH HAGVEIIGDV ELFVRERRKH APDCPFIAIT
GTNGKSTTTA LIAHILRASG RDTQLGGNIG TAVLTLEPPQ ADRFYVVECS SYQIDLAPTL
DPTAGILLNL TPDHLDRHGT MQHYADIKER LVAGSGTAIV GVDDSLSSLI ADRVERAGTK
VVRISRRHPL AEGVYAEGTA LMRATGGASS LFTDLAGIQT LRGGHNAQNA AAAIAACLAV
GISEKDIVDG LRSFPGLKHR MQPVAKKGET IFVNDSKATN AEAAAPALSS YDRIYWIAGG
LPKEGGITSL TPFFPKIVKA YLIGEAAPSF AATLGEAVPY EISGTLEKAV AHAASDAARD
AGAPATVMLS PACASFDQYK NFELRGDAFV EHVKALEGVI MLI
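
The gene and protein sequences above are printed in blocks of 10 characters, so the whitespace has to be stripped before they can be compared. The following is a minimal Python sketch, not the database's own tooling, that translates a nucleotide fragment and computes a GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is built from the standard NCBI amino-acid string, which assigns the same amino acids under translation table 11 as under the standard code. The two tables differ in permitted start codons, which does not matter here because the gene starts with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G). The amino-acid assignments
# are the same for translation table 11 and the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna):
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 and last 12 nucleotides of the gene sequence, as printed above.
first_30_nt = "ATGATCCCGG TCACTTCATT CAAGGGTAGG"
last_12_nt = "ATGCTCATCT GA"

print(translate(first_30_nt))  # MIPVTSFKGR
print(translate(last_12_nt))   # MLI*
```

The two fragments translate to the first ten and last three residues of the listed protein sequence, followed by a stop codon. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.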