Gene TM1040_2018 details


Gene Information

Locus tag: TM1040_2018
Symbol: murE
ID: 4077475
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2121197
End bp: 2122681
Gene Length: 1485 bp
Protein Length: 494 aa
Translation table: 11
GC content: 66%
IMG OID: 638007333
Product: UDP-N-acetylmuramoylalanyl-D-glutamate--2,6-diaminopimelate ligase
Protein accession: YP_614012
Protein GI: 99081858
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0769] UDP-N-acetylmuramyl tripeptide synthase
TIGRFAM ID: [TIGR01085] UDP-N-acetylmuramyl-tripeptide synthetase
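
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the gene record above.
start_bp = 2121197
end_bp = 2122681
stated_gene_length = 1485      # bp
stated_protein_length = 494    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print("gene length:", gene_length, "matches record:", gene_length == stated_gene_length)
print("protein length:", protein_length, "matches record:", protein_length == stated_protein_length)
```

Under these assumptions both derived values agree with the record.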


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.821949
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACAGA GACCAGCGCT CAAGCTCAGC CAGTTGGGAC TGACCGCAAG AGCGGGCCTG 
GACCCGCAGA TCACGGGGCT TGCGGTGGAC AGCCGCGAGG TTGGCGAGGG TTTTGTTTTT
GCCGCCCTGC CCGGCACGCG CGTGCATGGT GCAACCTTTG TCGAACAGGT GCTCGATCAG
GGCGCGGTGG CCATTCTGAC CGACGCCAAG GGCGCCGAGA TCGCAGGCGA GGCCATTGCC
GCAGCGGGCG CAGCCCTTGT GGTGGCCGAA GACCCGCGGC AGGCGCTCTC GGGTGCAGCG
GCGCTCTGGT TTGGCGCCCA GCCCCCGGTG ATGGCAGCCG TGACAGGCAC CAATGGCAAG
ACCTCCGTGT CGACCTTCCT GCGCATGATC TGGACCGAGC TTGGCCACAA GGCCGTGAAC
CTTGGCACCA CCGGCATCGA GGGCGCATGG TCACATCCGC TGGCGCATAC CACGCCCGAG
CCGATCACCC TGCACCGCGC GCTTGCGGCA GCAGCCGAGG CGGGCGTCAC CCATGCGGCG
ATGGAGGCCT CCTCGCATGG GCTGGATCAG CGGCGGCTGG ACGGTGTACA GCTCTCGGCG
GCGGGTTTCA CGAATTTCAC CCAGGATCAC CTCGACTATC ACGAGACCTT TGAGGCCTAT
TTTGCGGCCA AGGCAGGGCT TTTCCGTCGT GTGCTCTCGG AAGATGGCGT CGCCGTCATC
AATATGACCG ACCCCAAAGG GGCTGAGATG CGCGCCATTG CTGCCGCCCG CGGGCAGGAG
ATCATTACGG TTGGGCGCGG TCTGGGTGAC ATTGCCCTGA TGGGTATGCG AGTCGATGCC
ACCGGGCAGG ACATCCGGTT CACATGGCAC GACCGCCCCT TTGCCAAGCG GTTGAACCTC
ATCGGCGGCT TTCAGGCGGA AAACGTGCTG GTGGCGGCGG GTCTGGCGAT TGCCAGCGGC
GAGGACCCCG AGCAGGTGTT TGACACCCTG CCTCACCTCA GCACGGTGCG CGGGCGGATG
CAGCTTGCGG CAACCCGCGA CAATGGCGCG ACGGTGTTTG TGGATTACGC CCACACCCCC
GACGCGGTTG CCACCGCGAT CAAGGCGCTG CGCCCGCATG TTCTGGGCCG CCTTGTGGCG
ATCGTCGGCG CGGGCGGGGA TCGCGATGCA ACCAAACGCC CCTTGATGGG CGCCGCAGCG
CAGGACAATG CCGATGCGGT GATCGTCACC GATGACAACC CCCGCTCTGA AGATCCCGCC
GCCATTCGCG CGGCCGTCAT GGGCGGCGCG CCGGACGCGC TCAATGTGGG CGACCGCGCC
GAAGCGATCC TGCGCGGCGT CGATATGCTC GAGGCTGGCG ATGCGCTCCT CATCTGCGGC
AAGGGCCATG AGAGCGGCCA GACCATCGGC ACCGATGTAT TGCCCTTTGA CGACGTGGAG
CAGGCCAGCA TGGCCGTCGC CGCCCTTGAC GGGAGAATGG TATGA
 
Protein sequence
MTQRPALKLS QLGLTARAGL DPQITGLAVD SREVGEGFVF AALPGTRVHG ATFVEQVLDQ 
GAVAILTDAK GAEIAGEAIA AAGAALVVAE DPRQALSGAA ALWFGAQPPV MAAVTGTNGK
TSVSTFLRMI WTELGHKAVN LGTTGIEGAW SHPLAHTTPE PITLHRALAA AAEAGVTHAA
MEASSHGLDQ RRLDGVQLSA AGFTNFTQDH LDYHETFEAY FAAKAGLFRR VLSEDGVAVI
NMTDPKGAEM RAIAAARGQE IITVGRGLGD IALMGMRVDA TGQDIRFTWH DRPFAKRLNL
IGGFQAENVL VAAGLAIASG EDPEQVFDTL PHLSTVRGRM QLAATRDNGA TVFVDYAHTP
DAVATAIKAL RPHVLGRLVA IVGAGGDRDA TKRPLMGAAA QDNADAVIVT DDNPRSEDPA
AIRAAVMGGA PDALNVGDRA EAILRGVDML EAGDALLICG KGHESGQTIG TDVLPFDDVE
QASMAVAALD GRMV
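
The two sequences above are printed in blocks of ten characters, so they need the whitespace stripped before any processing. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be checked against the protein sequence and the GC content field. The helper names (strip_blocks, translate, gc_percent) are hypothetical. The codon table is built from the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code; alternative start codons are not handled.

```python
from itertools import product

# Amino-acid assignments for NCBI translation table 11, listed in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def strip_blocks(text):
    """Remove the spaces and line breaks used to print sequences in blocks."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGACACAGA GACCAGCGCT CAAGCTCAGC CAGTTGGGAC TGACCGCAAG AGCGGGCCTG"
last_line = "CAGGCCAGCA TGGCCGTCGC CGCCCTTGAC GGGAGAATGG TATGA"

print(translate(strip_blocks(first_line)))  # MTQRPALKLSQLGLTARAGL
print(translate(strip_blocks(last_line)))   # QASMAVAALDGRMV
```

Both printed fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to gc_percent gives a value that can be compared with the GC content field.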