Gene ECD_04101 details

Gene Information

Locus tag: ECD_04101
Symbol: mpl
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 4367400
End bp: 4368773
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: UDP-N-acetylmuramate:L-alanyl-gamma-D-glutamyl-meso-diaminopimelate ligase
Protein accession: ACT45890
Protein GI: 253980220
COG category:
COG ID:
TIGRFAM ID:
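
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 4367400
end_bp = 4368773

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints: 1374 457
```

Under these assumptions the computed values agree with the Gene Length (1374 bp) and Protein Length (457 aa) fields.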


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCATTC ATATTTTAGG AATTTGTGGC ACATTTATGG GCGGTCTGGC GATGCTGGCG 
CGCCAGTTAG GCCATGAAGT AACGGGTTCG GACGCCAATG TGTATCCGCC GATGAGCACC
TTACTTGAGA AGCAAGGCAT TGAGCTGATT CAGGGTTACG ATGCCAGCCA GCTCGATCCG
CAGCCGGATC TGGTGATTAT TGGCAACGCC ATGACCCGTG GAAATCCGTG TGTGGAAGCG
GTACTGGAAA AAAACATCCC TTATATGTCA GGTCCACAGT GGCTGCACGA TTTTGTGCTG
CGCGACCGCT GGGTGCTGGC CGTTGCCGGT ACACACGGCA AAACTACCAC CGCGGGAATG
GCGACCTGGA TTCTGGAACA GTGCGGTTAC AAACCGGGAT TTGTGATCGG CGGTGTGCCG
GGGAACTTTG AGGTTTCGGC GCGTCTGGGC GAAAGCGACT TCTTTGTTAT CGAAGCGGAT
GAGTATGACT GCGCCTTCTT CGACAAACGC TCTAAATTTG TCCATTACTG CCCGCGTACG
CTGATCCTCA ACAACCTTGA GTTCGATCAC GCCGATATCT TTGACGACCT GAAAGCGATC
CAGAAACAGT TCCACCATCT GGTGCGTATC GTTCCGGGGC AGGGCCGTAT TATCTGGCCG
GAAAATGACA TCAACCTGAA ACAGACCATG GCGATGGGCT GCTGGAGCGA GCAGGAGCTG
GTGGGTGAGC AGGGTCACTG GCAGGCGAAA AAGCTGACCA CCGATGCTTC CGAATGGGAA
GTTTTGCTGG ATGGCGAAAA AGTGGGCGAA GTGAAATGGT CGCTGGTAGG CGAACATAAT
ATGCACAATG GCCTGATGGC GATTGCAGCG GCTCGCCATG TTGGTGTAGC GCCGGCAGAT
GCCGCTAACG CGCTGGGTTC GTTTATTAAT GCTCGTCGCC GTCTGGAGTT GCGTGGTGAA
GCGAATGGTG TGACGGTATA TGACGATTTT GCCCATCACC CGACGGCGAT TCTGGCAACG
CTTGCGGCGC TGCGTGGCAA AGTTGGTGGT ACGGCGCGCA TTATTGCTGT GCTGGAACCG
CGCTCGAATA CCATGAAAAT GGGGATCTGC AAAGACGATC TGGCACCTTC ATTAGGTCGT
GCCGATGAAG TCTTCCTGCT GCAACCGGCG CATATTCCGT GGCAGGTGGC AGAAGTGGCA
GAAGCCTGCG TTCAGCCTGC ACACTGGAGT GGCGATGTGG ATACGCTGGC AGATATGGTG
GTGAAAACCG CTCAGCCTGG CGACCATATT CTGGTGATGA GCAACGGCGG TTTTGGTGGG
ATCCATCAGA AACTGCTGGA TGGTCTGGCG AAGAAGGCGG AAGCTGCGCA GTAA
 
Protein sequence
MRIHILGICG TFMGGLAMLA RQLGHEVTGS DANVYPPMST LLEKQGIELI QGYDASQLDP 
QPDLVIIGNA MTRGNPCVEA VLEKNIPYMS GPQWLHDFVL RDRWVLAVAG THGKTTTAGM
ATWILEQCGY KPGFVIGGVP GNFEVSARLG ESDFFVIEAD EYDCAFFDKR SKFVHYCPRT
LILNNLEFDH ADIFDDLKAI QKQFHHLVRI VPGQGRIIWP ENDINLKQTM AMGCWSEQEL
VGEQGHWQAK KLTTDASEWE VLLDGEKVGE VKWSLVGEHN MHNGLMAIAA ARHVGVAPAD
AANALGSFIN ARRRLELRGE ANGVTVYDDF AHHPTAILAT LAALRGKVGG TARIIAVLEP
RSNTMKMGIC KDDLAPSLGR ADEVFLLQPA HIPWQVAEVA EACVQPAHWS GDVDTLADMV
VKTAQPGDHI LVMSNGGFGG IHQKLLDGLA KKAEAAQ
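
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, together with a GC-fraction helper; `clean_sequence` and `gc_fraction` are hypothetical names introduced here, and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence shown above (sample input only).
first_row = "ATGCGCATTC ATATTTTAGG AATTTGTGGC ACATTTATGG GCGGTCTGGC GATGCTGGCG"
seq = clean_sequence(first_row)

# Report the joined length, whether it begins with ATG, and its GC fraction.
print(len(seq), seq.startswith("ATG"), round(gc_fraction(seq), 2))
```

Passing the full gene sequence through the same two helpers would give a figure comparable to the GC content field in the record.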