Gene B21_04065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_04065 
Symbolmpl 
ID8115413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp4365491 
End bp4366864 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content55% 
IMG OID644850213 
Producthypothetical protein 
Protein accessionYP_003001786 
Protein GI251787482 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0773] UDP-N-acetylmuramate-alanine ligase 
TIGRFAM ID[TIGR01081] UDP-N-acetylmuramate:L-alanyl-gamma-D-glutamyl-meso-diaminopimelate ligase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCATTC ATATTTTAGG AATTTGTGGC ACATTTATGG GCGGTCTGGC GATGCTGGCG 
CGCCAGTTAG GCCATGAAGT AACGGGTTCG GACGCCAATG TGTATCCGCC GATGAGCACC
TTACTTGAGA AGCAAGGCAT TGAGCTGATT CAGGGTTACG ATGCCAGCCA GCTCGATCCG
CAGCCGGATC TGGTGATTAT TGGCAACGCC ATGACCCGTG GAAATCCGTG TGTGGAAGCG
GTACTGGAAA AAAACATCCC TTATATGTCA GGTCCACAGT GGCTGCACGA TTTTGTGCTG
CGCGACCGCT GGGTGCTGGC CGTTGCCGGT ACACACGGCA AAACTACCAC CGCGGGAATG
GCGACCTGGA TTCTGGAACA GTGCGGTTAC AAACCGGGAT TTGTGATCGG CGGTGTGCCG
GGGAACTTTG AGGTTTCGGC GCGTCTGGGC GAAAGCGACT TCTTTGTTAT CGAAGCGGAT
GAGTATGACT GCGCCTTCTT CGACAAACGC TCTAAATTTG TCCATTACTG CCCGCGTACG
CTGATCCTCA ACAACCTTGA GTTCGATCAC GCCGATATCT TTGACGACCT GAAAGCGATC
CAGAAACAGT TCCACCATCT GGTGCGTATC GTTCCGGGGC AGGGCCGTAT TATCTGGCCG
GAAAATGACA TCAACCTGAA ACAGACCATG GCGATGGGCT GCTGGAGCGA GCAGGAGCTG
GTGGGTGAGC AGGGTCACTG GCAGGCGAAA AAGCTGACCA CCGATGCTTC CGAATGGGAA
GTTTTGCTGG ATGGCGAAAA AGTGGGCGAA GTGAAATGGT CGCTGGTAGG CGAACATAAT
ATGCACAATG GCCTGATGGC GATTGCAGCG GCTCGCCATG TTGGTGTAGC GCCGGCAGAT
GCCGCTAACG CGCTGGGTTC GTTTATTAAT GCTCGTCGCC GTCTGGAGTT GCGTGGTGAA
GCGAATGGTG TGACGGTATA TGACGATTTT GCCCATCACC CGACGGCGAT TCTGGCAACG
CTTGCGGCGC TGCGTGGCAA AGTTGGTGGT ACGGCGCGCA TTATTGCTGT GCTGGAACCG
CGCTCGAATA CCATGAAAAT GGGGATCTGC AAAGACGATC TGGCACCTTC ATTAGGTCGT
GCCGATGAAG TCTTCCTGCT GCAACCGGCG CATATTCCGT GGCAGGTGGC AGAAGTGGCA
GAAGCCTGCG TTCAGCCTGC ACACTGGAGT GGCGATGTGG ATACGCTGGC AGATATGGTG
GTGAAAACCG CTCAGCCTGG CGACCATATT CTGGTGATGA GCAACGGCGG TTTTGGTGGG
ATCCATCAGA AACTGCTGGA TGGTCTGGCG AAGAAGGCGG AAGCTGCGCA GTAA
 
Protein sequence
MRIHILGICG TFMGGLAMLA RQLGHEVTGS DANVYPPMST LLEKQGIELI QGYDASQLDP 
QPDLVIIGNA MTRGNPCVEA VLEKNIPYMS GPQWLHDFVL RDRWVLAVAG THGKTTTAGM
ATWILEQCGY KPGFVIGGVP GNFEVSARLG ESDFFVIEAD EYDCAFFDKR SKFVHYCPRT
LILNNLEFDH ADIFDDLKAI QKQFHHLVRI VPGQGRIIWP ENDINLKQTM AMGCWSEQEL
VGEQGHWQAK KLTTDASEWE VLLDGEKVGE VKWSLVGEHN MHNGLMAIAA ARHVGVAPAD
AANALGSFIN ARRRLELRGE ANGVTVYDDF AHHPTAILAT LAALRGKVGG TARIIAVLEP
RSNTMKMGIC KDDLAPSLGR ADEVFLLQPA HIPWQVAEVA EACVQPAHWS GDVDTLADMV
VKTAQPGDHI LVMSNGGFGG IHQKLLDGLA KKAEAAQ