Gene Mlg_0604 details

Gene Information

Locus tag: Mlg_0604
Symbol:
ID: 4268483
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 656075
End bp: 657436
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 71%
IMG OID: 638125351
Product: UDP-N-acetylmuramate
Protein accession: YP_741448
Protein GI: 114319765
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0773] UDP-N-acetylmuramate-alanine ligase
TIGRFAM ID: [TIGR01081] UDP-N-acetylmuramate:L-alanyl-gamma-D-glutamyl-meso-diaminopimelate ligase
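
The start, end, gene length, and protein length fields in the record above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 656075
end_bp = 657436

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon, which encodes no residue.
stop_codons = 1
protein_length = gene_length // 3 - stop_codons

print(gene_length)     # 1362
print(protein_length)  # 453
```

Under these assumptions the results match the listed Gene Length (1362 bp) and Protein Length (453 aa).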


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.197078
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000000266395
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCATCTAC ACATCCTGGG CATCTGCGGC ACCTTCATGG GGGGCCTGGC CCTGCTGGCG 
CGGGAGGCCG GTCACGTGGT GAGCGGTAGC GACCAGGGCA TCTACCCGCC CATGAGCGAC
ATGCTGGCGG AACAGGCCGT GGACCTGCGG GCCGGTTATG CCCCCTCACA CCTGCAACCG
CCCCCCGATC AGGTGATTGT CGGCAACGCC CTGTCGCGCG GTAATCCGGC GGTGGAGTAC
GTGCTGGACC AGGGCCTGCG CTACACCTCG GGGCCCCAGT GGCTGGGCGA GCACCTCCTC
CACGACCGCT GGGTCCTGGC GGTCTCCGGC ACCCACGGCA AGACCACCAC CGCCAGCCTG
TTGGCCTGGA TCCTGGAGTA CGCCGGATTG AACCCGGGCT TCCTGGTGGG CGGGGTGCCG
ACCAACTTCG GCCGCTCGGC CCGCCTCGGC GACGGGCCGT TTTTCGTCAT CGAGGCGGAT
GAATACGACA GCGCCTTTTT CGACAAGCGC TCCAAATTCA TCCACTTCCG CCCCCGCACC
CTGGTGATCC ACAACATCGA GTACGACCAT GCGGACATCT TCCCCGACCT GGCCGCCATC
CAGCGCCAGT TCCACCACCT GGTGCGCACC GTGCCGGGCA ACGGCCTGAT CATCGCCAAC
GGCGATCAGG CCAATGTTGC CGAGACCCTG GGCCAGGGCT GCTGGACCCC CACCCTGCGC
CTGGGCACCG GGCCCGACTG CGACTGGCGC TACGACCTCA ACGGGCAGGG CGAAATGGTG
CTGCGCGGCG GGGATGCCAC CCCGCTGACC GCCCGACCAC CGCTGCCCGG CCTGCACAAC
GCGGCCAACT GCGCCGCCGC ATTGCTCGCA GCCCGCCACG TCGGGGTCCC ACTGAGCACC
GGGCTCGACG CACTGGCCGG CTTCCGCGGC GTGAAGCGGC GCCTGGAGCT GCGGGGTGAG
GCCGGCGGCG TGCGGGTGTA CGACGATTTC GCCCACCACC CCACCGCCAT CCGCGCCACC
CTGGAGGCCA TGCGGCCCGG CCCGGGGCGA TTGCTGGCAG TGCTGGAGCC CCGCTCCAAC
ACCATGCGCA TGGGCATCCA CCGGGAACGG CTGGCCGCCG CCCTCGCCCC CGCCGACGCC
GTCTTCGCCC TGCAGGGCAA GGGCCTGGAG TGGTCGGTGG CGGACGCCCT GGCCGGGCTC
ACCCCGCCCG CCGAGGTGGC GCAGGACGTG CCGGCGCTGG TCCAGCGCAT CCGGCAACAG
GCCCGCCCGG GGGACCGGGT GGTGGTGATG AGCAACGGCG CCTTCGACGG TCTGCACGGC
CGCCTGCTGG CGGCCCTGGA TGGCCGGGAG GTCTCGGCAT GA
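
A GC content figure such as the one reported for this gene is the share of G and C bases in the nucleotide sequence. The following is a minimal sketch of that calculation, not the method the database used; the function name `gc_content` is hypothetical. It strips the spaces and line breaks that group the sequence into blocks of ten before counting.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the whitespace used to lay the sequence out in blocks of ten.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence, used here only as a short demonstration;
# pass the full sequence to get the gene-wide value.
first_line = "ATGCATCTAC ACATCCTGGG CATCTGCGGC ACCTTCATGG GGGGCCTGGC CCTGCTGGCG"
print(round(gc_content(first_line), 1))
```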
 
Protein sequence
MHLHILGICG TFMGGLALLA REAGHVVSGS DQGIYPPMSD MLAEQAVDLR AGYAPSHLQP 
PPDQVIVGNA LSRGNPAVEY VLDQGLRYTS GPQWLGEHLL HDRWVLAVSG THGKTTTASL
LAWILEYAGL NPGFLVGGVP TNFGRSARLG DGPFFVIEAD EYDSAFFDKR SKFIHFRPRT
LVIHNIEYDH ADIFPDLAAI QRQFHHLVRT VPGNGLIIAN GDQANVAETL GQGCWTPTLR
LGTGPDCDWR YDLNGQGEMV LRGGDATPLT ARPPLPGLHN AANCAAALLA ARHVGVPLST
GLDALAGFRG VKRRLELRGE AGGVRVYDDF AHHPTAIRAT LEAMRPGPGR LLAVLEPRSN
TMRMGIHRER LAAALAPADA VFALQGKGLE WSVADALAGL TPPAEVAQDV PALVQRIRQQ
ARPGDRVVVM SNGAFDGLHG RLLAALDGRE VSA
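
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates this with a hand-built codon table rather than a bioinformatics library; `translate` and the table constants are hypothetical names. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the set of permitted start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order: first, second, and third base each cycle
# through T, C, A, G. "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    # Remove the whitespace used to lay the sequence out in blocks of ten.
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence.
print(translate("ATGCATCTAC ACATCCTGGG CATCTGCGGC"))  # MHLHILGICG
print(translate("GTCTCGGCAT GA"))                     # VSA
```

The two outputs match the first ten and last three residues of the protein sequence listed above.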