Gene Mlg_0436 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0436 
SymbolanmK 
ID4268289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp483803 
End bp484954 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content73% 
IMG OID638125166 
Productanhydro-N-acetylmuramic acid kinase 
Protein accessionYP_741280 
Protein GI114319597 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2377] Predicted molecular chaperone distantly related to HSP70-fold metalloproteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.231564 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.0404466 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGCCG GAGCGGCATC GGCCCCGCGC GACGGGCTCT ACCTGGGGCT GATCTCCGGC 
ACCAGCATCG ATGCGGTGGA CGCCGCCCTG GTGGAGATCC GGGGCGGGCA GCCGCGGTTG
TGTCGGGCCC TGGGTCACCC CATCCCCGGG CCATTGGCTT CCGCATTGCA CCGGGTGGAT
GCCCAAACCC CCCTCGACAC CCTGCTCGAT CTGGACCAGC AGGTGGCCCG GCTGCACGCG
GAGGCCGCCC GCCGGTTGCT GTCCGAGGCC AAAACCGGTG CCGCGGAGGT CATCGCCATC
GGCAGTCACG GGCAAACGGT TTATCACCGC CCCCACGGCC CCTACCCCAC CACCGTCCAA
TTGGGGGACC CCTCCCGGCT CGCGGCGGAG ACCGGGATCA CCACGGTCGC CGACTTCCGC
CGCCGGGACA TGGCCCTGGG TGGCCAGGGC GCGCCCCTGG TTCCGGCCTT TCACGCCGCT
TGCCTGCGGC AAGCCGGGGA GGATCGCGCG GTGCTCAACC TGGGGGGTAT CGCCAACCTC
ACGCTCCTGC CAGGCACCGA CACGGCACCG GTCACCGGGT TCGACACCGG CCCCGCCAAC
ACCCTGCTCG ACGCCTGGTT CCGGCAGCAC CGGGACGGGA CCTACGACCG GGATGGGGCC
TGGGCCGCGG GGGGCGCGCT GCACACCGGG CTGCTCCGGC GGCTGCTGAA CGATGACTAC
CTGAAACGGC CACCGCCGAA AAGCACCGGC CCGGAATACT TCAGCCCCGA CTGGCTGCAC
CGACAACTGG ATGCGTTACC GGGCGCCCCA CCGGCTCCGC AGGACGTGCA ACGAACCCTG
CTGGCCTTTA CCGCCCAGAG CGCGGTTGCA GCGCTGGCCG AGGCCCTGCC CGGTGTGCGC
CAGCTGTATA TCTGTGGCGG CGGCATCCAC AACACCGCCT TGTGGCGGGC GCTGGCGGCG
GCGCTGGCGT CCCGGTGCCC CGGCTGCCAG CTGACCCCCA CCACGGAGGC CGGACTCGAC
CCGGACTGGC TGGAGGCGAT GGCCTTCGCC TGGCTGGCCT ACCGAACCCT CGCCGGCCTG
CCCGGCAACC TGCCCGAGGT CACCGGGGCG CGCCAGGCCG CGCCGCTGGG TGGGATCTTC
CCCGCGGGCT GA
 
Protein sequence
MNAGAASAPR DGLYLGLISG TSIDAVDAAL VEIRGGQPRL CRALGHPIPG PLASALHRVD 
AQTPLDTLLD LDQQVARLHA EAARRLLSEA KTGAAEVIAI GSHGQTVYHR PHGPYPTTVQ
LGDPSRLAAE TGITTVADFR RRDMALGGQG APLVPAFHAA CLRQAGEDRA VLNLGGIANL
TLLPGTDTAP VTGFDTGPAN TLLDAWFRQH RDGTYDRDGA WAAGGALHTG LLRRLLNDDY
LKRPPPKSTG PEYFSPDWLH RQLDALPGAP PAPQDVQRTL LAFTAQSAVA ALAEALPGVR
QLYICGGGIH NTALWRALAA ALASRCPGCQ LTPTTEAGLD PDWLEAMAFA WLAYRTLAGL
PGNLPEVTGA RQAAPLGGIF PAG