Gene Mkms_5493 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_5493 
Symbol 
ID4613177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp5735181 
End bp5736377 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content68% 
IMG OID639795187 
Productpeptidoglycan binding domain-containing protein 
Protein accessionYP_941468 
Protein GI119871516 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0860] N-acetylmuramoyl-L-alanine amidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.306054 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.340156 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAGTC TGCGTCACGG TGATCGCGGA GCCGCGGTCA CCGAGATCCG CGCAGCGCTC 
TCCGCTCTGG GCCTGCTGGA CAGCCCCGAC GACGACCTCA CCACCGGAAG ACATGTCGTC
GCAGACCTGT TCGACGACCA TCTCGACCAG GCGGTCCGCG CCTTCCAGCA GCACCGCGGG
CTCCTGGTCG ACGGCATCGT CGGTGAGGCC ACGTACCGGG CGCTGAAAGA GGCGTCCTAC
CGCCTCGGCG CACGTACGTT GATGCATCAG TTCGGCGCCC CGATGTACGG CGACGATGTC
GCGACACTGC AGGCGCGCCT GCAGGATCTC GGGTTCTACA CTGGTCTGGT CGACGGGCAC
TTCGGGCTGC AGACCCATCA CGGGCTCACC TCCTACCAAC GCGAGTACGG GCTCTATCCC
GACGGCATCT GCGGTCCGGA GACCCTGCGC TCGCTGTACT TCCTCGGGTC ACGCGTGACC
GGCGGTTCGC CGCACGCGAT CCGGGAGGAG GAGCTCGTCC GCCGCTCCGG TCCGCGACTG
TCGGGCAAGC GGGTCATCAT CGATCCGGGC CGCGGCGGCA GCGACCACGG CCTCATCATG
AACGGGCCCC AGGGTCCGAT CAGCGAAGCA GACATCCTGT GGGACTTGGC AAGTCGCCTC
GAGGGCCGGA TGACCGCGAT CGGGATGGAC ACGTTCCTGT CCCGCCCGGC CAACCGCAGC
CCCTCCGACG CCGAACGCGC CGCGACTGCC AACACCGTCG GCGCGGACCT GATGATCAGC
CTGCGCTGCG CCGCACAGCC CACCCCGGCC GCGAACGGCG TCGCGTCGTT CCACTTCGGC
AACTCACACG GGTCGGTCTC CACCATCGGA CGCAATCTCG CCGACTTCAT CCAGCGAGAG
GTCGTCGCCC GCACCGGATT ACGGGACTGC CGCACCCACG GCCGGACCTG GGATCTGCTG
CGTCTGACCC GGATGCCCAC GGTGCAGGTC GATGTCGGCT ACATCACCAA CCCCCGTGAC
CGGGAGCTGT TGGTGTCCAA CCACAATCGG GACGCGATCG CCGAAGGTAT CCTCGCCGCG
GTCAAGCGGC TCTATCTGCT CGGTAAGAAC GACCGCCCCA CAGGGACATT CACCTTCGCC
GAACTGCTGG CGCACGAACT GTCCGTCGAA CAGGCCGGCC GGCTCAGCTC GAACTGA
 
Protein sequence
MSSLRHGDRG AAVTEIRAAL SALGLLDSPD DDLTTGRHVV ADLFDDHLDQ AVRAFQQHRG 
LLVDGIVGEA TYRALKEASY RLGARTLMHQ FGAPMYGDDV ATLQARLQDL GFYTGLVDGH
FGLQTHHGLT SYQREYGLYP DGICGPETLR SLYFLGSRVT GGSPHAIREE ELVRRSGPRL
SGKRVIIDPG RGGSDHGLIM NGPQGPISEA DILWDLASRL EGRMTAIGMD TFLSRPANRS
PSDAERAATA NTVGADLMIS LRCAAQPTPA ANGVASFHFG NSHGSVSTIG RNLADFIQRE
VVARTGLRDC RTHGRTWDLL RLTRMPTVQV DVGYITNPRD RELLVSNHNR DAIAEGILAA
VKRLYLLGKN DRPTGTFTFA ELLAHELSVE QAGRLSSN