Gene Smed_4990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4990 
Symbol 
ID5318711 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1503718 
End bp1504719 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content63% 
IMG OID640776772 
Productglycoside hydrolase family protein 
Protein accessionYP_001313704 
Protein GI150377108 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3757] Lyzozyme M1 (1,4-beta-N-acetylmuramidase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.427565 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00324105 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGTCATT TTTCAGTTGT TTCAGCGATC GTCATATTTG GTCTGGTACT GTCGGGCTGC 
GGCTTCGGGG GTGATCGTCC CTCGCGGGAG ACGACGAGCT CCGTGCGTCC GGCAAGTCCG
GTTCCGTCAG CGGAAGTCAG CGCCGTTGCC GCGAAAAGCG CGCCGGAGGC GCTCGCCTGG
GCGGGGCCGG TGCCGGAGCC CCAGGCTTTT ACGTCCGCGC AGAGGACCGT CGGCATGCCC
GTGCCCGCCG AGCGTCCGCT TGCGATGCTC GCTCCGGAAA ATCCCGCGGG GCCTCGGGGC
GGCCGCACAC GCGTTTACAG CCACAGCTTC CGTGATGCCC ATCCGATCAA CTTCGGCAAG
AGATCACCGC GCAAAATGGC CGTTCATGGC GTCGACGTTT CGCGTTGGCA GGGAGACATA
AACTGGGCGA AGCTGCGCAG CCAGGGCGCC AACTTCGCCT ATATCAAGGC GACCGATGGC
GGCGATCATC TCGATCCCAT GTTCAAGAAG AACTGGCGCC GGGCGAACGA AGCCGGACTG
AAACGCGGCG CCTATCACTT CTTCTACTGG TGCCGCACGG CCGGCGAGCA GGCTGACTGG
TTCATTCGCA ACGTGCCGCG GGATCCGACC GCGCTTCCCC CCGTGATCGA TGTCGAATGG
AACGGTGAGT CGAGCTGCAA ACGGCGCCCC TCGCGCGAGC GGGTGCTCGA AAAGATGCAG
GTCTTCATGG ACAAGCTGGA GCGGCATTAC GGCCAGCGCC CGATCATCTA CACCGCGCCT
GACTTCTACC GCGACAATCT GCAGGGCGCG TTTCCCAATC ATCCCTTCTG GCTGCGCTCG
GTCGCGGCCC ATCCGTCCAA GGTTTATCCC GGACGCAAAT GGGTGTTCTG GCAATATTCG
GGTTCGGGTC TGTCCCACGG CGTCGAGGGG CGGATCGATC TCAACGTCTT CAACGGCAGC
GAGGAGGATT GGCACAATTG GGTGGCGGCG CGCTCGAGTT GA
 
Protein sequence
MRHFSVVSAI VIFGLVLSGC GFGGDRPSRE TTSSVRPASP VPSAEVSAVA AKSAPEALAW 
AGPVPEPQAF TSAQRTVGMP VPAERPLAML APENPAGPRG GRTRVYSHSF RDAHPINFGK
RSPRKMAVHG VDVSRWQGDI NWAKLRSQGA NFAYIKATDG GDHLDPMFKK NWRRANEAGL
KRGAYHFFYW CRTAGEQADW FIRNVPRDPT ALPPVIDVEW NGESSCKRRP SRERVLEKMQ
VFMDKLERHY GQRPIIYTAP DFYRDNLQGA FPNHPFWLRS VAAHPSKVYP GRKWVFWQYS
GSGLSHGVEG RIDLNVFNGS EEDWHNWVAA RSS