Gene Smed_0721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0721 
Symbol 
ID5321558 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp773082 
End bp774800 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content66% 
IMG OID640789658 
Productpeptidoglycan-binding LysM 
Protein accessionYP_001326412 
Protein GI150395945 
COG category[S] Function unknown 
COG ID[COG1652] Uncharacterized protein containing LysM domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.467953 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAAGA ATAAGGCCAG CTGGGTGGCC CTTGGCGTCC TGGCAATCGC AACCGCATTG 
ATGGTTTTTG TGGTTCAGCC AAACCTGCGC GAAAGCGACA AGGAGGCGGT GGCTCAATCC
GGGACAGGCG GCCAGCCTGC CCGACCGGCG GCAACAGACG CCGGAAGCTC GTCCGCCGTC
CCGCTAGGCA CAGCTGCGAA ACAGGCGGCA GCTGGCTCCC CGAATCAAGT CGCATCGTCT
GCGTCTGATC CGGCGACGGC CTGGGTGGTT CCGGGGTTCG ACGTGCTGCG CGTCGAACCG
GACGGCTCGA CGGTCGTCGC GGGCAGGGCC CAGCCGAATA CGAAGCTCGA AATTCTGAGC
GGCGACGCGG TCGTCGGTAC CGCGGACGTG GGCGCTGGTG GCGATTTCGC CGCCGTCTTC
GACAAGCCGC TCCCGGCGGG CGACCACCAG CTGACGCTCA GAAGCGTCGG CTCGGCCGGT
CAGAGCAAGT CGTCGGAAGA GGTTGCCACC GTTTCAGTGC CGAAGGATGC GAGTGGCCAG
TTGCTGGCGA TGGTCTCGAG GCAGGGCAAG GCGAGCCGGC TGATAACGAC GCCGGACGCT
GAGCCAAAGC CGGTGCAGGA GGCCGTCGCC ACGGGTATGG CGCCATCCGG TGAAACCGGT
GGAGCTTCCG ACACGTCCGC CGCTCCGGCC ACCGTTCCCG GACTGCAGGT CACCGCGGTC
GAGATCGAAG GCAGCATGAT GTATGTCGCA GGCAACGCCA AGCCCGGGGC GCTTGTCCGC
ATTTATGCGA ACGACCAACT GTTGGGTGAG GTGGAAGCCG ACGACAAAGG GCATTTCGTC
GTCGATGGCC CGATTGAACT CTCGATCGGC AGCCACATCA TTCGTGCCGA CATGATGAAT
GAAGACGCGA GCAAGGTGGC GATGCGTGCT TCCGTCCCCT TCGACAGGCC GGAAGGAGCC
CAGGTCGCCG CCATTGCCGG TACGACGCTC GGAGCGCCCA CTGCGGGTCT CGACAGGCTG
AAGGCGGAAG CGGGCAAGGC GCTGACACTG CTGAAAGGGC TCTTCTCCGG CGGGAAGCAA
CCCTCCACGG AGCAACTTGC CGCAGCCCGC TCGGCAACCG AATTCGCCCT TCAGTCGCTG
GCCGAATTCA AGCCGGCCGA CACCTCCGAT CCCGCGCTGG CTGCGGCGGC CGCCGAGGCT
TCGGACGCGG CCTCGACGGC GCTGGCGGCG CTAAAGGCCT CGCCGCAGGA CGCTGCAAGC
GTTGCGGCGG CGGTGGAGAA GGTGGATGGC GCTCTCGGGT CCGCATTGAC GCGCCAGAAT
GTCAGCACGT CGGTGGCTTC CGCCGAGCCG CTGGCCATGC CGACGAGCGA ACTCTCCCGG
ACAGCCGCCC CTCCGGCAGC CGGTGATGCG GCCGCACCTG CGGCCGAGCC AGCAGCGGCG
ACCGGTTTGA ATGCTGCAAC GGAACAGCCC GAAACGATCG AGCAGGCGCC ACTCAAGGAA
AGCAAGACGT CGGTGATCAT CCGCCGCGGC GACACGCTCT GGCAGATCTC GCGCCGTGTC
TACGGTGCCG GTCTGCGCTA CACCACGATC TATCTTGCAA ACCGCGAGCA GATCGAAAAC
CCGGACCTGA TCCGGCCCGG TCAGGTTTTT GGGGTTCCGG ACGAGATGCT TACCGAGGAA
GAATCGCGGG AAATTCACCG CAAGCATATG CGGCACTAA
 
Protein sequence
MIKNKASWVA LGVLAIATAL MVFVVQPNLR ESDKEAVAQS GTGGQPARPA ATDAGSSSAV 
PLGTAAKQAA AGSPNQVASS ASDPATAWVV PGFDVLRVEP DGSTVVAGRA QPNTKLEILS
GDAVVGTADV GAGGDFAAVF DKPLPAGDHQ LTLRSVGSAG QSKSSEEVAT VSVPKDASGQ
LLAMVSRQGK ASRLITTPDA EPKPVQEAVA TGMAPSGETG GASDTSAAPA TVPGLQVTAV
EIEGSMMYVA GNAKPGALVR IYANDQLLGE VEADDKGHFV VDGPIELSIG SHIIRADMMN
EDASKVAMRA SVPFDRPEGA QVAAIAGTTL GAPTAGLDRL KAEAGKALTL LKGLFSGGKQ
PSTEQLAAAR SATEFALQSL AEFKPADTSD PALAAAAAEA SDAASTALAA LKASPQDAAS
VAAAVEKVDG ALGSALTRQN VSTSVASAEP LAMPTSELSR TAAPPAAGDA AAPAAEPAAA
TGLNAATEQP ETIEQAPLKE SKTSVIIRRG DTLWQISRRV YGAGLRYTTI YLANREQIEN
PDLIRPGQVF GVPDEMLTEE ESREIHRKHM RH