Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_0721 |
Symbol | |
ID | 5321558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 773082 |
End bp | 774800 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640789658 |
Product | peptidoglycan-binding LysM |
Protein accession | YP_001326412 |
Protein GI | 150395945 |
COG category | [S] Function unknown |
COG ID | [COG1652] Uncharacterized protein containing LysM domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.467953 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTAAGA ATAAGGCCAG CTGGGTGGCC CTTGGCGTCC TGGCAATCGC AACCGCATTG ATGGTTTTTG TGGTTCAGCC AAACCTGCGC GAAAGCGACA AGGAGGCGGT GGCTCAATCC GGGACAGGCG GCCAGCCTGC CCGACCGGCG GCAACAGACG CCGGAAGCTC GTCCGCCGTC CCGCTAGGCA CAGCTGCGAA ACAGGCGGCA GCTGGCTCCC CGAATCAAGT CGCATCGTCT GCGTCTGATC CGGCGACGGC CTGGGTGGTT CCGGGGTTCG ACGTGCTGCG CGTCGAACCG GACGGCTCGA CGGTCGTCGC GGGCAGGGCC CAGCCGAATA CGAAGCTCGA AATTCTGAGC GGCGACGCGG TCGTCGGTAC CGCGGACGTG GGCGCTGGTG GCGATTTCGC CGCCGTCTTC GACAAGCCGC TCCCGGCGGG CGACCACCAG CTGACGCTCA GAAGCGTCGG CTCGGCCGGT CAGAGCAAGT CGTCGGAAGA GGTTGCCACC GTTTCAGTGC CGAAGGATGC GAGTGGCCAG TTGCTGGCGA TGGTCTCGAG GCAGGGCAAG GCGAGCCGGC TGATAACGAC GCCGGACGCT GAGCCAAAGC CGGTGCAGGA GGCCGTCGCC ACGGGTATGG CGCCATCCGG TGAAACCGGT GGAGCTTCCG ACACGTCCGC CGCTCCGGCC ACCGTTCCCG GACTGCAGGT CACCGCGGTC GAGATCGAAG GCAGCATGAT GTATGTCGCA GGCAACGCCA AGCCCGGGGC GCTTGTCCGC ATTTATGCGA ACGACCAACT GTTGGGTGAG GTGGAAGCCG ACGACAAAGG GCATTTCGTC GTCGATGGCC CGATTGAACT CTCGATCGGC AGCCACATCA TTCGTGCCGA CATGATGAAT GAAGACGCGA GCAAGGTGGC GATGCGTGCT TCCGTCCCCT TCGACAGGCC GGAAGGAGCC CAGGTCGCCG CCATTGCCGG TACGACGCTC GGAGCGCCCA CTGCGGGTCT CGACAGGCTG AAGGCGGAAG CGGGCAAGGC GCTGACACTG CTGAAAGGGC TCTTCTCCGG CGGGAAGCAA CCCTCCACGG AGCAACTTGC CGCAGCCCGC TCGGCAACCG AATTCGCCCT TCAGTCGCTG GCCGAATTCA AGCCGGCCGA CACCTCCGAT CCCGCGCTGG CTGCGGCGGC CGCCGAGGCT TCGGACGCGG CCTCGACGGC GCTGGCGGCG CTAAAGGCCT CGCCGCAGGA CGCTGCAAGC GTTGCGGCGG CGGTGGAGAA GGTGGATGGC GCTCTCGGGT CCGCATTGAC GCGCCAGAAT GTCAGCACGT CGGTGGCTTC CGCCGAGCCG CTGGCCATGC CGACGAGCGA ACTCTCCCGG ACAGCCGCCC CTCCGGCAGC CGGTGATGCG GCCGCACCTG CGGCCGAGCC AGCAGCGGCG ACCGGTTTGA ATGCTGCAAC GGAACAGCCC GAAACGATCG AGCAGGCGCC ACTCAAGGAA AGCAAGACGT CGGTGATCAT CCGCCGCGGC GACACGCTCT GGCAGATCTC GCGCCGTGTC TACGGTGCCG GTCTGCGCTA CACCACGATC TATCTTGCAA ACCGCGAGCA GATCGAAAAC CCGGACCTGA TCCGGCCCGG TCAGGTTTTT GGGGTTCCGG ACGAGATGCT TACCGAGGAA GAATCGCGGG AAATTCACCG CAAGCATATG CGGCACTAA
|
Protein sequence | MIKNKASWVA LGVLAIATAL MVFVVQPNLR ESDKEAVAQS GTGGQPARPA ATDAGSSSAV PLGTAAKQAA AGSPNQVASS ASDPATAWVV PGFDVLRVEP DGSTVVAGRA QPNTKLEILS GDAVVGTADV GAGGDFAAVF DKPLPAGDHQ LTLRSVGSAG QSKSSEEVAT VSVPKDASGQ LLAMVSRQGK ASRLITTPDA EPKPVQEAVA TGMAPSGETG GASDTSAAPA TVPGLQVTAV EIEGSMMYVA GNAKPGALVR IYANDQLLGE VEADDKGHFV VDGPIELSIG SHIIRADMMN EDASKVAMRA SVPFDRPEGA QVAAIAGTTL GAPTAGLDRL KAEAGKALTL LKGLFSGGKQ PSTEQLAAAR SATEFALQSL AEFKPADTSD PALAAAAAEA SDAASTALAA LKASPQDAAS VAAAVEKVDG ALGSALTRQN VSTSVASAEP LAMPTSELSR TAAPPAAGDA AAPAAEPAAA TGLNAATEQP ETIEQAPLKE SKTSVIIRRG DTLWQISRRV YGAGLRYTTI YLANREQIEN PDLIRPGQVF GVPDEMLTEE ESREIHRKHM RH
|
| |