Gene Smed_2498 details


Gene Information

Locus tag: Smed_2498
Symbol:
ID: 5323365
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 2590558
End bp: 2592498
Gene Length: 1941 bp
Protein Length: 646 aa
Translation table: 11
GC content: 62%
IMG OID: 640791440
Product: peptidase M23B
Protein accession: YP_001328163
Protein GI: 150397696
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0739] Membrane proteins related to metalloendopeptidases
TIGRFAM ID:
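
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of the source record.

```python
# Values copied from the Gene Information table.
START_BP = 2590558
END_BP = 2592498
GENE_LENGTH_BP = 1941
PROTEIN_LENGTH_AA = 646


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)     # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Both checks pass for this record: 2592498 - 2590558 + 1 = 1941 bp, and 1941 / 3 - 1 = 646 aa.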


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.13296
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAGCG ATAAGAACAT GCTTCGTTCG CTTGGCGACC ACCCGCCGAT CCTTGCGGAC 
GGCCGTCGGG CGCCGGATCG GCGCGAGATT TCGCTGCGCT GGCTCTCGGG CACGTTCTTG
ACCGGTATCA CATCGAGCCT TCTTATGGGC GTCGCGCTCT TTGCCGCCCT CGACGGCCGT
CAGCAACTGG CGATCCCGGC GGAAGCCTTC GCCGCGCTCG CCCCCTCCGC GCCCGCTGGC
GCCGCCAAAC GTGGAGACCG CGTCCTTTCA CCCAACATCG TCGCCAAGCC GGCCGACAAG
ACCGTGATGG AAGTCTCGAC AATGATCCAT GACGGCGAAA AAGAGGTGGT GCGCCGTCAG
CCTTTCGTGC ACGTGAAAAT GGCGCTTGCG GCCAACCACC AGACGTCCGA GTCCTATCCC
GACTTCGACC CGCTGGCCAT CTTCTCCACG GATGAGGCGG ACGCCGAGGC GGCGCCGGCA
AAGACCGGCA CGATCTACGG CTCGGACGTC GAGTCCGAGG TTGCGCTGAA GACCGTTGAC
TTTCCGCTCG ATGGCGGCGG CTTCAAAGTC GGACCGTCGA TGTCGCTCGA CGAGGTGGAG
GAGAATGTCC GCACCAACGG TTCGGTGCTG ACCGAGGGCA GCACCCAGGT CGCCTCGCTC
TTCTATGTCG ACCCCCAGCG CTTTGCGTCC GACGCCGACA ATCTCGACCT GATGCAGGGC
CTTGCCGCGC GCGTCGTCGA GGAGAATATG AGCGTCTCCT CCTATGAGAA CATCACCAAG
CAAAGCACCG AATACGCCGA CGACATCATA CCGGTACGCC GCGCACTGCC CATTGCGAAG
GTGATGACCA ATGCCGGCTA TGCCGAAGCC CAGGCCGAGG ACGCAGCCGG CTATCTCGGC
GAAGCGCTCG GCGCCAAGAA TCTTGGCCCC GGCGACGTGC TACGCATTGG CATTATTCAA
AAGGGCGAGG AAGCAAAGAT CGTCCGCGCG ACGATCTATT CGAACAGCCG GCACGTCCTG
ACCATGGCGG TGGACGACCG CGGCCGTTTC GTTCCCGGCG CCGAGCCCCC GAAGCTCGAA
GCGGTAGCCG CTGCCTTCGA CGACACCGGC AGGCCGGTGA TCTCAAGCAG CCACGACCTG
CCTCGGGTTT ATGACGGAAT CTATCGTGCA GCCCTATCCT ACGGAATGAA CGCCGATATG
ATCGCGCTCG TCGTCAAGCT CTTGGCGAGC AACGTCGATT TCCAGGCCCA ACTGAAGCCG
GCGGATTCGC TCGAAGCCTT TTTCTCCGTT ACCGATGAAA GCGGCCGGGC AACGGAAGAT
TCTGAACTGC TCTACGTCAA TGCCAAATTC GGCGACGCCG AGACACGGTT CTATCGCTTC
CAGGATCCGG ACGATAATTC GATCGACTAT TTCGACAAGG ACGGCAAGAG CATCCGCCAG
TTCCTGCTTC GCAACCCCGT TCCGAACGGT CGTTTCCGCT CCGGCTTCGG AATGCGTCGC
CACCCGATTC TCGGCTTCTC ACGCATGCAC ACGGGCGTCG ATTGGTCAGC CCCGCGCGGC
ACGCCGATCA TCGCCGCCGG CAATGGCGTC GTGCAGAAGG CCGGCTGGGA TTCCGGCGGC
TATGGCAACC AGACCCTCAT CCGCCACGCC AACGGCTACG TCTCGTCCTA TAACCACCAA
AGCGCAATTG CCAAAAGCGT CAAGCCGGGG GCAAAGGTGG TCCAGGGCCA GGTGATCGGC
TGGGTCGGCA CGACCGGCCT TTCGACCGGT CCACACCTCC ATTACGAACT GATCGTCAAT
GGCAACAAGG TGGACCCCTT GCGCATTCGG CTGCCGGGCG GCAAATCGCT TGCCGGAGAA
GCCTTGGCGC AATTCGAAAA GGAACGCGAA CGCATCGACG AGCTTCTCGG CGACGATGCG
AACGAAGTCG CAAGCAAGTA G
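
The gene sequence is listed in space-separated blocks of 10 nucleotides, so it must be joined before its length or GC content can be recomputed. The sketch below shows one way to do that; `clean_sequence` and `gc_content` are hypothetical helper names, and the whole listing would be pasted in as a single string to compare the result with the reported length (1941 bp) and GC content (62%).

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here only as a short example.
first_line = (
    "ATGAGCAGCG ATAAGAACAT GCTTCGTTCG "
    "CTTGGCGACC ACCCGCCGAT CCTTGCGGAC"
)
print(len(clean_sequence(first_line)))  # 60
print(round(gc_content(first_line), 1))
```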
 
Protein sequence
MSSDKNMLRS LGDHPPILAD GRRAPDRREI SLRWLSGTFL TGITSSLLMG VALFAALDGR 
QQLAIPAEAF AALAPSAPAG AAKRGDRVLS PNIVAKPADK TVMEVSTMIH DGEKEVVRRQ
PFVHVKMALA ANHQTSESYP DFDPLAIFST DEADAEAAPA KTGTIYGSDV ESEVALKTVD
FPLDGGGFKV GPSMSLDEVE ENVRTNGSVL TEGSTQVASL FYVDPQRFAS DADNLDLMQG
LAARVVEENM SVSSYENITK QSTEYADDII PVRRALPIAK VMTNAGYAEA QAEDAAGYLG
EALGAKNLGP GDVLRIGIIQ KGEEAKIVRA TIYSNSRHVL TMAVDDRGRF VPGAEPPKLE
AVAAAFDDTG RPVISSSHDL PRVYDGIYRA ALSYGMNADM IALVVKLLAS NVDFQAQLKP
ADSLEAFFSV TDESGRATED SELLYVNAKF GDAETRFYRF QDPDDNSIDY FDKDGKSIRQ
FLLRNPVPNG RFRSGFGMRR HPILGFSRMH TGVDWSAPRG TPIIAAGNGV VQKAGWDSGG
YGNQTLIRHA NGYVSSYNHQ SAIAKSVKPG AKVVQGQVIG WVGTTGLSTG PHLHYELIVN
GNKVDPLRIR LPGGKSLAGE ALAQFEKERE RIDELLGDDA NEVASK
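
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the gene sequence is given in the coding orientation and using the standard codon assignments, which translation table 11 shares with table 1 for amino acids (the two tables differ in their permitted start codons). The `translate` function is illustrative, not the tool that generated this record.

```python
# Codon table built in NCBI order: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above.
first_line = (
    "ATGAGCAGCG ATAAGAACAT GCTTCGTTCG "
    "CTTGGCGACC ACCCGCCGAT CCTTGCGGAC"
)
print(translate(first_line))  # MSSDKNMLRSLGDHPPILAD
```

The output matches the first 20 residues of the protein sequence. The final codons of the gene, `GCA AGC AAG TAG`, translate to `ASK` followed by a stop, which agrees with the C-terminal end of the listing.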