Gene Smed_2684 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2684 
Symbol 
ID5323553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2786895 
End bp2788778 
Gene Length1884 bp 
Protein Length627 aa 
Translation table11 
GC content62% 
IMG OID640791628 
ProductpepF/M3 family oligoendopeptidase 
Protein accessionYP_001328349 
Protein GI150397882 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID[TIGR02290] oligoendopeptidase, pepF/M3 family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.667149 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTCG CTTCGCCACG TCTCGCCGCC TCCGGGCCCA CTTTGCTGCG TGCCGCGCTG 
GCGAACTCCG GGGCTGATCA GGCAGCCTCG GCGCCGGCCG AACTCGGCGA CCTGCCGTCC
TGGCGGCTAA CGGACCTCTA CCTTTCGCCA TCCTCCGAGG AATTCCGCTC CGACCTCGCC
AGGGCCGAGG CCGACGCGGT CGCCTTTGAG GCTAAGTGGA AGGGCAAGTT GGACCAGGCC
GCCGGCCGAA CGGGTGATGA GGGTCTCGGC GCCGCCGTGA ATGAGTTCGA GGCACTGGAC
GACCTCATGG GCCGCATCGC TTCGTTTGCC GGGCTCACCT ATTTTTCCGA CACTTCGAAT
CCGGCGAACG GCAAGCTCTA CGGCGACATC CAGTCGAAGC TTACCGATAT TTCGGCGCAT
CTCCTGTTCT TCTCGCTGGA GCTCAACCGC ATCGACGACA AGCGCGTCGA TGCGGCACTC
GCGGCCGATG GCCTTGCCGC CCACTATGCC CCGTGGATTC TCGATCTGCG CAAGGACAAG
CCCTATCAGC TCGACGACAG GCTGGAGCAA CTCTTCCTTG AAAAAACGAT GACTGGCGCA
AACGCCTTCA ACCGGCTCTT CGACGAAACG ATCGCATCTC TGACCTTTTC GGTGGACGGC
AAGGAGCAGC CGCTGGAAGT GACGCTCAAC CTTCTGCAGG ATTCCTCCGT AGAGGTGCGC
AGGAAGGCCG CGCTCGCGCT CGCAGAAACC TTCAAGGCAA ATATCCGCAC CTTCACCCTC
GTCACCAATA CGCTCGCGAA GGACAAGGAA ATCTCCGACC GCTGGCGCGG ATTCGAGGAT
ATAGCCGACA GCCGCCATCT CGCCAATCGC GTGGAGCGCG AGGTCGTGGA TGCTCTGGCC
GCATCGGTGA AGGCCGCCTA TCCACGCCTC TCGCACCGGT ATTATGCGAT GAAGGCCAAG
TGGCTCGGCA TGGAGCAGAT GGATTTCTGG GACCGGAACG CTCCTCTGCC CGAGACCCCG
AACGCGCTCA TTCCCTGGAA CGATGCCAGG GATACGGTCC TTTCCGCCTA TCACGCCTTC
GCCCCGGAAA TGGCGTCGAT CGCCCGGCGC TTCTTCGACG ATGGCTGGAT CGATGCGCCG
GTCCGCCCCG GCAAGGCGCC AGGCGCCTTC GCGCATCCGA CCGTCCCTTC CGTGCACCCC
TATGTTCTCG TCAATTATAT GGGCAAGCCG CGCGACGTCA TGACGCTTGC CCACGAGCTC
GGACACGGCG TCCACCAGGT CCTTGCCGGC GCGCAGGGCG CGCTGATGGC CTCGACACCC
CTGACACTTG CGGAAACCGC ATCCGTCTTC GGAGAGATGC TGACCTTCCG CGCCCTCCTC
GACCGGACGA AGGACCGGCG CGAGCGAAAG GCGATGCTTG CGCAGAAGGT CGAGGATATG
ATCAACACGG TCGTCCGGCA GATCGCCTTC TACGATTTCG AACGCAAAGT GCATACGGCC
CGCAAGGAAG GCGAACTGAC GGCCGAGGAT CTCGGCCGCA TATGGATGTC GGTGCAGAGC
GAAAGCCTCG GCCCGGCGAT CCGGCTTTCG GAGAGTTACG AGACCTACTG GGCCTATATT
CCCCACTTCA TCCATTCGCC TTTCTACGTC TATGCCTATG CCTTCGGTGA TTGCCTCGTA
AACTCCCTCT ATGCCGTCTA TCAGAACGCC GAGCGCGGTT TCCAGGAGAA GTATTTCGAT
ATGCTGAAGG CGGGCGGAAC GAAGCATCAT TCCGAACTTC TGGCGCCCTT CGGGCTCGAC
GCCGCCGATC CGTCGTTCTG GGCACAGGGC CTGTCGATGA TTGAGGGGCT CATCGACGAA
CTGGAGGCGC TTGATAAGGC GTAA
 
Protein sequence
MKFASPRLAA SGPTLLRAAL ANSGADQAAS APAELGDLPS WRLTDLYLSP SSEEFRSDLA 
RAEADAVAFE AKWKGKLDQA AGRTGDEGLG AAVNEFEALD DLMGRIASFA GLTYFSDTSN
PANGKLYGDI QSKLTDISAH LLFFSLELNR IDDKRVDAAL AADGLAAHYA PWILDLRKDK
PYQLDDRLEQ LFLEKTMTGA NAFNRLFDET IASLTFSVDG KEQPLEVTLN LLQDSSVEVR
RKAALALAET FKANIRTFTL VTNTLAKDKE ISDRWRGFED IADSRHLANR VEREVVDALA
ASVKAAYPRL SHRYYAMKAK WLGMEQMDFW DRNAPLPETP NALIPWNDAR DTVLSAYHAF
APEMASIARR FFDDGWIDAP VRPGKAPGAF AHPTVPSVHP YVLVNYMGKP RDVMTLAHEL
GHGVHQVLAG AQGALMASTP LTLAETASVF GEMLTFRALL DRTKDRRERK AMLAQKVEDM
INTVVRQIAF YDFERKVHTA RKEGELTAED LGRIWMSVQS ESLGPAIRLS ESYETYWAYI
PHFIHSPFYV YAYAFGDCLV NSLYAVYQNA ERGFQEKYFD MLKAGGTKHH SELLAPFGLD
AADPSFWAQG LSMIEGLIDE LEALDKA