Gene Smed_0599 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0599 
Symbol 
ID5321435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp648224 
End bp650332 
Gene Length2109 bp 
Protein Length702 aa 
Translation table11 
GC content61% 
IMG OID640789535 
Productoligopeptidase B 
Protein accessionYP_001326290 
Protein GI150395823 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1770] Protease II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.11054 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCCGTAT TCAAGAACCT GCCCGCCGCA CCTGCCGCCC CGAAGAAACC GGTTGCCGAT 
ACTCGCCACG GCGTGACCCG CACTGACGAT TACGCGTGGC TGCGGGCCGA CAACTGGCAG
GCCATGTTCA GAGATCCGTC GATCCTGGAC CCGGCGATCC GCCAGCATCT GGAGGCGGAG
AACACCTATA TGAACGCCGC GATGGCCGAC ACGAAGGACT TGCAGAAGAC CTTGTTCGCC
GAGATGCGCG GGCGAATAAA GGAAGACGAT TCATCGGTTC CGATGAAGGA CGGCGCCTTC
GCCTACGGCA CGTCTTACGT CACCGGCGGC GAGCACCCGC GCTACTTCCG GATCCCGCGC
GAAGGAGCAC CTGGCGACGA GTCGATCCGC CAATTGCTCC TCGACGGTGA CAAGGAGGCC
GAGGGCAAGG CCTATTTCCG TATCGCCGGG CTCGACCATT CGAGCGATCA TTCCCGCGGG
ATCTGGGGCT ACGACGACAA GGGCTCGGAA TATTTCACGC TCAGGGTGCG CGACCTTTCA
ACCGGCGAGG AACTCGGCGA TCGGATCGAA AACACGGGTG GCGGCGGTGC CTGGGCACCT
GACGGCACGA GCTTCTTCTA CACCGTGCTC GACGAGAACC ACCGGCCATC GAAGATATTT
CACCACATTG TCGGAACGCC GCAGTCGGAA GACCGGCTGG TTTACGAAGA GCCCGATGCC
GGATTCTTCA TGTCCGTTGG CGGCTCGCTC CTGGATGATT TCATCTATAT CGACATCCAT
GACCACGAGA CGAGCGAGTA TAGGCTGATC CCGACCACTG ACTTGGCCGC AGAGCCGAAG
ATCGTGGCGG AGCGTGTGAC GGGTCTTGAA TATTCCATGA CCGAGGGCGG CGACGTCTTC
TACGTTCTGA CCAATGCCGA CGGCGCCAAG GATTTCAAGA TCATGGAGGC GCCGGTGGCA
GCGCCGCAGA AAGAAAACTG GCGCGAGGTC GTAGCGCACA AGCCGGGAAC GCTGATCCTC
AGCCACATGG CCTATGCCCG CCATCTTGTC TGGCTGCAGC GGCGTAACGG CCTGCCGGAA
ATCGTCATCC GCGACCGCCG GACCGGCGAG GAGCACGCGA TCACCTTCGC CGAGGAGGCC
TATTCTCTGG GACTGTCCGG TGCGGCAGAA TACGATACCG ACGTCATCCG ATTCTCCTAT
TCCTCGATGA CGACGCCTTC ACAGCTTTTC GACTACAACA TGCAGACGCG CGAGCGTACC
TTGCTCAAGA CGCAGGAGGT CCCCTCCGGG CACGAGCCGG ACGACTATGT GACCCGCCGC
GTGTTCGCTC CGGCACCCGA TGGCGAGCAA GTGCCGGTCA CCCTGCTCTA TCGCAAGGAT
ACGCCGCTCG ACGGATCCGC CCCTTGCCTG CTCTACGGTT ACGGTGCCTA CGGCATCACC
ATTCCGGCGA GCTTCAACAC CAATTGCCTG TCGCTCGTCG ACCGCGGCTT CATCTATGCC
ATCGCCCATA TCCGAGGCGG AAAGGACAAG GGCTTTCATT GGTACGAAGA TGGCAAGATG
GCGAAGAAAA CCAACACGTT CAATGACTTC ATCGCCGCTG CCGACTATTT GAATCAGGAG
AGGTTCACCT CCTACGCGAA CATCGTCGCC GAGGGAGGCT CGGCTGGGGG CATGCTGATG
GGCGCCATTG CCAACATGGC GCCGGAGAAA TTCCGCGGCA TCATCGCCGC CGTTCCCTTC
GTCGACGTGC TCAACACCAT GCTGGACGAT AGTTTGCCGC TGACGCCGCC GGAATGGCCC
GAATGGGGCA ATCCGATCGA AAGCCGGGAG TTTTACGGCA TCATTGCCGC CTACTCGCCC
TATGACAATG TCGACACAAA ATCCTACCCG GCTATGCTGG CGCTCGGCGG CCTCACCGAT
CCGCGCGTCA CCTATTGGGA GCCGGCGAAA TGGGTGGCGA AATTGCGTGA GAAGACGACG
GGCAGCGAAC CGATACTCCT CAAGACCAAT ATGGATGCGG GCCATGGCGG CGCCTCCGGG
CGCTTCCAGC GCCTGGAAGA GATCGCCTTC GAATACGCCT TCGCCATCAA GGTCGCCGGC
AGGATGTAG
 
Protein sequence
MSVFKNLPAA PAAPKKPVAD TRHGVTRTDD YAWLRADNWQ AMFRDPSILD PAIRQHLEAE 
NTYMNAAMAD TKDLQKTLFA EMRGRIKEDD SSVPMKDGAF AYGTSYVTGG EHPRYFRIPR
EGAPGDESIR QLLLDGDKEA EGKAYFRIAG LDHSSDHSRG IWGYDDKGSE YFTLRVRDLS
TGEELGDRIE NTGGGGAWAP DGTSFFYTVL DENHRPSKIF HHIVGTPQSE DRLVYEEPDA
GFFMSVGGSL LDDFIYIDIH DHETSEYRLI PTTDLAAEPK IVAERVTGLE YSMTEGGDVF
YVLTNADGAK DFKIMEAPVA APQKENWREV VAHKPGTLIL SHMAYARHLV WLQRRNGLPE
IVIRDRRTGE EHAITFAEEA YSLGLSGAAE YDTDVIRFSY SSMTTPSQLF DYNMQTRERT
LLKTQEVPSG HEPDDYVTRR VFAPAPDGEQ VPVTLLYRKD TPLDGSAPCL LYGYGAYGIT
IPASFNTNCL SLVDRGFIYA IAHIRGGKDK GFHWYEDGKM AKKTNTFNDF IAAADYLNQE
RFTSYANIVA EGGSAGGMLM GAIANMAPEK FRGIIAAVPF VDVLNTMLDD SLPLTPPEWP
EWGNPIESRE FYGIIAAYSP YDNVDTKSYP AMLALGGLTD PRVTYWEPAK WVAKLREKTT
GSEPILLKTN MDAGHGGASG RFQRLEEIAF EYAFAIKVAG RM