Gene Smed_5774 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5774 
Symbol 
ID5320076 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp743897 
End bp745432 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content59% 
IMG OID640777481 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001314413 
Protein GI150377818 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0246285 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGCCTT TGTTTAGGAA TTCGGGGTTT CCCCTTTACC ACGAGGCAAT GTTGTTCGTG 
CGGATGCGGC CCCAGCAACG CCAAGCAAGC TTCGAGGCGA CGTTTACCGC AGCGCGTTCT
CCGTTGAGCA GTGGTCTCCA GGCGTTGGCC TTCTATGAAC GGGCCGGATT GGTGAAACGC
ATTACACCTC TCTCGGAGGA GGCACCGCAC TATGCCAGCC GAATGCTCGT CAGGTCCGCC
CGCAACGCCC GAGTCAAGCC GGAACTTCTT GGCGCTCCGG GAAACGGCGC GGAGGGATGG
TCTGCGTCCG CCCTTAATTT GAGTGCTGCG GTGGACGAAC GAACGCCGGG CGGCGCTGCC
AGCCTTGTCG AATTGGCGAG CGACGCCGAC CTGCGAGAGC TTCAGCTCGC GCTTGCTGGA
GATCCAAGCG TCGCGTCCGT TTCACGGGTT CCGCGGCGCT ATCTGACAGC GAAAATGTCC
CGGAAATCGT CCAAACCATC AGCCGTCGGC AGGGAGCGAA CTGCCACGGC GCCGCCACCG
GCTTACACAA TGTGGAATCT TCGAAAAATT AACTGGCAAG AAGCGCGCGA CCTGAACGGT
TTCAATGACG CGAGCGAGAT CAAGGTCGCG GTCCTCGATA CGGGAATCGA TGCTGGTCAC
CCTGACCTCA AAGACCAGAT CGCCGGATAC ATATATGAGC ACCCGGACCT CCCGGGGGCC
AGTTCAGATC AAGATCTGAT AGGACACGGT ACTCATGTGG CAGGTACGAT TGGCGCGACG
ATCAACAACG ATGTTGGCAT TAACGGCATT TCGCAAGCGA GAATCCATGC CTGGAAGATA
TTCGACGATC GCCCCGACCT GCTAACGCAT CCGGACGGAA CAGCGGAGTT TGCGTATTTC
GTTGATCCGG TGATGTATCT CCGTGCGCTG CTCGACTGCG TGGATGCGGG CATCGATGTC
ATTAATCTTT CTATCGGTGG CGGCGGGGCG CCGGATCCGA CCGAAAGTGC GGCTTTCGAG
GCTTTGCTGG CCGGTGGAAC GAGCATTATT GCGGCAATGG GGAATGAGCG GCGCGAAGGC
AGCCCGATAT CCTACCCTGC CGCAATGCCA GGCGTCACAG CTGTGGGGGC GACAAACCTC
CAGGACCGCA TTACGAGTTT TTCCAACAGG GGCAACCACG TAGCGATTGC GGCGCCGGGC
GATGCGATCT GGTCAACGCT CCCGACCTAC CCGGGTCAAA CAGCCTGGAG AGCGGAAAAG
GGCCCCGATG GTCATTGGTG GCAAGGAAAG GCCGCGATCC GCGAGACAGA CTATGACGCC
TGGCCCGGCA CATCGATGGC GGCTCCGCAT GTTGCAGCCG CTGCGGCGCT TTTCATCGCA
AACGGAGGCG ACAGAGATCC GGCAGCGATC CGAGACGCCC TTTTGGCGAG TGCAGACAAG
GTGCCGGCAA TGGGCGGACA AGACTTTACC CCCGATTTCG GGTATGGTCG TTTAAACTTG
AAACGATTGA TTGCCTGCAT TGGGACCAAT GACTGA
 
Protein sequence
MGPLFRNSGF PLYHEAMLFV RMRPQQRQAS FEATFTAARS PLSSGLQALA FYERAGLVKR 
ITPLSEEAPH YASRMLVRSA RNARVKPELL GAPGNGAEGW SASALNLSAA VDERTPGGAA
SLVELASDAD LRELQLALAG DPSVASVSRV PRRYLTAKMS RKSSKPSAVG RERTATAPPP
AYTMWNLRKI NWQEARDLNG FNDASEIKVA VLDTGIDAGH PDLKDQIAGY IYEHPDLPGA
SSDQDLIGHG THVAGTIGAT INNDVGINGI SQARIHAWKI FDDRPDLLTH PDGTAEFAYF
VDPVMYLRAL LDCVDAGIDV INLSIGGGGA PDPTESAAFE ALLAGGTSII AAMGNERREG
SPISYPAAMP GVTAVGATNL QDRITSFSNR GNHVAIAAPG DAIWSTLPTY PGQTAWRAEK
GPDGHWWQGK AAIRETDYDA WPGTSMAAPH VAAAAALFIA NGGDRDPAAI RDALLASADK
VPAMGGQDFT PDFGYGRLNL KRLIACIGTN D