Gene Smed_5144 details


Gene Information

Locus tag: Smed_5144
Symbol:
ID: 5319446
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 96469
End bp: 97863
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 66%
IMG OID: 640776922
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_001313854
Protein GI: 150377259
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
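
The coordinate and length fields in this record are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 96469
end_bp = 97863
gene_length_bp = 1395
protein_length_aa = 464


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)      # coordinates vs. gene length
print(coding_codons(gene_length_bp) == protein_length_aa)   # gene length vs. protein length
```

Under those assumptions, 97863 - 96469 + 1 gives 1395 bp, and 1395 / 3 - 1 gives 464 aa, matching the listed values.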


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.659537
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGACGA CGTGCAAACT TGCGCTGATC GCTTCCCTTG CGGCAATCGC TCTCGTTTCG 
GGCGGCATCG TTGCTCCCGG CAATATGCTT GTGCGTGAAA CGCTCGGAGC CGGCGTGGCC
CTGGCCGACG ATGATGACGA CGACGGGGGC GGCAACCGGG GCTCGGGCGG CTCCGGCCGT
TCCGGCTCAT ACGGCGCCGG CGCCGGATGG AGCGGCGGCA AGAGCCTTTT TCCGTTCCGA
GGGTTCCTGC CACGTCGGAG CACTCCGCGG CGCCCGCGAG CAGCCGCACC CGCTCCGCCG
ACACGAGCCC CCGAAGAGAT TGTCGGGCTG GGGATCAGTC CGTCTGAACT CGGGCAGCTG
GCGGCCGCCG GCTTCGAGGT CCTCGAACGC GACCCAATGG CGACCTTTGG CACCGAAGTG
ATCAAACTTC GCATCCCCCC GGGCGTGACA CTTGAAAGCG CGCGGCAACA GGCGCGCAGC
GTGGCTCCAC AAGCGGCCGT CGACTTCAAC CACTATTTCC GGCCGGAGCA GCATCCCGAT
GCTCCCTGCG TCACGAGCGA CTGCCTGGCA CGCGAGGTGA TCGGCTGGCC CGAAGCGCAA
AGCTGGCCCG GCACTTGCCC GGCCGGCGTT CGTATCGGTC TGGTCGATAC GGCGATCAAC
CCGGATCACA TCGCATTCGA AGCCCGGAAC ATCGAGATTG TCCGTCTTGC CAGCGACGCG
CTGCCGGAGT CCGGCAAGCA GCACGGAACC GCTGTGGCGG CCTTGCTCGT CGGCTCAGCC
ACCAGCCGCA CACCGGGCCT GATACCCGGC GGCAAGCTCA TCGCCGTGGA CGCATTTCAC
CGCGCAGGGC GGCAGGATGA CGTCTCCGCC GCCTTCGATC TTGCGCGCGC GCTCGACCTT
CTCGCCGGAC GTCAGGTAAA GGTCATCAAT CTCAGCCTTG CCGGCCCGCC CAATCTCCTG
CTTGAACAGG CGGTCAAGGA GGTGGGGGAG CGCGGCATCA TCATGGTTGC CGCCGCCGGC
AATGATGGTC CGAGGGCAAA GCCGGTTTAT CCCGGCGCCT ACGAAAATGT CATCGCCGTC
ACTGCAACCG ACAGGCAGAA GCGGCCTTAC AGGCGCGCCG GACGCGGGGA ACATATCGAC
TTCGCTGCGC CGGGCGTGGC CGTCTGGACA GCGGCCTCGG TCCGAGGCGC GCGTCCGAAG
ACGGGGACGT CTTTCGCCGC ACCTTTCGTG ACCGCGGCCG TGGCGATGAT GATGGCATCC
GAACTGGACC TTGCGCCCGA GCGGCTCCGC GACAGGCTGA CCGGACACGC GGAAGATCTC
GGAGATCCGG GAAAGGATCC CGTCTTCGGT TGGGGCCTTT TGAACGCACG GGCGATTTGC
GACACCAAAA CGTAA
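
The gene sequence above is laid out in space-separated blocks of 10 bases. The following is a minimal sketch of how that layout could be stripped and a GC percentage computed, for comparison with the GC content field of the record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the final line of the sequence, not the full gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from the block layout and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(text):
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Final line of the gene sequence as shown in this record.
last_line = "GACACCAAAA CGTAA"
print(len(clean_sequence(last_line)))
print(gc_percent(last_line))
```

Passing the complete gene sequence to `gc_percent` would be the way to compare against the 66% listed in the record.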
 
Protein sequence
MLTTCKLALI ASLAAIALVS GGIVAPGNML VRETLGAGVA LADDDDDDGG GNRGSGGSGR 
SGSYGAGAGW SGGKSLFPFR GFLPRRSTPR RPRAAAPAPP TRAPEEIVGL GISPSELGQL
AAAGFEVLER DPMATFGTEV IKLRIPPGVT LESARQQARS VAPQAAVDFN HYFRPEQHPD
APCVTSDCLA REVIGWPEAQ SWPGTCPAGV RIGLVDTAIN PDHIAFEARN IEIVRLASDA
LPESGKQHGT AVAALLVGSA TSRTPGLIPG GKLIAVDAFH RAGRQDDVSA AFDLARALDL
LAGRQVKVIN LSLAGPPNLL LEQAVKEVGE RGIIMVAAAG NDGPRAKPVY PGAYENVIAV
TATDRQKRPY RRAGRGEHID FAAPGVAVWT AASVRGARPK TGTSFAAPFV TAAVAMMMAS
ELDLAPERLR DRLTGHAEDL GDPGKDPVFG WGLLNARAIC DTKT
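
The protein sequence corresponds to a translation of the gene sequence under translation table 11. As a hedged illustration, the sketch below builds a codon table from the compact NCBI-style amino-acid string and translates a block-formatted fragment. It assumes that table 11 assigns the same amino acids to codons as the standard code (the tables differ in which start codons are permitted), and that translation stops at the first stop codon. The function name `translate` is illustrative only.

```python
from itertools import product

# Codon order: first base varies slowest, each position cycling through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text):
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and final 15 bases of the gene sequence in this record.
print(translate("ATGCTGACGA CGTGCAAACT TGCGCTGATC"))
print(translate("GACACCAAAA CGTAA"))
```

The two fragments correspond to the first ten residues (MLTTCKLALI) and the last four residues (DTKT) of the protein sequence above, with the trailing TAA read as the stop codon.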