Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5144 |
Symbol | |
ID | 5319446 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 96469 |
End bp | 97863 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640776922 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001313854 |
Protein GI | 150377259 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.659537 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGACGA CGTGCAAACT TGCGCTGATC GCTTCCCTTG CGGCAATCGC TCTCGTTTCG GGCGGCATCG TTGCTCCCGG CAATATGCTT GTGCGTGAAA CGCTCGGAGC CGGCGTGGCC CTGGCCGACG ATGATGACGA CGACGGGGGC GGCAACCGGG GCTCGGGCGG CTCCGGCCGT TCCGGCTCAT ACGGCGCCGG CGCCGGATGG AGCGGCGGCA AGAGCCTTTT TCCGTTCCGA GGGTTCCTGC CACGTCGGAG CACTCCGCGG CGCCCGCGAG CAGCCGCACC CGCTCCGCCG ACACGAGCCC CCGAAGAGAT TGTCGGGCTG GGGATCAGTC CGTCTGAACT CGGGCAGCTG GCGGCCGCCG GCTTCGAGGT CCTCGAACGC GACCCAATGG CGACCTTTGG CACCGAAGTG ATCAAACTTC GCATCCCCCC GGGCGTGACA CTTGAAAGCG CGCGGCAACA GGCGCGCAGC GTGGCTCCAC AAGCGGCCGT CGACTTCAAC CACTATTTCC GGCCGGAGCA GCATCCCGAT GCTCCCTGCG TCACGAGCGA CTGCCTGGCA CGCGAGGTGA TCGGCTGGCC CGAAGCGCAA AGCTGGCCCG GCACTTGCCC GGCCGGCGTT CGTATCGGTC TGGTCGATAC GGCGATCAAC CCGGATCACA TCGCATTCGA AGCCCGGAAC ATCGAGATTG TCCGTCTTGC CAGCGACGCG CTGCCGGAGT CCGGCAAGCA GCACGGAACC GCTGTGGCGG CCTTGCTCGT CGGCTCAGCC ACCAGCCGCA CACCGGGCCT GATACCCGGC GGCAAGCTCA TCGCCGTGGA CGCATTTCAC CGCGCAGGGC GGCAGGATGA CGTCTCCGCC GCCTTCGATC TTGCGCGCGC GCTCGACCTT CTCGCCGGAC GTCAGGTAAA GGTCATCAAT CTCAGCCTTG CCGGCCCGCC CAATCTCCTG CTTGAACAGG CGGTCAAGGA GGTGGGGGAG CGCGGCATCA TCATGGTTGC CGCCGCCGGC AATGATGGTC CGAGGGCAAA GCCGGTTTAT CCCGGCGCCT ACGAAAATGT CATCGCCGTC ACTGCAACCG ACAGGCAGAA GCGGCCTTAC AGGCGCGCCG GACGCGGGGA ACATATCGAC TTCGCTGCGC CGGGCGTGGC CGTCTGGACA GCGGCCTCGG TCCGAGGCGC GCGTCCGAAG ACGGGGACGT CTTTCGCCGC ACCTTTCGTG ACCGCGGCCG TGGCGATGAT GATGGCATCC GAACTGGACC TTGCGCCCGA GCGGCTCCGC GACAGGCTGA CCGGACACGC GGAAGATCTC GGAGATCCGG GAAAGGATCC CGTCTTCGGT TGGGGCCTTT TGAACGCACG GGCGATTTGC GACACCAAAA CGTAA
|
Protein sequence | MLTTCKLALI ASLAAIALVS GGIVAPGNML VRETLGAGVA LADDDDDDGG GNRGSGGSGR SGSYGAGAGW SGGKSLFPFR GFLPRRSTPR RPRAAAPAPP TRAPEEIVGL GISPSELGQL AAAGFEVLER DPMATFGTEV IKLRIPPGVT LESARQQARS VAPQAAVDFN HYFRPEQHPD APCVTSDCLA REVIGWPEAQ SWPGTCPAGV RIGLVDTAIN PDHIAFEARN IEIVRLASDA LPESGKQHGT AVAALLVGSA TSRTPGLIPG GKLIAVDAFH RAGRQDDVSA AFDLARALDL LAGRQVKVIN LSLAGPPNLL LEQAVKEVGE RGIIMVAAAG NDGPRAKPVY PGAYENVIAV TATDRQKRPY RRAGRGEHID FAAPGVAVWT AASVRGARPK TGTSFAAPFV TAAVAMMMAS ELDLAPERLR DRLTGHAEDL GDPGKDPVFG WGLLNARAIC DTKT
|
| |