Gene Information |
Locus tag | Svir_04650 |
Symbol | |
ID | 8385803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharomonospora viridis DSM 43017 |
Kingdom | Bacteria |
Replicon accession | NC_013159 |
Strand | + |
Start bp | 456240 |
End bp | 457286 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644974563 |
Product | O-sialoglycoprotein endopeptidase |
Protein accession | YP_003132367 |
Protein GI | 257054535 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
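The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Svir_04650.
length = gene_length(456240, 457286)
print(length, protein_length(length))  # prints: 1047 348
```

Under those assumptions the result agrees with the listed Gene Length (1047 bp) and Protein Length (348 aa).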
| |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.345171 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTCGA TCGTCATGGG CATCGAGAGT TCGTGCGACG AGACGGGTGT CGGACTCGTG CGGTTGCGCG ACGACGGTGG TGTCGAGCTG CTCGCCGACG AGGTCGCCAG CAGTGTGGAG GAGCACGCGC GGTTCGGGGG TGTGGTCCCG GAGGTCGCCA GCCGTGCCCA CCTGGAGGCG ATGGTGCCCA CCGTGCGTCG GGCGTTCGAC ACGGCGGGGT TGCGGATGTC GGATGTGGAC GCCATCGCGG TCACGGCGGG TCCCGGACTG GCGGGCGCGC TGCTGGTGGG TGTGTCGGCC GCGAAGGCCT ACGCCGCCGC GCTCGACGTA CCGCTGTACG GGGTGAATCA CCTCGCCGGG CACATCGCTG TGGACACGTT GCAGCATGGC CCGTTGCCCA CGCCGTCGCT GGCGTTGCTC GTGTCGGGTG GGCATACGCA GTTGCTGCGA GTCGACGACA TCGCGTGCAA GATCACCGAA ATCGGCTCCA CAGTGGATGA CGCGGCCGGG GAGGCCTACG ACAAGGTGGC GCGGGTGCTG GGCCTGCCGT ATCCGGGTGG TCCGCCCATC GACCGCGCGG CGCGTGAGGG CGATCCGAAC GCCATCGCGT TTCCACGTGG CATGACCGGG CCTCGGGACG CGCCGTTCGA CTTCTCGTTC TCCGGCTTGA AGACGGCGGT GGCCCGGTGG GTCGAGGGCG CGCAGGCTCG GGGCGAGAAC ATCCCGGTGG CGGATGTGGC GGCGTCGTTC CAGGAGGCGG TCGCGGACGT GTTGACGGCC AAGGCGGTGC GGGCCGCGAC CGAGCTGGGT ATCGGCACGC TCGTCATCTC CGGTGGGGTG GCGGCGAATT CACGGCTGGC GAGCCTGGCG GCGGAACGCT GCGCGGACGC GGGTGTCGAG TTGCGTGTGC CGCGACCGAG GTTGTGCACG GACAACGGTG CGATGATCGC CGCGTTGGGA GCGCATCTCG TCGCCGCGGG TTGCCCACCG AGTCCGCTCG ATCTGTCGGC GGATCCGTCG CTGCCGGTCA GCACCGTCTG CCTGTGA
Protein sequence | MSSIVMGIES SCDETGVGLV RLRDDGGVEL LADEVASSVE EHARFGGVVP EVASRAHLEA MVPTVRRAFD TAGLRMSDVD AIAVTAGPGL AGALLVGVSA AKAYAAALDV PLYGVNHLAG HIAVDTLQHG PLPTPSLALL VSGGHTQLLR VDDIACKITE IGSTVDDAAG EAYDKVARVL GLPYPGGPPI DRAAREGDPN AIAFPRGMTG PRDAPFDFSF SGLKTAVARW VEGAQARGEN IPVADVAASF QEAVADVLTA KAVRAATELG IGTLVISGGV AANSRLASLA AERCADAGVE LRVPRPRLCT DNGAMIAALG AHLVAAGCPP SPLDLSADPS LPVSTVCL