Gene Svir_04650 details


Gene Information

Locus tag: Svir_04650
Symbol:
ID: 8385803
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharomonospora viridis DSM 43017
Kingdom: Bacteria
Replicon accession: NC_013159
Strand:
Start bp: 456240
End bp: 457286
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 70%
IMG OID: 644974563
Product: O-sialoglycoprotein endopeptidase
Protein accession: YP_003132367
Protein GI: 257054535
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
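
The coordinate and length fields above agree with each other, which a little arithmetic can confirm. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 456240
end_bp = 457286

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1047
print(protein_length)  # 348
```

Both results match the Gene Length and Protein Length fields.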


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.345171
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTCGA TCGTCATGGG CATCGAGAGT TCGTGCGACG AGACGGGTGT CGGACTCGTG 
CGGTTGCGCG ACGACGGTGG TGTCGAGCTG CTCGCCGACG AGGTCGCCAG CAGTGTGGAG
GAGCACGCGC GGTTCGGGGG TGTGGTCCCG GAGGTCGCCA GCCGTGCCCA CCTGGAGGCG
ATGGTGCCCA CCGTGCGTCG GGCGTTCGAC ACGGCGGGGT TGCGGATGTC GGATGTGGAC
GCCATCGCGG TCACGGCGGG TCCCGGACTG GCGGGCGCGC TGCTGGTGGG TGTGTCGGCC
GCGAAGGCCT ACGCCGCCGC GCTCGACGTA CCGCTGTACG GGGTGAATCA CCTCGCCGGG
CACATCGCTG TGGACACGTT GCAGCATGGC CCGTTGCCCA CGCCGTCGCT GGCGTTGCTC
GTGTCGGGTG GGCATACGCA GTTGCTGCGA GTCGACGACA TCGCGTGCAA GATCACCGAA
ATCGGCTCCA CAGTGGATGA CGCGGCCGGG GAGGCCTACG ACAAGGTGGC GCGGGTGCTG
GGCCTGCCGT ATCCGGGTGG TCCGCCCATC GACCGCGCGG CGCGTGAGGG CGATCCGAAC
GCCATCGCGT TTCCACGTGG CATGACCGGG CCTCGGGACG CGCCGTTCGA CTTCTCGTTC
TCCGGCTTGA AGACGGCGGT GGCCCGGTGG GTCGAGGGCG CGCAGGCTCG GGGCGAGAAC
ATCCCGGTGG CGGATGTGGC GGCGTCGTTC CAGGAGGCGG TCGCGGACGT GTTGACGGCC
AAGGCGGTGC GGGCCGCGAC CGAGCTGGGT ATCGGCACGC TCGTCATCTC CGGTGGGGTG
GCGGCGAATT CACGGCTGGC GAGCCTGGCG GCGGAACGCT GCGCGGACGC GGGTGTCGAG
TTGCGTGTGC CGCGACCGAG GTTGTGCACG GACAACGGTG CGATGATCGC CGCGTTGGGA
GCGCATCTCG TCGCCGCGGG TTGCCCACCG AGTCCGCTCG ATCTGTCGGC GGATCCGTCG
CTGCCGGTCA GCACCGTCTG CCTGTGA
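
The GC content field can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content is simply the percentage of G and C bases over the full length; the helper names `clean_sequence` and `gc_percent` are hypothetical. Only the first sequence line is used here to keep the example short.

```python
def clean_sequence(text):
    """Remove the spaces and line breaks used to show bases in groups of ten."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used as a short illustration.
first_line = "ATGTCGTCGA TCGTCATGGG CATCGAGAGT TCGTGCGACG AGACGGGTGT CGGACTCGTG"
seq = clean_sequence(first_line)

print(len(seq))         # number of bases in this line
print(gc_percent(seq))  # GC percentage of this line only
```

Pasting the complete 1047 bp sequence into `clean_sequence` would give a value to compare with the reported GC content.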
 
Protein sequence
MSSIVMGIES SCDETGVGLV RLRDDGGVEL LADEVASSVE EHARFGGVVP EVASRAHLEA 
MVPTVRRAFD TAGLRMSDVD AIAVTAGPGL AGALLVGVSA AKAYAAALDV PLYGVNHLAG
HIAVDTLQHG PLPTPSLALL VSGGHTQLLR VDDIACKITE IGSTVDDAAG EAYDKVARVL
GLPYPGGPPI DRAAREGDPN AIAFPRGMTG PRDAPFDFSF SGLKTAVARW VEGAQARGEN
IPVADVAASF QEAVADVLTA KAVRAATELG IGTLVISGGV AANSRLASLA AERCADAGVE
LRVPRPRLCT DNGAMIAALG AHLVAAGCPP SPLDLSADPS LPVSTVCL
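
The protein sequence follows from the gene sequence by codon translation. The sketch below builds a codon table by hand rather than relying on any library; it uses the standard amino-acid assignments, which translation table 11 shares (table 11 differs in which codons may act as start codons). The names `CODON_TABLE` and `translate` are illustrative. Only the first 30 and last 27 bases of the gene are translated here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, then third base (T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


head = "ATGTCGTCGA TCGTCATGGG CATCGAGAGT"  # first 30 bases of the gene
tail = "CTGCCGGTCA GCACCGTCTG CCTGTGA"     # last 27 bases of the gene

print(translate(head))  # MSSIVMGIES
print(translate(tail))  # LPVSTVCL*
```

The two outputs match the start and end of the protein sequence above, with the final `*` standing for the TGA stop codon.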