Gene Svir_18200 details

Gene Information

Locus tag: Svir_18200
Symbol:
ID: 8387147
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharomonospora viridis DSM 43017
Kingdom: Bacteria
Replicon accession: NC_013159
Strand:
Start bp: 1887572
End bp: 1888987
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 69%
IMG OID: 644975888
Product: subtilisin-like serine protease
Protein accession: YP_003133670
Protein GI: 257055838
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
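
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the final codon is a stop codon that contributes no residue. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 1887572
end_bp = 1888987
gene_length_bp = 1416
protein_length_aa = 471

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed
# to be a stop codon and is not counted as a residue.
codons = span // 3
residues = codons - 1

print(span, codons, residues)
```

Under these assumptions the computed span and residue count agree with the Gene Length and Protein Length fields.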


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCGAT TCAGATCCCT GCTGTTACCC GCCGCCCTGA CCTTGGGCGT CGGCGGTGTG 
CTGGCCGCTC CCGTGGCCGC CGCCGAACCC ACCGCAGCGG CGGAGTGCGA CACCACCAGC
ACCCCCTACA CCTACGTGGT GCTGTATCAC CCGCGCACGC CGCAAGCCGT GGTCGACGCC
GAACTGGCCG CCAAGTGCGG GGAGCGGGTC GCCTACTACC CGGAGATCGG GGTGGCGGTG
GCCAGCTCCC GTAACGCCGA CTTCGCCGAC CGGATCGGTG TCTACCGCGC CTACTCCGGC
TCCCGCGAGG TCGCCCACCC CGACACGGCG GCCGCGGTGG CCCGGGCGCA GGCTCGTGCG
GAGGTCGAGA CCGAGGAGAC CGTCGACGTG GTCTCGACCG CGGATCTCTC CGCGCAACAG
TGGGACATGC ACATGATCCA CGCGCCGGAA GCCCATGCGA TCAATGAGGG CAGTCCCTCG
GTCACGGTGG GCGTACTGGA CTCCGGGATC GAACCCACTC ACCCGGCGCT GGTCGACTCG
CTGGACCCGG AAACCTCCGT CGGCTGCAAC ACGGGCGCAC CTGACACCCG CCCCGAGGCG
TGGGCTTCGA CGAACATCGA CCACGGCACC CACGTCGCCG GCACGATCTC CGGCAAGGAC
ACCGAGCGCG GTTTCACCGG TGTCGCCCCC GGCGTGCGGA TCGCGTCGGT GAAGGTCGTC
AACGACGAGG GCTACATCTA CCCCGAAGCG GCGGTCTGCG GTTTCATGTG GGCGGCCGAG
CACCGTTTCG AGGTGACCAA CAACAGTTAC TACGTCGACC CGGGGATGTT CTACTGTCCG
AGCCAACCCG GTGACGCGGC GGCCTACGAA GCCGTGCGGC GTGCGGTGGC CTACTCCCAG
AAGCGGGGCG TGCTCAACGT CGCCGCGGCC GGCAACAGCG ACTTCGACCT GGCGGACCCG
CCGGCCGACG ACCCCAACCG CCAGCACCCT GTGAACTCCG GGTGCGCCAT CCTGCCCAAG
GGGCTCGACG GTGTGGTGAC CGTTTCGTCC GTCGGCTACG AAGCCACCAA GTCCTCGTTC
AGCAACTACG GCCTCCGCGA GGTCGACGTG GCCGCACCGG GTGGCGACCG TGATCAGTTG
CCCCCGGGAG CGACGTCGGG CTGCATCCTC TCCACGGTGT TCAACGGCCA GTACGGCACC
AAGTGCGGCA CCTCGATGGC CGCGCCACAC GCCGCCGGAG TGGCCGCGCT GATCGCCAGC
AAACGTCCCC AGCTTCCGCC GCAGGCCATC TCGGCACTGC TGCGGGCCAA GGCCGACAAC
ATGCCCTGTC CCGACGACGA CCGGTGCACC GGTCCTGCGG CGTACAACTC GTTCTACGGC
CACGGGCTCG TGAACGCGCT GGCCGCCGTC AAGTAA
 
Protein sequence
MSRFRSLLLP AALTLGVGGV LAAPVAAAEP TAAAECDTTS TPYTYVVLYH PRTPQAVVDA 
ELAAKCGERV AYYPEIGVAV ASSRNADFAD RIGVYRAYSG SREVAHPDTA AAVARAQARA
EVETEETVDV VSTADLSAQQ WDMHMIHAPE AHAINEGSPS VTVGVLDSGI EPTHPALVDS
LDPETSVGCN TGAPDTRPEA WASTNIDHGT HVAGTISGKD TERGFTGVAP GVRIASVKVV
NDEGYIYPEA AVCGFMWAAE HRFEVTNNSY YVDPGMFYCP SQPGDAAAYE AVRRAVAYSQ
KRGVLNVAAA GNSDFDLADP PADDPNRQHP VNSGCAILPK GLDGVVTVSS VGYEATKSSF
SNYGLREVDV AAPGGDRDQL PPGATSGCIL STVFNGQYGT KCGTSMAAPH AAGVAALIAS
KRPQLPPQAI SALLRAKADN MPCPDDDRCT GPAAYNSFYG HGLVNALAAV K
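
As a rough illustration of how the gene and protein sequences relate, the following sketch translates a DNA string codon by codon and computes its GC fraction. It is not the database's own pipeline. It assumes the codon-to-residue assignments of translation table 11, which match the standard genetic code, and it does not model the alternative start codons of that table (not needed here, since this gene begins with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order (translation table 11;
# the assignments are the same as in the standard code). "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence listed above.
first_line = "ATGTCCCGAT TCAGATCCCT GCTGTTACCC GCCGCCCTGA CCTTGGGCGT CGGCGGTGTG"
last_line = "CACGGGCTCG TGAACGCGCT GGCCGCCGTC AAGTAA"

print(translate(first_line))
print(translate(last_line))
```

The two printed fragments can be compared with the start and end of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.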