Gene Sros_4894 details

Gene Information

Locus tag: Sros_4894
Symbol:
ID: 8668188
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 5421346
End bp: 5425320
Gene Length: 3975 bp
Protein Length: 1324 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: Subtilisin-like protein serine protease-like protein
Protein accession: YP_003340454
Protein GI: 271966258
COG category:
COG ID:
TIGRFAM ID:
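
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the stop codon is counted in the gene length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the one stop codon, which encodes no amino acid.
    return gene_len_bp // 3 - 1


length = gene_length_bp(5421346, 5425320)
print(length)                     # 3975
print(protein_length_aa(length))  # 1324
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.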


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.570577
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.462175
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCCGGC TCCGGCGCGG CTCGCTCGCG CTCACCTCGG CTCTCGCGCT GGTCACCGCC 
ACGCTGGCGG TCCAGCCTTC CGCGACGGCA CAGGCGCTGA CGCCGGCTCC CCAGGCGCAG
CCGCACCAGC CGCTCACCCT CAAGGGCGTG GGCAAGGGAC CGCACACCGT CACGCTGATC
ACCGGCGACA AGGTCACGCT GACCGACACC GGCGGCGGTC GCTACGCGAT CGGGCAAGCC
GGCGGGCCCA GGTCGGACGG CCGTTTCCCG TCGCTGTTCG TACGGTCGGG CCCGGACGGC
GTCTACGTCC TGCCCGACGA CGCCTTCCCC GCGATCCAGT CGGGCACCCT GGACCGCGAG
CTGTTCAACG TCAAGTACCT GGCCGAGAAC GGCTACACCG ACGCCGAGAC CGAGCGGCTG
CCCGTCATCG TGCAGTACCC GGAGGGTTTC GCCGCCGCCA AGAGCTCGGC CGACGCCATC
CCGGCCAGCG TGCCCACCGC GACGCTGGAG AGCATCAACG CTGCCGCACT CGACGTCGCC
AAGGGCGAGG CGGGCGCCTT CTGGGCCGCG CTGCACGCCG AGCCGGACGC CAAGGGCAGG
GCAGGGCTGC GCGGCGGCGT CGGCAAGGTC TGGCTGGACC GCAAGGTCAA GGCCGACCTG
TCGGAGAGCG TCCCGCAGGT CGGGGCGCCG GAGGCGTGGA AGGGCGGGCA CGACGGCACG
GGCGTGACCG TCGCCGTCCT GGACACCGGC GTCGACGCCG CTCATCCCGA CCTGGCCGGG
AAGATCGCCG AGGCGCGCTC GTTCGTACCG GACGAGAGCG CGCAGGACGG CCACGGTCAC
GGCACCCACG TCGCCTCCAC CGTCGCGGGC AGTGGCGCCG CCTCGGGCGG CGCGAACAAA
GGCGTCGCGC CGGGCGCGCG GCTCCTGGTC GGCAAGGTGC TGGACAACTC CGGCAGCGGC
ACGGAGTCGG GGATCGTCGA CGCCATGGAG TGGGCCACAG CGAGCGGCGC CAAGGTGGTC
AGCCTGAGCC TGGGCGCGAA CGCGACCGAC GGCACCGACC CGATGAGCCA GGCGGTCAAC
GACCTGACCG CCGCCACCGG CGCGCTGTTC GTCATCGCCG CGGGCAACGT CGGCACGCCC
GAGAGCGTCT CCACGCCCGG CACCGCCGAC GCGGCGCTGA CCGTGGCCGC CGTCGACAAG
GCCGACCAGC AGGCCTGGTT CTCCAGTCAG GGGCCGCGCG TCGGCGACGC CGCGCTGAAG
CCGGACATCA CCGCGCCCGG CGTCGACATC GCCGCCGCGC GCGCCTCCGG CACCGCGATG
GGCAGCCCGG TCGACGACCA CTACACCAAG GCGTCCGGCA CCTCGATGGC CACGCCGCAC
GTGGCGGGCG CCGCGGCGAT CGTCGCACAG GTGCACCCGG ACTGGACGCC GCAGCAGCTC
AAGGCCGCGC TGATGTCGAC GGTCAAGGAC GTCGGCGGGA CCGTCTACCA GCGCGGCGCC
GGCCGGCTCG ACGTCGCCCG CGCGGCCTCG CAGACCGTGT TCGCCACCAC GCCGAACCTC
GACTTCGGCC TGTTGGACGA GTCCGGAAAG CCGCTGACCC GCGAGCTGGC CTACACCAAC
CTCGGCGACC AGCCGGTCAC CCTCACGCTC ACGGCGGCCA TGGGCGAGAC CCGGCTGAGC
ACGGCCGACG CCACGCTCAC CGTCCCGGCC AAGGGCACGG CCGCCACCAC CGTCACGCTC
GTCACCCAGG GCCTGGAACT CGGCACCTAC AGCGGCGCGG TCACCGCGCA GGCCGACGGC
GTGCGCCTGA CCACCCCGGC GGGCGCGGTG CGGGAGGCCC CGACCTACCA GCTGACCATC
CGCACCCTGG GCCGCGACGG CAAGCCGCGC ACCCCCTTCG CCCAGGACGT CGTGGACCTC
GAAGGACGCA AGGGCCACCT GAGCCCGCAC CTGATCGTCG ACGAGGGCGT CGTGGTCACC
CGCGTCCCGG CCGGGACGGT CAGCGTCCTG CAGGTGATGG AGTGGACCGA CGCCGACAGC
AGGAGCAACC GGGTCTGGCT GTTCGACCCC GAGCTCACCA TCACGGGCGA CACCGAGATC
ACGATGGACG CCAGGAAGGC CAGCCAGGTC CGCTTCAGCA CGCCCCAGCC GGCCGAGCCG
CTGAACAACG CGTTCACCAG CTTCTACCAG CGCACGAACG CCAGGGGCGA GGTCTTCGCC
GGGTCGGTGC TGCAGACGGT GCCGATCGGC TCCTGGGGCA AGCTGTGGGT GCTGCCGACC
AAGAAGGTCA CCAAGGGCGC CTTCCGCTTC GCCACGCAGT GGACGCTCGG GCAGTCCGAG
ATCGCCATGA GCGTGCGTGG CCGGGGCAAG ATGGAGCTGA ACCCGGCGGC GAACCTCCAC
TGGCAGGGAG AGGTCAACCA CCACCCTGAC TGGACGCCGT TCACCGGGAC CAAGGACCTG
CTCCTGACCG ACGTCGGCCA GGGCACCCCC GAGGAGCTCG CCGGGCGTGA CCTGCGCGGC
AGGCTCGTCC TGATGGCGGC CGAGGGAACG GTCGACTTCC TCGGCAACCC GACCTGCGGC
GTGCAGATCG AGCGCATCGG CGCCGTCAGG GACGCCGGCG CCGCCGGCCT GGTGATCTAC
CCCACGCAGG ACTCCGGCTG CCCCATCCCG CTGCCGATCT GGCAGAAGCC GTTCACGGGC
GACCCCAAGC CGCTCGGCAT CGCCAACGCG TACGTGTCCA CCAAGGAGGG TCTCGCGCTG
CGTGAGCAGG CCAGGCGCGG CCCGCTCACC ATCCGGGTGA CAGGCACGCC GCACTCGCCC
TACACCTACG CGTTCTCGCC GTACGAGGAG GGGCGCATCC CCTCGTCGAT GCACTACACG
GTGCGCGAGC GGGACGTCGC CCGCGTCGAT CTGGACATTC ACGCGTCCAA GCCGGGCGGC
TACTGGGAGT GGACTCTCGC CTACAAGCAG GACGACGCCC AGCGCTGGAG CGTCTCGCCC
TCCGACGCCG ACGTCGCCGG CATCGCGCCG CAGGTCCGCA CGGAGTACGT CTGGCCCACC
GACCCCTCCG TCGTGCACAT CCGCGGCATG GCGCCCGACC CCAAGGGCAC CAACGGCGTG
CACAGCCGTT ACCTGACGGA GGTCTACAAG CGGCCGGGCC GTACCGGGCA GGTGTGGTTC
GCGCCCGGCA CGCCTGGCGC CGCGACGGTG TCCGACGCGG CGGCCGCGCT GCCCGACCCG
AAGGCCGGGG TGCTCAAGGA GCAGGGGTTG GGAATCAACT GCGCGATCTG CGTGCAGGGC
GACAAGCTGT GGGCCGACTT CTCCGAGGTC TCCGGCGTCG CGGACTCCCG CGTCGACAGC
GACGACTACT GGTCGACCGG GCAGATGTTC ACCCCGATGT ACGAGACGCA CCTGTACCGC
GACGGCAAGG AGATCCCGCG CCAGGGGGTC GAGCCGCTGG CGGGCAACGC GCCGCGGTTC
ACCCTGCCGG CCTCCGAGGG CGTCTACCGG CTGACGGCCA AGAGCGCCAC GAACGACGTG
GAGTGGACCT TCACCGGGCC ACCCGCGAAG GACGCGGTCC AGCCCGGCGC CGCCTGCACC
TCGTGGTTCG TCGAGGGGTA CGGCGAGCAC TGCAGGCCCA TGCCGGCGGT GTTCGTCAGC
TACGCGCTGG GCGGCGACCC CGGCAACGCC GTGGCGGCGG GGCGCAAGCA CACCTTCCAG
GTCGAGGCCT ACCATTCGCG GTCGACGGCG AGGATGCCGA AGATCGCCGG GCTGAAGCTG
TGGGCGAGCA CCGACGACGG CGCCACCTGG CAGCCCGTCA CGCTCAAGCG CGGCTCAGGC
GGCCTGTACA CGGCCAGTGT CAAGTATTCC GCCTTGCGCG CCACCACCGG CGCGGTCAGC
CTGAGGGCCG AGGCGTGGGA CGAGGCGGGC AACCGGGTCA AGCAGACCAG CACCCGGGTC
TTCCCGCTGC GCTAG
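
The gene sequence above is laid out in space-separated blocks of ten bases. The following is a minimal sketch of how such a block could be cleaned and its GC fraction computed, for comparison with the GC content field. The helper names are hypothetical, and the sketch assumes the sequence contains only the letters A, C, G and T.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumes only A/C/G/T are present)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Example using only the first two blocks of the gene sequence.
first_blocks = "GTGAGCCGGC TCCGGCGCGG"
seq = clean_sequence(first_blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.85
```

Passing the full gene sequence to `gc_fraction` gives the value to compare against the GC content reported in the Gene Information section.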
 
Protein sequence
MSRLRRGSLA LTSALALVTA TLAVQPSATA QALTPAPQAQ PHQPLTLKGV GKGPHTVTLI 
TGDKVTLTDT GGGRYAIGQA GGPRSDGRFP SLFVRSGPDG VYVLPDDAFP AIQSGTLDRE
LFNVKYLAEN GYTDAETERL PVIVQYPEGF AAAKSSADAI PASVPTATLE SINAAALDVA
KGEAGAFWAA LHAEPDAKGR AGLRGGVGKV WLDRKVKADL SESVPQVGAP EAWKGGHDGT
GVTVAVLDTG VDAAHPDLAG KIAEARSFVP DESAQDGHGH GTHVASTVAG SGAASGGANK
GVAPGARLLV GKVLDNSGSG TESGIVDAME WATASGAKVV SLSLGANATD GTDPMSQAVN
DLTAATGALF VIAAGNVGTP ESVSTPGTAD AALTVAAVDK ADQQAWFSSQ GPRVGDAALK
PDITAPGVDI AAARASGTAM GSPVDDHYTK ASGTSMATPH VAGAAAIVAQ VHPDWTPQQL
KAALMSTVKD VGGTVYQRGA GRLDVARAAS QTVFATTPNL DFGLLDESGK PLTRELAYTN
LGDQPVTLTL TAAMGETRLS TADATLTVPA KGTAATTVTL VTQGLELGTY SGAVTAQADG
VRLTTPAGAV REAPTYQLTI RTLGRDGKPR TPFAQDVVDL EGRKGHLSPH LIVDEGVVVT
RVPAGTVSVL QVMEWTDADS RSNRVWLFDP ELTITGDTEI TMDARKASQV RFSTPQPAEP
LNNAFTSFYQ RTNARGEVFA GSVLQTVPIG SWGKLWVLPT KKVTKGAFRF ATQWTLGQSE
IAMSVRGRGK MELNPAANLH WQGEVNHHPD WTPFTGTKDL LLTDVGQGTP EELAGRDLRG
RLVLMAAEGT VDFLGNPTCG VQIERIGAVR DAGAAGLVIY PTQDSGCPIP LPIWQKPFTG
DPKPLGIANA YVSTKEGLAL REQARRGPLT IRVTGTPHSP YTYAFSPYEE GRIPSSMHYT
VRERDVARVD LDIHASKPGG YWEWTLAYKQ DDAQRWSVSP SDADVAGIAP QVRTEYVWPT
DPSVVHIRGM APDPKGTNGV HSRYLTEVYK RPGRTGQVWF APGTPGAATV SDAAAALPDP
KAGVLKEQGL GINCAICVQG DKLWADFSEV SGVADSRVDS DDYWSTGQMF TPMYETHLYR
DGKEIPRQGV EPLAGNAPRF TLPASEGVYR LTAKSATNDV EWTFTGPPAK DAVQPGAACT
SWFVEGYGEH CRPMPAVFVS YALGGDPGNA VAAGRKHTFQ VEAYHSRSTA RMPKIAGLKL
WASTDDGATW QPVTLKRGSG GLYTASVKYS ALRATTGAVS LRAEAWDEAG NRVKQTSTRV
FPLR