Gene Sros_4203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4203 
Symbol 
ID8667497 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4678790 
End bp4682791 
Gene Length4002 bp 
Protein Length1333 aa 
Translation table11 
GC content70% 
IMG OID 
ProductSubtilisin-like protein serine protease-like protein 
Protein accessionYP_003339848 
Protein GI271965652 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.5318 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00171277 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAAACGCA CCGTCCTGCT GACCTCACTC CTCGCACTGG CCGCGACCGC GCTGCCGCAG 
ACGCCGGCGG CACTGGCCGC CGGGCCGGCG CAGCCGCCCC CGGCGGCTCC CGTCACACTC
CAGGGTGTCG GCCCGGGCCC GCACGTCATC ACCCTGGTCA CCGGTGACAA GGTCACGCTC
ACCCAGGCCG CAGCCGACCG CTTCGACGTG AAGACCGAAC CCGCCGCCCG CTCGGACGGC
CGCCGGCCGC GGCTGGTCAC CCGGACCACT CCCGAGGGCG TCTACGTCCT GCCCACCGAC
GCGATGCCCG CCATTCAGGC GGGGCGGCTG GACCGGCGGC TGTTCGACGT CAGATATCTG
GCGGAGAACG GCTACGCCGA CGACCGGACC AAGCAGGTCC CGGTGATCGT GCAGTACCCG
AAGGACCGCG AGGGCGCCAC CCTCAAGCGG GCTGCCGACG AGATCCCGGC CAGCGTCCCC
ACCAAGACGC TGGACAGCAT CCACGCCTCC GCCGTGAACG TCACCAAGGC GGAGGCGGGG
ACGTTCTGGG AAGCCGTACG GACCGCCCCC TCAGCCGGTC TGAAAGCGTC CGACACGCTG
AGCGGCGGGA TCGCCAAGCT GTGGCTGGAC GGCAAGGTCA AGGCCGTCCT GGACGAGAGC
GTGGCCATGA TCGGCGCGCC GCAGGCCTGG GCCGGCGGCC ACGACGGCAC CGGGGTGAAG
GTCGCGGTGC TGGACACCGG CATCGACGCC ACCCACCCCG ACTTCGCAGG GAAGATCGCC
ACCTCCCAGT CCTTCGTCCC CGACGCGCCG GTCACGGACG GCCACGGGCA CGGCACCCAC
GTGGCCTCCA CCATCGCCGG CTCCGGAGCC GCGTCCGGCG GCAAGTACAA GGGAGTCGCG
CCGGGCGCGC AGCTGGTCGT CGGCAAGGTG CTGGCCGACG ACGGCTCCGG CCAGGACTCC
TGGATCATCG AGGGTATGGA GTGGGCCGCC AACAGCGGCT CCAAGGTCGT CAGCATGAGC
CTGGGCGCCG GCCCCAGCGA CGGCACCGAC CCGATCAGCC AGGCGGTCAA CGACCTGAGC
GCGTCCACCG GCACCCTGTT CGTCATCGCG GCGGGCAACT CCGGCGCCCT CCCCGGGACC
GTGGCCCTGC CCGGCGCGGC CGAGGCCGCG CTCACCGTCG CCGCGGTCGA CAAGCAGGAC
CAGATGGCCT ACTTCTCCGG CCGGGGGCCG CGGTTCGGCG ACTCGGGGCT CAAGCCCGAC
ATCGCCGCGC CCGGTGTGGA CATCGCCGCC GCCCGCGCCA CGGGCACCAC GATGGGCACC
CCCGTCGACG ACCGCTACAC CGAGGCGTCC GGCACCTCGA TGGCCACGCC GCACGTGGCG
GGCGCGGCGG CGATCATGGC GCAGCAGCAC CCCGACTGGA AGGGGCCGCT GCTCAAGGCC
GCGCTGATGT CCACTTCCAA GGACGACGGC TTCACCGTGT ACGAGCAGGG CGCCGGGCGG
GTGGACCTCG CCAAGGCGTA CACCCAGCGG GTCTTCGCCA CCACCGGCGG GATCGACTTC
GGCTCGGTGA CCGAGGAGGG TGAGACCCTG CAGCGGGAGC TCTCCTACAG CAACCTCACC
GACCAGCCGG TCACCCTCAC GCTCACCCCC GCCCTGCGCA CCGTCGGCGG GACCGCGGTG
GAAGGCAGGC TGAGCACCGA TCCGACCCTC ACCGTGCCCG CGAACGGCAC CGCCACGGCG
ACCGTCACCC TCGACACCGC CGGCCTGGAG TTCGGCACCT ACACCGGCGC CGTGACGGCC
GAGGCCGACG GCGTCCGGCT GACCACCCCG GTGGGCACGG TGCGCGAGGC TCCGCTCGTC
CAGCTCACCG TCCGCACCAT CGGCAGGGAC GGCAAACCCG TCTCGCCCTG GTACCAGCAG
ACCCTCGACG TCGGCGGCGC GAAGGGATAC ATCGACGGCA CCGTGGTGTC GGACGAGGGG
ATCACGATCA CCCGGGTTCC GGTCGGCACG CACTCGGTGA TGCAGCTGGT GGCCGCGGTG
GACTCCGACG ACCGGCTCAA CGAGACGTTG CTGATCAACC CGGAGGTCAC CGTCACCGGT
GACACCGAGA TCACGCTCGA CGCCCGGCAG GCCTCCCAGG TCCGCTTCAG CACCCCGAAG
CCCGCCGAGC CGCTGAACAA CTTCTGGGTC GCCGCCACCC AGCGGACCAT CGCCAACGGC
CTGACATACG CCACCCAAAT GACCCCGGGC TCGTCGGCCA GCGCATGGGC GAAGCTCTGG
GTCACCCCCA CCAAGCCCGT CACCAAGGGC AAGTTCCGCT TCTCCGGCCA GTGGACGCTG
GGACAGGCCC AGGTCACCAT GAGCGCGGGC AAGCGCAAGC CGATCACCCT GGATCCGGTT
GCCCCCCAGC ACCAGGGCTT CTCGGACGGC CAACGCCAGA ACGAACTGCC CGACTGGAGG
CCGTTCAGCG GGACGCAGAA CCTGCCGCTC ATCGACGTCG GCGAGGGCCG GCCCGAGGAC
CTCGCCGGGA AGGACCTGCG CGGCAAGCTC GTCCTCATGG AGACGGGATC CACGGCCAAC
TATGACTTGA GCTGCTCCAC CATGATCACC CAGATCATCC CCATCCGTGA GGCCGGTGCG
GCCGGATTGG CGCTCTTCCC GTCAAAGAGC GGCTCCTGCT CCCTTCCGGT CAACATCTAC
CAGAAGATCA ACACCGGCGA TCCCAAGCCC GTCGGTATCC CGAACGTCTC CCTGTCGAAC
AAGGAGGGAC TGGAGCTGCG CGCCCGGCTC GCGAGCGAAC CGGTCAGCGT CCGGGTGACC
GGCACACCGG AAACGCCCTA CTCCTACTTC CTCAAGCCGT ACGCCGAGGG CAGCGTGCCG
AAGTCCCTGC ACTACACCTT CACCGGCAAG CAGCTCGCCC AGGTCGACAT GGACATCCAT
GCCACCCAGC CGACCAGCCA CAACAACTGG CGCATGATCT ACAAGCAGGA CGACGTGGCG
ACCACGGTGA CGGCCACCTC GATGTGGTCG CCGGTGGCCT TCACCGGCCC GACCAGCAGG
ACCGATTGGG TCGGGCCCAT CGATCCCGAG GTCATCAACT CGCACGGCAT GAGCTCCGCC
AGCCTCGACG GGACGGTGCC CTTCGAGACC CGGTTCCGCG TCGAGAAGCT CGACCGTCCC
GGCCGTACCC GGCAGCAGTG GTTCACCGCC GGCCTGACGC CGGGCGCCTT CACCGCGTCG
GAGAAGGTCT ACGCGATGGC CGACAAGGAC GCGCCGCCGC TCACCACGAT GGGCATCGAC
CTGGTGTGCA CGATCTGCGT GCAGGGTGAC ACCCTCTGGG CCGAGTTCGC CCCGGCAGGC
ACACCCGACA AGCATCCGAC CAGCGACGGA TTCTGGAAGT CCGACTCGCT CCTCACCCCG
GACTACGATC TCCACCTCTA TCGGGACAGC AAGGAGATCC CGCGCGTCAC GGTGCCCGGC
CTGGACAACC TCCCCGCGTT CACCCTGCCC GAGGGCGCCG GCACCTACCG GCTGACCGCG
AAGAACGCCC AGCACGACAC CGAGTGGACG TTCCGCGCCC CGCCGGCCAA GGAACACGTG
CTGCCGGGAT CGTTCTGCAA GTATTGGATC ATCGAGGGAG TCACCGAGCA GTGCCGGCCC
ACCCCGGTCG TCTTCGCCGG CTACGACCTG GGCGACACCC TGGCGATGGA CAACACCGTC
CGCGCCGGCC GATCCCACAC GTTCACCGTC GAGGCCTACC ACTCGCCGTC CGCGGCGAAG
ATGCCGAAGA TCGCCGGGCT CAAGCTGTGG ATGAGCACCG ACGACGGCGC CAAGTGGACG
CCCGTCCCGG TCAAGCGCGA CCGTGACGGC GCCTACACCG CCAGCACCCG CTACCCGTCC
CTGCGCGACA CCAAGGGCGC GGTCAGCCTG AAGGTCGAAG CCTGGGACGC CGAGGGCAAC
CGCCTCAAGC AGACCAGCAC CCGCCTGTTC AACCTGCGCT GA
 
Protein sequence
MKRTVLLTSL LALAATALPQ TPAALAAGPA QPPPAAPVTL QGVGPGPHVI TLVTGDKVTL 
TQAAADRFDV KTEPAARSDG RRPRLVTRTT PEGVYVLPTD AMPAIQAGRL DRRLFDVRYL
AENGYADDRT KQVPVIVQYP KDREGATLKR AADEIPASVP TKTLDSIHAS AVNVTKAEAG
TFWEAVRTAP SAGLKASDTL SGGIAKLWLD GKVKAVLDES VAMIGAPQAW AGGHDGTGVK
VAVLDTGIDA THPDFAGKIA TSQSFVPDAP VTDGHGHGTH VASTIAGSGA ASGGKYKGVA
PGAQLVVGKV LADDGSGQDS WIIEGMEWAA NSGSKVVSMS LGAGPSDGTD PISQAVNDLS
ASTGTLFVIA AGNSGALPGT VALPGAAEAA LTVAAVDKQD QMAYFSGRGP RFGDSGLKPD
IAAPGVDIAA ARATGTTMGT PVDDRYTEAS GTSMATPHVA GAAAIMAQQH PDWKGPLLKA
ALMSTSKDDG FTVYEQGAGR VDLAKAYTQR VFATTGGIDF GSVTEEGETL QRELSYSNLT
DQPVTLTLTP ALRTVGGTAV EGRLSTDPTL TVPANGTATA TVTLDTAGLE FGTYTGAVTA
EADGVRLTTP VGTVREAPLV QLTVRTIGRD GKPVSPWYQQ TLDVGGAKGY IDGTVVSDEG
ITITRVPVGT HSVMQLVAAV DSDDRLNETL LINPEVTVTG DTEITLDARQ ASQVRFSTPK
PAEPLNNFWV AATQRTIANG LTYATQMTPG SSASAWAKLW VTPTKPVTKG KFRFSGQWTL
GQAQVTMSAG KRKPITLDPV APQHQGFSDG QRQNELPDWR PFSGTQNLPL IDVGEGRPED
LAGKDLRGKL VLMETGSTAN YDLSCSTMIT QIIPIREAGA AGLALFPSKS GSCSLPVNIY
QKINTGDPKP VGIPNVSLSN KEGLELRARL ASEPVSVRVT GTPETPYSYF LKPYAEGSVP
KSLHYTFTGK QLAQVDMDIH ATQPTSHNNW RMIYKQDDVA TTVTATSMWS PVAFTGPTSR
TDWVGPIDPE VINSHGMSSA SLDGTVPFET RFRVEKLDRP GRTRQQWFTA GLTPGAFTAS
EKVYAMADKD APPLTTMGID LVCTICVQGD TLWAEFAPAG TPDKHPTSDG FWKSDSLLTP
DYDLHLYRDS KEIPRVTVPG LDNLPAFTLP EGAGTYRLTA KNAQHDTEWT FRAPPAKEHV
LPGSFCKYWI IEGVTEQCRP TPVVFAGYDL GDTLAMDNTV RAGRSHTFTV EAYHSPSAAK
MPKIAGLKLW MSTDDGAKWT PVPVKRDRDG AYTASTRYPS LRDTKGAVSL KVEAWDAEGN
RLKQTSTRLF NLR