Gene Sros_8183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8183 
Symbol 
ID8671511 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp9026768 
End bp9030487 
Gene Length3720 bp 
Protein Length1239 aa 
Translation table11 
GC content73% 
IMG OID 
ProductSubtilisin-like protein serine protease-like protein 
Protein accessionYP_003343577 
Protein GI271969381 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.574663 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACTGGC TCGCGCGTCT TACCGCCGCC GGAGCGGCAC TCTCCCTCGC CCTACCGGCG 
TTACCGTCCG GAGCCGAGAC CGCTTCCGCC CCGCCCCCGG CCGCCGGTCT CCACCGGATC
ACCCTGATCA CCGGCGACGT GGTGACCGTC CGCCAGGCCG CGGCCGGGAA GTGGGCCGTC
ACGGTCGATC CCGCCAAGGG CCGGGAGAAG GTCAGGTTCA CCACCCACGA CCTCGACGGG
AAGCTTCAGG TGCTGCCCGA CGACATGGTC CCGTACGTGG CCGAGAACCT GGTCGACAAG
GGTCTTTTCG CCGTCGGCGA CCTGATCGAG CAGGGGTACG ACGACGCCGG CAGCCGCACG
CTGCCGCTGC TGCTCGCCTA CGACAAGGGC GCACGGACCT CCGGCGCCCT GCCCGGCGCC
AAGGCGGGGA CCGAGCTGGA AAGCATCGGC GCGCGGGCGG TGGACGCCGG CAAGGGCGAG
CTGGCCGCGT TCTGGAAGGC CGTCGAGGCG GCGCCGTCGG CCCGCTCCGG CGGGCCCCGG
CTGGCCTCGG GAATCCGGAA GATCTGGCTG GACGCCAAGG TCACCGCGGA CCTGGAGCAC
AGCGTTCCGC AGATCGGCGC CCCCGACGCC TGGAAGAGCG GGTTCGACGG CAAGGGCGTC
AAGGTGGCCG TCCTCGACAC CGGCGCCGAC GAGAACCATC CCGACCTCGC CGGAAAGATC
ACCGACAGGC GCAACTTCAC CGCCGACCCC TCGACCCAGG ACGGCCACGG GCACGGCACC
CACGTGGCGA CCACCGTCGC GGGCCTCGGG ACGGCCTCCC AGGGCCGCCG CAAGGGCGTG
GCCCCCGGCG CCGAGCTGAT CATCGGCAAG GTGCTGGACA GCGGCGGCTC GGGCCAGTTC
TCCCAGATCA TCGAGGGCAT GGAGTGGGCC GCGGCGTCCG GCGCCGACGT GGTGAACCTG
AGCCTCGGCG GCGAGGCGAC CGACGGCACC GACCCCGCCA GCGCCGCCCT GAACGCGCTG
ACCGAGCAGA CCGGCACCCT CTTCGTCGTC GCGGCCGGCA ACGAGGGCCG GGAGTACGCC
GTCGGCACCC CCGGCGCCGC GACCTCCGCG CTGACCGTCG GCGCGGTCGG CGCCGACGAG
ACGCTCGCCC CCTTCTCCAG CCGTGGCCCG CGTCTCGACG GAGGCGCCAA GCCGGACATC
ACCGCCCCGG GAGTGGCGAT CGTGGCCGCG CGGGCCGAGG GCACCTCGAT GGGCCAGCCC
GCCGACGAGC GCTACACCGC CGCGTCCGGG ACCTCGATGG CGACCCCGCA CGTGGCCGGC
GCCGCCGCGA TCCTCAAGCA GCGGCACCCG GACTGGAAGG CCAAGCAGCT CAAGGACGCC
CTGATCTCGA CCGCCAGGAC CGCGCGGGAT CTGACCGTCT ACGAGCAGGG CGGCGGCCGG
GTGGACGTCG CGCGCGCCGT ACGGCAGGAC GTCACGGCCA CCGGCGTGCT CGACCTGGGC
ACCCACCAGG ACGGCGGCTC CACGGCCCCC TCCGGGACCG TCGCCTACAC CAACTCCACC
CAGGCGGCCG TCTCCCTGGC GCTGACCGCC ACCCTGAGCA ACCTGGACGG TGACGCCCCC
GCGCAGGGCG CGCTCACCCT GGGGTCGGCG TCGATCACCG TCGACGCCGG CGCCACGGTC
ACGGTCCCGG TGAGCGCCGA CCTGGCGAAG CTGGCCCACG GCCGGCACTC CGGCCATCTC
ACCGCGACCA CCGCGGACGG CTCGGTCGCC CTGCAGACCA CGCTGGCGCT GACCAGGAGC
CCCCGCACCC ACAAGGTGCG CATCAGCGCC GTCGGCAAGG ACGGCAGGCC GGCGAGCGTG
AGCACCGTCT TCATGTTCGG CCCCGGCACG CGGGACAACC TCCTCACCTA CATCATGCCG
TGGGAGGCCG AGGAGGGGAA AACCTTCGAG GTCCCCGAGG GCACCTACTA CGCGCAGAGT
CAGTTCGGCG AGGAACGGTC GGGCTCGCGC GTCATCGACA TCAAGATCGA CATCCCCGAG
TTCCCGGTGA CCGGCGACGC GGAGCTGGTG TTCGACGCGC GGAAGACGCG GCCGATCGTG
ATCAAGACGC CGCAGCCGGC GGTCCAGGAG GGCATCTCCA CGTTCGCCAG CTACCGCGAC
ACCGGCACCC GGAAGATCTC CTCGTCGTTC ATGAACTTCC CCTCGGTCGA CGAGCTGCAC
GTGGCCGAGA CGCAGCCGGT CCGCCAGGGT GCCTTCGAGT TCACCTCGCG CTGGCAGTAC
GGCGCGCCGA GGCTGTCGTC ACGGGTCAGC GGCCTCAAGG GGCCGCTGGA CCTCTCCCCG
ACGGTGAGGT CACCGGAGTG GAACGGCAGG TACCGGTGGG AGCTCGTCGA CGGTGGACAC
GGCACCCCCG AGGAGCTCGG CGCTCTCAAG CTGCGCGGCC GGGCGGTCGT CATGTCGAGC
GCCTCGCAGG ACGACCCCCA GTGGGACGAG ATGATCGCCG CCGCCGCCGA GGCGGGCGCC
GCGGCGGCCG TCGTCGTACC GGCCGCCGAC GACAGCCCGT GGCAGTACTG GTCGCCGGTC
ATGGACAGGC AGGCGATCCC GGCCGCCGCG ATCCCCTATG AGCAGGGGAA GAAGCTGCTG
GAGCGGGTCC GCAAGGGCAA GGCCGTCCTC GACGTGACCG GGAACATGGC CATCCCCTAC
CTCTACGACA TCTCGCAGGT CTCCAAGGGG CGGATCCCGG AGCAGATCGT CTACGAGGCC
AACGCCTCCA ACCTGGCCCG GGTGGACACC GGCTACCACG AGACCGGCGG CTTCGGCTGG
GCCAAGGAGC AGCGGTTCGG CTGGCGGCCG TGGCAGGTCT TCTCCAACGA GGGACAGCGG
TGGGTCCGCA CGGGATCCGC GCGCGCCGAG TACGTCACCT CGGGCGACAC CGAGTGGGAG
CACGTGGCGC AGCACATGTT CACCTGGGAG TCGATGCGCC CGCTCACCCC CGGGCTCACC
GGCGGGCTGC GCAGCTACGC CGCCGGTGAG AAGGTCGGCG AGCGATGGTT CGGCCCCGTC
GTGCGGCCGG CGGTCCCGCC GGGCAGGCCG GAGTCCGTCC CGACCCGGAC GGGGGACACC
CTGAGGCTCG ACATCCCCGA GTTCGTCGAC GCCGCCGGCC ACTACGGCTA CGCGTTCAGC
TCCGACGAGG AGGACACCGT CTCGGCCCGC TTCTACCGCG ACGGCACGCT CGTCGAGGAG
CCGCGACGGG CCGTGGGGAA CTTCCCCGCC GTTCCCGGGA AGGCCACCTA CCGGATGGAG
CTGTCCACCA AGCGCTCGTC GGAGGAGTGG ACGTACGCCA CCGAGACCAG CACGGCGTGG
ACGTTCGGGT CCGCCCGCCC GAAGTCCGGC TCCGAACCGC TGCCGCTGCT CGGCGTCGAC
TACACCGTGC CCGCCGACCT CGACGGCCGG GTCCGCCGGA CGCTGCCGGT CCCGCTGGAC
TTCAGCGTGC GGAGCACCGC GCCCGGCCTC GCCCTGCGGC AGGTCACGGC CGAGCTCTCC
TATGACGACG GAAAGAGCTG GAAGCGCCTG GTGCTGCTGC CCCGCGGCAA GGACCGCTAC
TCCACCCTGG TCTCCCACCA GGCGGGCAGG GGACAGTACG TCTCGCTTCG GGTCACCGCG
AACGACGCCG CCGGGAACGC GGTCGAGCAG ACCGTGCTCC GCGCCTACGG CGTGAAGTAG
 
Protein sequence
MHWLARLTAA GAALSLALPA LPSGAETASA PPPAAGLHRI TLITGDVVTV RQAAAGKWAV 
TVDPAKGREK VRFTTHDLDG KLQVLPDDMV PYVAENLVDK GLFAVGDLIE QGYDDAGSRT
LPLLLAYDKG ARTSGALPGA KAGTELESIG ARAVDAGKGE LAAFWKAVEA APSARSGGPR
LASGIRKIWL DAKVTADLEH SVPQIGAPDA WKSGFDGKGV KVAVLDTGAD ENHPDLAGKI
TDRRNFTADP STQDGHGHGT HVATTVAGLG TASQGRRKGV APGAELIIGK VLDSGGSGQF
SQIIEGMEWA AASGADVVNL SLGGEATDGT DPASAALNAL TEQTGTLFVV AAGNEGREYA
VGTPGAATSA LTVGAVGADE TLAPFSSRGP RLDGGAKPDI TAPGVAIVAA RAEGTSMGQP
ADERYTAASG TSMATPHVAG AAAILKQRHP DWKAKQLKDA LISTARTARD LTVYEQGGGR
VDVARAVRQD VTATGVLDLG THQDGGSTAP SGTVAYTNST QAAVSLALTA TLSNLDGDAP
AQGALTLGSA SITVDAGATV TVPVSADLAK LAHGRHSGHL TATTADGSVA LQTTLALTRS
PRTHKVRISA VGKDGRPASV STVFMFGPGT RDNLLTYIMP WEAEEGKTFE VPEGTYYAQS
QFGEERSGSR VIDIKIDIPE FPVTGDAELV FDARKTRPIV IKTPQPAVQE GISTFASYRD
TGTRKISSSF MNFPSVDELH VAETQPVRQG AFEFTSRWQY GAPRLSSRVS GLKGPLDLSP
TVRSPEWNGR YRWELVDGGH GTPEELGALK LRGRAVVMSS ASQDDPQWDE MIAAAAEAGA
AAAVVVPAAD DSPWQYWSPV MDRQAIPAAA IPYEQGKKLL ERVRKGKAVL DVTGNMAIPY
LYDISQVSKG RIPEQIVYEA NASNLARVDT GYHETGGFGW AKEQRFGWRP WQVFSNEGQR
WVRTGSARAE YVTSGDTEWE HVAQHMFTWE SMRPLTPGLT GGLRSYAAGE KVGERWFGPV
VRPAVPPGRP ESVPTRTGDT LRLDIPEFVD AAGHYGYAFS SDEEDTVSAR FYRDGTLVEE
PRRAVGNFPA VPGKATYRME LSTKRSSEEW TYATETSTAW TFGSARPKSG SEPLPLLGVD
YTVPADLDGR VRRTLPVPLD FSVRSTAPGL ALRQVTAELS YDDGKSWKRL VLLPRGKDRY
STLVSHQAGR GQYVSLRVTA NDAAGNAVEQ TVLRAYGVK