Gene Sros_5216 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5216 
Symbol 
ID8668510 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5733510 
End bp5736425 
Gene Length2916 bp 
Protein Length971 aa 
Translation table11 
GC content71% 
IMG OID 
Productserine protease 
Protein accessionYP_003340731 
Protein GI271966535 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.415091 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.502606 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCGCA ACCCCCGCGT CTACAGCGCC ATGGCGCTCG CCGCCGCACT GGCGGTCACC 
TCCGCCGCGG CCCCGGCGCT CGCCGAACCC GCTGCGACAG GCACCCCCTC CGCCTACATC
GTCACCCTCG CCGGTCAGCC GCTCGCCACC TACGGCGGCG GCGTGGACGG CCTCGCCGCC
ACCAAGCCGG GCAAGGGCAA GAAGGTCGAC ACGGTCAGCG CCGGCGCCAA GCGCTACCGC
GAGCACCTGA CCAAGCGCCA GGACGAGACC GCCCGCAGCG TCGGCGCGAC CGCGCAGCGG
CACAACTCCG TCGCCTCCAA CAGCTTCGTC GCCGAGCTCA CCCCCGCTCA GGCCCTCAAG
CTGCACCGCA CCGGCGGCGT CGTGTCCGTC GTCCAGGACA CCCTGCGCAA GGCTCTGGAC
GACCGAAACT CCAGCGACTT CCTCGGCCTG TCAGGTGACA AGGGCATATG GGCCTCGCTC
GGCGGCACCG CCAAGGCGGG CAAGGGCATC GTGGTCGGCG TCATCGACAC CGGCGTCTGG
CCGGAGAACC CCTCCTTCGC CGGACCCGCG CTCGGAACCG AGGCGCCCAC CGCCGCCGAC
CCCTACCGGC CCTACCGCCA GGGCACGGCC ACGGTCATGA AGAAGGCCGA CGGCTCCACC
TTCACCGGCC TGTGCGAGAC CGGCACGGAG TTCACCGCCG ATCTGTGCAA TCAGAAGCTG
GTGAGCGCCA GGTACTTCGG CAAGGCCTGG CTGAAGGACA ACGACCCCGC CGCCACCGGC
GAGTACGCCT CGCCGCGCGA CCGGGGCGGG CACGGCTCCC ACACCGCAGG CACCGCCGCG
GGCAACCACG CCGTCCCCGC CACCGCCAAC GGCATCGACT TCGGCCAGAT CTCCGGCGTC
GCGCCCGGAG CAGCCGTCTC CGTCTACAAG GCGCTGTGGG AAGGCCCGGA CGGAGGCACC
GGCTACACCT CCGACATCAT CGAGGCCATC GACCAGGCCG TCGCCGACGG CGTCGACGTG
ATCAACTACT CTGTCGGCGG TTCGACGGAG TCCTCCACCG ACGACCCGGT CCAGCTGGCC
TTCCTGGCCG CCGCGGACGC CGGCATCTTC GTCGCCACCG CGGGCGGCAA CTCCGGACCC
GACGCCTCCA CCCTGGACAA CACCGCCCCG TGGACGACCA CCGTCGCCGC GAGCACCGTC
GCGCCCTATC TGGCCGACGT GCGGCTGGGC GACGGAAGCA CCTTCCGCGG CGCCAGCACC
ACCGTCAGCG CGCCGTTCGG GCCCAATCCG CTGGCCACCT CGGTGTCGGT GAAGAACGCC
GCCGCCTCCG ACTCCGACGC CCAGATCTGC GCGGAGGGGT CGCTGGACCC GGCCAAGGCC
GCCGGGAAGG TCATCTACTG CGTGCGCGGC GTCACCCCGC GCGTGGACAA GTCCGCCGAG
GTCAAGCGCG CCGGCGGCGT GGGCATGGTG CTGGGAAACC CCAGTGACCA GGACACCGGC
GCCGACGTGC ACGCCGTACC GACCGTACAC ATCAACACCC CCGACACCGA GAAGGTCCTC
GCCTACGCCG CCACCCCCGG CGCGACGGTG ACGCTGCTCC CCGCCTCCTC CACCGAGGGC
GCCGAATACC CGCAGGTCGC CAGCTTCTCC TCGCGCGGCC CCTCCCTCAG CAACAACGGC
GACCTGATCA AGCCCGACAT CGCCGCCCCC GGCGTGTCGA TCCTCGCCGC CGTCGCGCCC
CCCGGGAACC AGGGCAAGGA CTTCGACTTC TACTCCGGCA CCTCGATGGC CACCCCGCAC
ATCGCCGGCC TGGCCGCCCT GTACCTGGGC ACGGACCCGC TCCTGTCGCC CGCCGCCGTC
AAGTCGGCGA TGATGACCAC CGCCTACGAC ACCAAGACGC CCGACCTCTT CGCGCAGGGC
TCGGGTCACG TCGACCCCGC CCGCATGCTG AAGCCCGGCC TGGTCTACGA CGCCGCGGCC
CAGGACTGGT ACGGGTACCT GGAGGGGCTG GGCGTCAAGA CCGGCACCGG CGCCGCGCCC
GTCGCCACCA GCGACCTGAA CTACCCCAGC ATCGCGGTCG GCGCCCTGTT CGGCTCACGG
ACCGTCACCC GCAAGGTCAC GGCGCTGACC CCCGGCGTCT ACCACGCGGC CGTCGACCTG
CCGGGCATCA AGACCAAGGT CAAGCCCTCC ACCCTGGTCT TCAAGAAGGC GGGTGAGACC
AAGGAGTTCA CCGTCTCCAT GGAGATGACG CGCCAGACCG GCGGCGACGC CATCGTCGGC
TCGCTGACCT GGCAGGGTAA GAACACCGCC GTCCGCAGCC CCGTGATGGT CACCCCGCTG
AGCGCCAAGG CGCCCGCCGA GGTCAAGGGC GCGGGCTCGA ACGGCTCGCT CACCTTCGAC
GTGACCCCCG GCGTGTCGAA GTTCCCCGTC AAGACCTACG GCCCCGTCTC CGCCGACCCG
GTGCCCGGCA CCGTGAACCC CAGCGACATC TGGGGCAAGG AAGTCTCCGT CGTCGTGCCC
GAAGGCGCCA AGGCCGTCTC CTTCAAGCTC ATCCCCGGTG ACCCCGAGAC GGAGGTCGGC
GCCATCCTCG GCTACACCGA GGGCGGCGCG CAGCAGGGCC TGGCCTGGGT CACCTCGTGG
GAGGACTACC GCGCGGTCCT GGCCAACCCC AGGCCCGGCA AGTACACCCT GTTCGTGCTC
ACCTTCGGGA GCGCCGAGAC GCCGTTCAGC ACGCAGATCA ACGTCGTGGG CGCCGACTCC
GGGTCCGGCG CGCTGACGGT GACGCCCGAC AAGCCGAAGG TCACGCCGGG AACGGCGTTC
CCGCTGACGG CCACCTGGTC CGGCCAGCCT GACCGGACGG CGACCGGCTA CATCGAGTAC
CCCAACGAGG CCGGCACGAT CGTGACGATC AACTGA
 
Protein sequence
MSRNPRVYSA MALAAALAVT SAAAPALAEP AATGTPSAYI VTLAGQPLAT YGGGVDGLAA 
TKPGKGKKVD TVSAGAKRYR EHLTKRQDET ARSVGATAQR HNSVASNSFV AELTPAQALK
LHRTGGVVSV VQDTLRKALD DRNSSDFLGL SGDKGIWASL GGTAKAGKGI VVGVIDTGVW
PENPSFAGPA LGTEAPTAAD PYRPYRQGTA TVMKKADGST FTGLCETGTE FTADLCNQKL
VSARYFGKAW LKDNDPAATG EYASPRDRGG HGSHTAGTAA GNHAVPATAN GIDFGQISGV
APGAAVSVYK ALWEGPDGGT GYTSDIIEAI DQAVADGVDV INYSVGGSTE SSTDDPVQLA
FLAAADAGIF VATAGGNSGP DASTLDNTAP WTTTVAASTV APYLADVRLG DGSTFRGAST
TVSAPFGPNP LATSVSVKNA AASDSDAQIC AEGSLDPAKA AGKVIYCVRG VTPRVDKSAE
VKRAGGVGMV LGNPSDQDTG ADVHAVPTVH INTPDTEKVL AYAATPGATV TLLPASSTEG
AEYPQVASFS SRGPSLSNNG DLIKPDIAAP GVSILAAVAP PGNQGKDFDF YSGTSMATPH
IAGLAALYLG TDPLLSPAAV KSAMMTTAYD TKTPDLFAQG SGHVDPARML KPGLVYDAAA
QDWYGYLEGL GVKTGTGAAP VATSDLNYPS IAVGALFGSR TVTRKVTALT PGVYHAAVDL
PGIKTKVKPS TLVFKKAGET KEFTVSMEMT RQTGGDAIVG SLTWQGKNTA VRSPVMVTPL
SAKAPAEVKG AGSNGSLTFD VTPGVSKFPV KTYGPVSADP VPGTVNPSDI WGKEVSVVVP
EGAKAVSFKL IPGDPETEVG AILGYTEGGA QQGLAWVTSW EDYRAVLANP RPGKYTLFVL
TFGSAETPFS TQINVVGADS GSGALTVTPD KPKVTPGTAF PLTATWSGQP DRTATGYIEY
PNEAGTIVTI N