Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_5216 |
Symbol | |
ID | 8668510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 5733510 |
End bp | 5736425 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | serine protease |
Protein accession | YP_003340731 |
Protein GI | 271966535 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.415091 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.502606 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCGCA ACCCCCGCGT CTACAGCGCC ATGGCGCTCG CCGCCGCACT GGCGGTCACC TCCGCCGCGG CCCCGGCGCT CGCCGAACCC GCTGCGACAG GCACCCCCTC CGCCTACATC GTCACCCTCG CCGGTCAGCC GCTCGCCACC TACGGCGGCG GCGTGGACGG CCTCGCCGCC ACCAAGCCGG GCAAGGGCAA GAAGGTCGAC ACGGTCAGCG CCGGCGCCAA GCGCTACCGC GAGCACCTGA CCAAGCGCCA GGACGAGACC GCCCGCAGCG TCGGCGCGAC CGCGCAGCGG CACAACTCCG TCGCCTCCAA CAGCTTCGTC GCCGAGCTCA CCCCCGCTCA GGCCCTCAAG CTGCACCGCA CCGGCGGCGT CGTGTCCGTC GTCCAGGACA CCCTGCGCAA GGCTCTGGAC GACCGAAACT CCAGCGACTT CCTCGGCCTG TCAGGTGACA AGGGCATATG GGCCTCGCTC GGCGGCACCG CCAAGGCGGG CAAGGGCATC GTGGTCGGCG TCATCGACAC CGGCGTCTGG CCGGAGAACC CCTCCTTCGC CGGACCCGCG CTCGGAACCG AGGCGCCCAC CGCCGCCGAC CCCTACCGGC CCTACCGCCA GGGCACGGCC ACGGTCATGA AGAAGGCCGA CGGCTCCACC TTCACCGGCC TGTGCGAGAC CGGCACGGAG TTCACCGCCG ATCTGTGCAA TCAGAAGCTG GTGAGCGCCA GGTACTTCGG CAAGGCCTGG CTGAAGGACA ACGACCCCGC CGCCACCGGC GAGTACGCCT CGCCGCGCGA CCGGGGCGGG CACGGCTCCC ACACCGCAGG CACCGCCGCG GGCAACCACG CCGTCCCCGC CACCGCCAAC GGCATCGACT TCGGCCAGAT CTCCGGCGTC GCGCCCGGAG CAGCCGTCTC CGTCTACAAG GCGCTGTGGG AAGGCCCGGA CGGAGGCACC GGCTACACCT CCGACATCAT CGAGGCCATC GACCAGGCCG TCGCCGACGG CGTCGACGTG ATCAACTACT CTGTCGGCGG TTCGACGGAG TCCTCCACCG ACGACCCGGT CCAGCTGGCC TTCCTGGCCG CCGCGGACGC CGGCATCTTC GTCGCCACCG CGGGCGGCAA CTCCGGACCC GACGCCTCCA CCCTGGACAA CACCGCCCCG TGGACGACCA CCGTCGCCGC GAGCACCGTC GCGCCCTATC TGGCCGACGT GCGGCTGGGC GACGGAAGCA CCTTCCGCGG CGCCAGCACC ACCGTCAGCG CGCCGTTCGG GCCCAATCCG CTGGCCACCT CGGTGTCGGT GAAGAACGCC GCCGCCTCCG ACTCCGACGC CCAGATCTGC GCGGAGGGGT CGCTGGACCC GGCCAAGGCC GCCGGGAAGG TCATCTACTG CGTGCGCGGC GTCACCCCGC GCGTGGACAA GTCCGCCGAG GTCAAGCGCG CCGGCGGCGT GGGCATGGTG CTGGGAAACC CCAGTGACCA GGACACCGGC GCCGACGTGC ACGCCGTACC GACCGTACAC ATCAACACCC CCGACACCGA GAAGGTCCTC GCCTACGCCG CCACCCCCGG CGCGACGGTG ACGCTGCTCC CCGCCTCCTC CACCGAGGGC GCCGAATACC CGCAGGTCGC CAGCTTCTCC TCGCGCGGCC CCTCCCTCAG CAACAACGGC GACCTGATCA AGCCCGACAT CGCCGCCCCC GGCGTGTCGA TCCTCGCCGC CGTCGCGCCC CCCGGGAACC AGGGCAAGGA CTTCGACTTC TACTCCGGCA CCTCGATGGC CACCCCGCAC ATCGCCGGCC TGGCCGCCCT GTACCTGGGC ACGGACCCGC TCCTGTCGCC CGCCGCCGTC AAGTCGGCGA TGATGACCAC CGCCTACGAC ACCAAGACGC CCGACCTCTT CGCGCAGGGC TCGGGTCACG TCGACCCCGC CCGCATGCTG AAGCCCGGCC TGGTCTACGA CGCCGCGGCC CAGGACTGGT ACGGGTACCT GGAGGGGCTG GGCGTCAAGA CCGGCACCGG CGCCGCGCCC GTCGCCACCA GCGACCTGAA CTACCCCAGC ATCGCGGTCG GCGCCCTGTT CGGCTCACGG ACCGTCACCC GCAAGGTCAC GGCGCTGACC CCCGGCGTCT ACCACGCGGC CGTCGACCTG CCGGGCATCA AGACCAAGGT CAAGCCCTCC ACCCTGGTCT TCAAGAAGGC GGGTGAGACC AAGGAGTTCA CCGTCTCCAT GGAGATGACG CGCCAGACCG GCGGCGACGC CATCGTCGGC TCGCTGACCT GGCAGGGTAA GAACACCGCC GTCCGCAGCC CCGTGATGGT CACCCCGCTG AGCGCCAAGG CGCCCGCCGA GGTCAAGGGC GCGGGCTCGA ACGGCTCGCT CACCTTCGAC GTGACCCCCG GCGTGTCGAA GTTCCCCGTC AAGACCTACG GCCCCGTCTC CGCCGACCCG GTGCCCGGCA CCGTGAACCC CAGCGACATC TGGGGCAAGG AAGTCTCCGT CGTCGTGCCC GAAGGCGCCA AGGCCGTCTC CTTCAAGCTC ATCCCCGGTG ACCCCGAGAC GGAGGTCGGC GCCATCCTCG GCTACACCGA GGGCGGCGCG CAGCAGGGCC TGGCCTGGGT CACCTCGTGG GAGGACTACC GCGCGGTCCT GGCCAACCCC AGGCCCGGCA AGTACACCCT GTTCGTGCTC ACCTTCGGGA GCGCCGAGAC GCCGTTCAGC ACGCAGATCA ACGTCGTGGG CGCCGACTCC GGGTCCGGCG CGCTGACGGT GACGCCCGAC AAGCCGAAGG TCACGCCGGG AACGGCGTTC CCGCTGACGG CCACCTGGTC CGGCCAGCCT GACCGGACGG CGACCGGCTA CATCGAGTAC CCCAACGAGG CCGGCACGAT CGTGACGATC AACTGA
|
Protein sequence | MSRNPRVYSA MALAAALAVT SAAAPALAEP AATGTPSAYI VTLAGQPLAT YGGGVDGLAA TKPGKGKKVD TVSAGAKRYR EHLTKRQDET ARSVGATAQR HNSVASNSFV AELTPAQALK LHRTGGVVSV VQDTLRKALD DRNSSDFLGL SGDKGIWASL GGTAKAGKGI VVGVIDTGVW PENPSFAGPA LGTEAPTAAD PYRPYRQGTA TVMKKADGST FTGLCETGTE FTADLCNQKL VSARYFGKAW LKDNDPAATG EYASPRDRGG HGSHTAGTAA GNHAVPATAN GIDFGQISGV APGAAVSVYK ALWEGPDGGT GYTSDIIEAI DQAVADGVDV INYSVGGSTE SSTDDPVQLA FLAAADAGIF VATAGGNSGP DASTLDNTAP WTTTVAASTV APYLADVRLG DGSTFRGAST TVSAPFGPNP LATSVSVKNA AASDSDAQIC AEGSLDPAKA AGKVIYCVRG VTPRVDKSAE VKRAGGVGMV LGNPSDQDTG ADVHAVPTVH INTPDTEKVL AYAATPGATV TLLPASSTEG AEYPQVASFS SRGPSLSNNG DLIKPDIAAP GVSILAAVAP PGNQGKDFDF YSGTSMATPH IAGLAALYLG TDPLLSPAAV KSAMMTTAYD TKTPDLFAQG SGHVDPARML KPGLVYDAAA QDWYGYLEGL GVKTGTGAAP VATSDLNYPS IAVGALFGSR TVTRKVTALT PGVYHAAVDL PGIKTKVKPS TLVFKKAGET KEFTVSMEMT RQTGGDAIVG SLTWQGKNTA VRSPVMVTPL SAKAPAEVKG AGSNGSLTFD VTPGVSKFPV KTYGPVSADP VPGTVNPSDI WGKEVSVVVP EGAKAVSFKL IPGDPETEVG AILGYTEGGA QQGLAWVTSW EDYRAVLANP RPGKYTLFVL TFGSAETPFS TQINVVGADS GSGALTVTPD KPKVTPGTAF PLTATWSGQP DRTATGYIEY PNEAGTIVTI N
|
| |