Gene Information |
Locus tag | Sros_0871 |
Symbol | |
ID | 8664143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 891897 |
End bp | 895133 |
Gene Length | 3237 bp |
Protein Length | 1078 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | Subtilisin-like protein serine protease-like protein |
Protein accession | YP_003336624 |
Protein GI | 271962428 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
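The coordinate and length fields above constrain one another, so they can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates (a common convention for such records, but not stated here) and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(891897, 895133)
print(length)                  # 3237
print(protein_length(length))  # 1078
```

Both values agree with the Gene Length and Protein Length fields of this record.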
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGAAC GCAGGAAGAG ATCATTTCCC CTCACGCAAA GGCAGATCAT GAGTTTAGGT AAGTCGTTCC TCGCCGCCGC CATGGCAGGA ACGATCGTGG GCATCACACC GGCACCAGCG CCGGCGCAGA CACCGCAGCC TCCCTCCCCG CCGGCGGACG GGGTGACCCT GATCACCGGT GACCGTGTCG TGGTCACCGG GCACGGGCAC CGGGTGGAGC CCGGACCCGG CCGGCAGGAG GTCGGCTTCA CGAGCCAGGT ACGTGAGAAG CACCTGTACG TGATTCCGTC CGACGCTCAG CCCCTGGTCG CTCAGGGGGT GCTCGACCGG CGGCTGTTCG ACGTCACCCA ACTGCTGCAG TGGCGGTACG GGGACGCCGA GATCCGCGAC ATCCCGCTGA TCACGCGGTC GGACGCGGGC CCGGCCCCTG CGCTCAGGGG CGCGCAGGGT ACCCGGCGGC TCGCCGGCCT TGGCATGACC ACGCTCCGCC TGCCCAAGAG CGACGCGGCT CGGGCGTGGA AGGAGATGAC GGGCGGCGGC CGTACGCCGG CCGCGGGCAC GACGAAGATC TGGCTGGACG GCCGCCGGTC CTTCAGCCTC GACCGGAGCA CCGAACAGAT CGGCGCCACC GAGGCGTGGA AGCAGGGGAT GACCGGCGAG GGCGTCACGG TCGCCGTCCT CGACTCCGGC TACGACCCCG GCCATCCGGA CCTGAAGGGC GTGGTGGCGC AGGAACGCAA CTTCAGCGAG GAGCCCGACA TCCGCGACAA CCACGGTCAC GGCACCCATG TCGCCTCGAC CGTCGCGGGT AACGGCGAGA AGTACCGGGG AGTGGCTCCC GGCGCACGGC TGGCCATCGG AAAGGTGGGC GACAGGTTCG GCGCCTCCGA GTCCGCCATC CTGGCGGGCA TGGAGTGGGC CTCCCTTGAG GTCAAGGCCA AGGTCGTCAA CTTCAGCATG GGTGCCCCCG ACCAGCCCGA GATCGACCCC GTGGAGCAGG CGGTGAACAC GCTGTCGGCG GAGACGGGCA CGCTGTTCGT CGTCGCCGCG GGGAACGACG GCGGGAGGCG GCCGGTGAGC AGCCCCGCCA GCGCCGACGC CGCCCTCGCG GTCGGCGCCG TCGACAGGCA GGATCGGGTG GCCGGGTTCT CCAGCACCGG CCCCCGCGTA GGCGACCACG CCGTCAAACC GGACCTCACC GCACCGGGGG TCGCCATCGT CGCGGCGGCG GCCGAGGGCA CCGCCGACGG CGCTCACGTC GAGATGAGCG GCACCTCCAT GGCCGCACCG CACGTGGCCG GGGCTGCGGC CATCCTCGCC CAGCGGCACC CCGGCTGGAC CGGGCAGCAG CTCAAGGCCG CTCTGGTCGG CAGCGCCGCT CCCTCGTCCG GCGCCACGCC GTTCCAGCAG GGCACCGGAC GGGTGGACGT GGTCCGCGCC CTGAAGCAGC AGGTCGTGGC CCAGACGGCG GGCACCTGGG CCGTCTTCCC CTGGGACGGT CCGGACGGAC GCAAGAAGAC CGGGACCGTC ACCTACACCA ACTCCGGCGA CGCCCCGGTC AGCCTCGACC TGACCGTCGA GGGAGAAGTG CTCGAACTCG GCACCCGGCG GCTCGACGTG CCCGCCGGAG GACAGGCATC GGTCACGCTC AGCATCGACG CGAGCGGCAA GGCCCCCGGC GACTACGCCG GGACGATCAC CGCCACCTCG GGCGACAGCG TGATCCGCAC CCTGGCGGGT GCGTACGTCG AGCCCGAGTC CTACGACGTC ACCATCGCCG CCATCGGCAA GCAGGGCCAA 
CCGGTCGACC CCTGGTCGGC TGAGATCTAT GACGCGAAGA CGGGGGCCGT CACCGAGCCG TTCTTCCGGA ACGGCATGGC CACGGTACGG CTCCCCAAGG GCGACTGGGA CCTCTACACC TGGATCGCAG AGAGGATCGA CGGGAAGCTG AACGTCACCG CCGCCAACTC CCCGCTGAAG GTCGATGGAG GCAGCCGCCG GCTGACGGTG GACGCACGCC AGGGCAAGGC GACCAAGGTC ACACTCGACG ATCCGACCGC CACGCCTCGG CGTGGTTTCG ATCTCGGGAT GGCCCACGGC GCATGGAATT CGTGGTCGTC GACGAACATG GACGCCAACA CCGAGCTCTT CGTCGTGCCG GTTCACCGGC CAGGCCTGAC CTACACGTTG AGAACCACGT GGCTGAGCAA GGACGTGTCT CCCAGTCCCT ACGTCTACGA CCTCGTCGAC CGTCGCACCG ACGGCGTTCC CGAGAATCCC GTCTACGACG CCAGGCAGAA GGATCTGGCG AAGGTCTCCG CGACCTACCG GGCCTCGGGA GTGGCGGCCT TGGGGACGCC GATGGCCGGA CTACAGGTCG GGGGCTTCCT GGGCTCGTTC CTGGCACCAC TGGTCGGTGA CATCCCCCTG CCCGGCACGC TCATCCACTA CCGGACCCCC GGGCTGACCT ACGAAAGCGG ACTTCAGGTC GGCACCTCCC TGACCTTCGA CGGCGGCAAG CTCATGAAGC GCGGGCAGAC CAGTGAAGTC TGGAACACCG CGGTCACCGG TCCGTCGTTC CTGCTGCCCG GCGGCAGCCG TACCGGCGAC AAGCTGACCT TCTCCGCGGT GGGGCTGTTC ACCGACGGGG GCCCGGGAAG AACGGGCTCG GACACCGCCG CCACCGGCAC CGCCACCCTC GCCAGAGACG GGAAGGTGCT GGCCAAGGCC GACATCGCCG ACTGCGAGGT CTACCGGAGG GAAGGGTGCG AGCTCCACGC CGACCTTCCC GCCGGGTCCG GCGCCTACAC GCTGACCGCG TCGATGCGCA GGCAGGTTCC GCACTCGACA CTCTCCACCG GCGTGGAGTC CGTATGGAGG TTCCGGTCCG CGACCACGGC GAAGGGGCTG CCACTGCCGC TGACGGCGGT CCGTTACAGC CCCGCCGGCC TGGATGAGTC CAATCGCGCC AAGCCGGGCA GCGTGACCCG TCTTCCCCTG TGGATCGAGC GCAACCCCGG CTCCACCGGG GCGGCGATCG AGTCGGTCCA GGTGGAGATG TCCATCGACG ACGGGGCGAA GTGGCGTCGT ATCCCGATCG TCCGCACCGG CTCGGGCTGG ACCGCCGCGC TGCCGAACCC GCGCACGCCC GGATTCGTCT CCCTCCGCGC GGTGGTGACC GACACGGCGG GCACCGGCCT GACCCAGACG ATCACTCGCG CCTACGCCGT CGGCTGA
|
Protein sequence | MAERRKRSFP LTQRQIMSLG KSFLAAAMAG TIVGITPAPA PAQTPQPPSP PADGVTLITG DRVVVTGHGH RVEPGPGRQE VGFTSQVREK HLYVIPSDAQ PLVAQGVLDR RLFDVTQLLQ WRYGDAEIRD IPLITRSDAG PAPALRGAQG TRRLAGLGMT TLRLPKSDAA RAWKEMTGGG RTPAAGTTKI WLDGRRSFSL DRSTEQIGAT EAWKQGMTGE GVTVAVLDSG YDPGHPDLKG VVAQERNFSE EPDIRDNHGH GTHVASTVAG NGEKYRGVAP GARLAIGKVG DRFGASESAI LAGMEWASLE VKAKVVNFSM GAPDQPEIDP VEQAVNTLSA ETGTLFVVAA GNDGGRRPVS SPASADAALA VGAVDRQDRV AGFSSTGPRV GDHAVKPDLT APGVAIVAAA AEGTADGAHV EMSGTSMAAP HVAGAAAILA QRHPGWTGQQ LKAALVGSAA PSSGATPFQQ GTGRVDVVRA LKQQVVAQTA GTWAVFPWDG PDGRKKTGTV TYTNSGDAPV SLDLTVEGEV LELGTRRLDV PAGGQASVTL SIDASGKAPG DYAGTITATS GDSVIRTLAG AYVEPESYDV TIAAIGKQGQ PVDPWSAEIY DAKTGAVTEP FFRNGMATVR LPKGDWDLYT WIAERIDGKL NVTAANSPLK VDGGSRRLTV DARQGKATKV TLDDPTATPR RGFDLGMAHG AWNSWSSTNM DANTELFVVP VHRPGLTYTL RTTWLSKDVS PSPYVYDLVD RRTDGVPENP VYDARQKDLA KVSATYRASG VAALGTPMAG LQVGGFLGSF LAPLVGDIPL PGTLIHYRTP GLTYESGLQV GTSLTFDGGK LMKRGQTSEV WNTAVTGPSF LLPGGSRTGD KLTFSAVGLF TDGGPGRTGS DTAATGTATL ARDGKVLAKA DIADCEVYRR EGCELHADLP AGSGAYTLTA SMRRQVPHST LSTGVESVWR FRSATTAKGL PLPLTAVRYS PAGLDESNRA KPGSVTRLPL WIERNPGSTG AAIESVQVEM SIDDGAKWRR IPIVRTGSGW TAALPNPRTP GFVSLRAVVT DTAGTGLTQT ITRAYAVG
|
| |
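The Gene sequence and Protein sequence fields can be related by translating the former with translation table 11, and the GC content field can be recomputed from the same string. The following is a minimal sketch in plain Python, not the method used to produce this record. It assumes the sequence is pasted as shown (blocks of ten bases separated by whitespace) and uses the codon assignments of the NCBI standard code, which table 11 shares (table 11 differs in its permitted start codons). The helper names are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
print(translate("ATGGCCGAAC GCAGGAAGAG ATCATTTCCC"))  # MAERRKRSFP
```

Passing the full Gene sequence value to `translate` and `gc_fraction` would allow a comparison against the Protein sequence and GC content fields. Note that this sketch translates an alternative start codon such as GTG or TTG literally rather than as methionine, which does not matter here because the gene starts with ATG.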