Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_4602 |
Symbol | |
ID | 8667896 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 5117119 |
End bp | 5121141 |
Gene Length | 4023 bp |
Protein Length | 1340 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Subtilisin-like protein serine protease-like protein |
Protein accession | YP_003340206 |
Protein GI | 271966010 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.823259 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0573734 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGCTGG GCACCCTGGT GGCGCCGGGC ACGGCCGCCT TCGCCCAGAA ACAGGAGCCC GCCCCCGCCG CCCCGGCGAC GGGACCGGTC ACCGGCCCGA TCGCGCTGAC CGGACTGGGC CAGGGTGACC GCACGGTCAC CCTGATCACC GGGGACAAGG TGAGGTTGAG CGACGCCGGC GGCGGCAAGT ACGCCGTGCG CCCCGAGGCG AGCGTCCGCC CCGACGGCAC CAGCGCCCAG CTGTCCACGC TGGCCACCCC GAAGGGCGTG TACGTCACCC CCAGCGACGC CGTGCCCGCC ATCAACGCCG GCCGCCTGGA CCGGGAGCTG TTCAACGTCA CCTACCTGGC CGAGAACGGC TACACCGACG ACAAGACCAA GCAGCTGCCC GTCATCGTGC AGTACCCGCG TCAGCACACG GACAAGGCCG TCGCCTCGGC CGCCAAGGCC ATCCCGGCCA GCGAGCCCGT GCTCACCCTG GAGAGCGTCA ACGCCTCGGC GCTCCAGGTC ACCAAGGCCG AGGCGGGCAC GTTCTGGAAC GCGGTGCGGG CCGTACCGGA CAAGAAGAGC GCGAACGGGA TCGCCGGGCC GGCCAACGTG CCGGACACGC TGCGCGGCGA CATCGCCAAG GTGTGGCTGG ACGGCAAGGT GAAGGCCGAC CTCGCCGAGA GCGTCCCGAT GATCGGCGCG CCGCAGGCGT GGGCCGGCGG CCACGACGGC GCCGGCGTGA AGGTCGCTGT CCTGGACACC GGCGTCGACG CCAAGCACCC CGACCTGGCC GACAGGATCG TCGACAGCAG GTCCTTCATC CCCGGCCAGG AGGTGCAGGA CGGCCACGGC CACGGCACCC ACGTCGCCTC CACCATCGCC GGCTCCGGCG CGGCGGGGGG CGGCAAGCAC AAGGGCGTCG CGCCCGGCGC CCAGCTGATC GTCGGCAAGG TGCTGGCCAA CGAGGGCTCC GGCAGCGACT CCCAGATCAT CGAGGGCATG GAGTGGGCTG CGGCCTCCGG CGCCAAGGTG ATCAGCATGA GCCTCGGCGG CGGCGCCTCC GACGGGACCG ACCCGATGAG CCAGGCGGTC AACGTGCTCA GCGCCTCCAC CGGCGCGCTG TTCGTGATCG CCGCGGGCAA CGCCGGCGCG TCGGGCGCCG AGACCGTCGC CACCCCCGGC ACGGCCGACG CCGCGCTGAC CGTGGCCGCC GTCGACAAGA GCGACGCGTG GGCCACTTTC TCCAGCCAGG GACCGCGTGT CGGCGGTGGC CTCAAGCCGG ACATCGCCGC TCCTGGCGTG GCCATCGCCG CTGCGCGCGC CGCCGGCACG ACCATGGGCA CCCCGCTCGA CGAGCACTAC ACGGCCGCGA ACGGCACCTC GATGGCCACC CCGCACGTCG CCGGCGCCGC CGCCATCATG ACGCAGCAGC ACCCGGACTG GACGGGCCCG CAGATCAAGG CCGCGCTCAT GTCGACCGCC AAGGACGACG CGCTCAGCGT GTACAAGCAG GGCGCCGGCC GGGTCGACGT GGCCAGGGCC TACACCCAGC AGGTCTTCGC CGTCACCACC GGGGCCGACT TCGGCGCCGT CGAGAGCGAC GCCGCCCCCG TGACCCGGGA GCTGACCTAC ACCAACCTGG GCGGCCGGCC GGTCACCCTG ACCCTGACCC CCGGCCTGCG CAAGTCGGAC GGCAGCGCGG TCGAGGGCGG GCTGAGCATC GCCGAGACGA CGCTCACCGT GCCGGCCGGA GGCACCGCGA CCACCACCGC CACCGTCGAC CCCAAGACCC TGGCCGACCT GGACAACTAC ACCGGCGCCG TCACCGCGAC CGCCGACGGC GTCCAGCTGC GCATCCCCGT AGGCGTGGTG CGTGAGGTGC CCAAGGCCAC GCTCACCATC CACACCCTGG GCCGCGACGG CAAGCCGCGC AGTCCTTTGG CCCAGGACAC CATCGACGTG TCCGGCGACA AGGGCGTCCT CGGCGGTGTC GCCCTGACCG CCGAAGGGAC CACGGTCACC CGCGTCCCGC AGGGTGTCAT CAGCGTGACG CAGGTGCTGA GCTGGGTCGG CGACGACGAA AGGGGCAACC TGGCGTTCCT GTCCGTGCCC GAGCTCACCG TCACCGGGGA CACCGAGATC ACGCTCGACG CCCGCAAGCT GACCGAGATC CGATTCACCA CCCCGCAGCC GGCCGAACCG TTGAGCAACG TCCCCTACCT GGCGTATCAG CGGACCGTGA GCAACGGCAC CCCCTACATG GGATTCACGT GGCCCGACCG CACCTGGTCC CGGCTGTGGG CCCTGCCCAC GGAGAAGGTC ACCAAGGGCG CCTTCCGCTT CCACACCCGC TTCACTCTCG GCAGGCCTGA GGTCGAAATG AGCATACGCG GGCGGGGCGG GCTCACCCTG CACCCGGTCT CCGCGCTGCA CGGCATCAAC ACCTACGGCG TGCACGAGCA GTACGACGGC TTCCCCGACT TCCGGCCGTT CACCGGCACC CGCGACCTGG AGGTCGTCGA CGTCGGCGAG GGCAGGCCCG AGGACATCGC GGGCCGCGAC CTGCGCGGCA AGCTCGTCCT GATGGAAGCG CCGATGTCAG AGGGTTTGTC CGGCCCCATG TGCGGAGTCC AGATCGAGCG GATCGGCCCG ATCCGCGACG CCGGCGCGGC CGCGATCGCG TACTTCCCGC AGCCCGGGAC CGGTTGCGCG ATCCCGCTGA GCATCACGCA GATCCCGTTC ACCGGAGAGG CCAAGCCCAT CGGCGTCCCC ACCGTGTCCC TGCCCTCCCG TGAGGGGGTC GGCCTGCGCG ACCGGCTCGC CGGCGGCAAG CCGGTCACCC TCCGGGTGAC CGGCACCGAG GAGTCGCCCT ACACCTACAC CTTCGCCCCC TACGAAGAGG GCCGGGTGCC CCGCTCGCTG CACTACACCT TCGCCGAGCG CGACCTGGCA CGGATCGACG TGGACACCCA CACCGTCGCC CCCGCGCGCT ACAACGACTG GCGGTACGCG CAGAAGCCCG ACGACGTGAT GCCGATGTCG ACCTCCGTCT CCGCGTGGGG CGGCCCCCGG CTCACCCTGC AGGAACGCCG CGACTGGGTC GGACCGCTCG ACTCCGGGGT TCTCTGGTCG CACGGCATGG AGGAGATGCG CAGTACACCA GCTGATGCCC GGGTTCCGCA GTGGGCCCTG GAGGTGTTCG ACAAGCCGGT CCGCACCCGG CAGTCCTGGC TGACCACCCC GTTCACGCCC GGCGTCGCGA CCGGCTCCGA CAAGGTCTAC AAGCTCGCCA AGCCGGGCGC GAACCTCGGC TGGTTCTTCC GGTGCATGCT CTGCGTCCAG GGCGACCGTC TGTGGGCGGA GTTCGAGCCC AGCTACGGCT CGCCGGGCAC CAGGAAGTAC AACGGCGGCT ACTGGCCGAC GGACGACCTG ACCAAGCCGG GCTTCGACGT CCGCCTCTGG CAGGACGGCA AGGAGATCCC GCGCACCAGC ACCGGCGGCA CCACCATCCT GCCGGTCTTC ACCCTGCCGG AAGGGCCCGG CAGCTACCGG CTGACCGCCA AGAACGACCG GCACGACGCC GAATGGACCT TCACCGCCCC GGTCAAGGCC GAGCGACTGC CCGGATCCTT CTGCTCCCTC GAAGCGCTGT ACGGCACCGC GGAGCCCTGC AAGCCCGCCC CGGTCGTGTT CGTCAGCTAC GACCTGGGCG ACACCCTGGA CGCCGCCAAC AGCGTCCGCG CGGGCCGTAC GCACACCTTC ACCGTCTCCC CCTACCACTC CCCCTCCGCC TCCAAGATGC CTGACATCGC CGGCCTGAAG CTGTGGGCCA GCACCGACGA CGGCGCCACC TACACCCCCG TCTCCGTCAA GCGCGACAAG GACGGGACCT ACACCGCCAC CACCCGCTAC CCCGCCCTCC AGCAGACCAA GGGTGCCGTG ACCCTCAAGG TCGAGGCCTG GGACAAGGCG GGCAACACCG TCAAGCAGAC CACCGTCCGC GCCTTCAACC TCCGCGGCCA CGCCGCCGGA TAA
|
Protein sequence | MVLGTLVAPG TAAFAQKQEP APAAPATGPV TGPIALTGLG QGDRTVTLIT GDKVRLSDAG GGKYAVRPEA SVRPDGTSAQ LSTLATPKGV YVTPSDAVPA INAGRLDREL FNVTYLAENG YTDDKTKQLP VIVQYPRQHT DKAVASAAKA IPASEPVLTL ESVNASALQV TKAEAGTFWN AVRAVPDKKS ANGIAGPANV PDTLRGDIAK VWLDGKVKAD LAESVPMIGA PQAWAGGHDG AGVKVAVLDT GVDAKHPDLA DRIVDSRSFI PGQEVQDGHG HGTHVASTIA GSGAAGGGKH KGVAPGAQLI VGKVLANEGS GSDSQIIEGM EWAAASGAKV ISMSLGGGAS DGTDPMSQAV NVLSASTGAL FVIAAGNAGA SGAETVATPG TADAALTVAA VDKSDAWATF SSQGPRVGGG LKPDIAAPGV AIAAARAAGT TMGTPLDEHY TAANGTSMAT PHVAGAAAIM TQQHPDWTGP QIKAALMSTA KDDALSVYKQ GAGRVDVARA YTQQVFAVTT GADFGAVESD AAPVTRELTY TNLGGRPVTL TLTPGLRKSD GSAVEGGLSI AETTLTVPAG GTATTTATVD PKTLADLDNY TGAVTATADG VQLRIPVGVV REVPKATLTI HTLGRDGKPR SPLAQDTIDV SGDKGVLGGV ALTAEGTTVT RVPQGVISVT QVLSWVGDDE RGNLAFLSVP ELTVTGDTEI TLDARKLTEI RFTTPQPAEP LSNVPYLAYQ RTVSNGTPYM GFTWPDRTWS RLWALPTEKV TKGAFRFHTR FTLGRPEVEM SIRGRGGLTL HPVSALHGIN TYGVHEQYDG FPDFRPFTGT RDLEVVDVGE GRPEDIAGRD LRGKLVLMEA PMSEGLSGPM CGVQIERIGP IRDAGAAAIA YFPQPGTGCA IPLSITQIPF TGEAKPIGVP TVSLPSREGV GLRDRLAGGK PVTLRVTGTE ESPYTYTFAP YEEGRVPRSL HYTFAERDLA RIDVDTHTVA PARYNDWRYA QKPDDVMPMS TSVSAWGGPR LTLQERRDWV GPLDSGVLWS HGMEEMRSTP ADARVPQWAL EVFDKPVRTR QSWLTTPFTP GVATGSDKVY KLAKPGANLG WFFRCMLCVQ GDRLWAEFEP SYGSPGTRKY NGGYWPTDDL TKPGFDVRLW QDGKEIPRTS TGGTTILPVF TLPEGPGSYR LTAKNDRHDA EWTFTAPVKA ERLPGSFCSL EALYGTAEPC KPAPVVFVSY DLGDTLDAAN SVRAGRTHTF TVSPYHSPSA SKMPDIAGLK LWASTDDGAT YTPVSVKRDK DGTYTATTRY PALQQTKGAV TLKVEAWDKA GNTVKQTTVR AFNLRGHAAG
|
| |