Gene Sros_4312 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4312 
Symbol 
ID8667606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4818661 
End bp4822422 
Gene Length3762 bp 
Protein Length1253 aa 
Translation table11 
GC content72% 
IMG OID 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_003339943 
Protein GI271965747 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.338305 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAATCA GACGTAGGGC GGCACTACTC AGTGTCTCCC TCGCCGCCAC GCTTGTCGCG 
GCCGGCACCG GAAGCGCGCA CGGCGAGTCG CCGGCCCCCG GCGACGGGGT CGCCGGCATA
CCGGCGCCGT CGGCCATCAC CGCGCGGGTC ACCCTCATCA CCGGTGACCA GGTCGCCGTG
ACCAGCCAGG CGGGCAAGAC CGTCTCGGTC AACGTGACCC CGTACAACCG CTCGATCAGC
GACATCCGCA CCTTCACGGT CGGCACCGAC GTGTACGTGA TGCCGAAGGA GGCCGAGGCG
CTCGTCGGCT CCGGCCAGGT GGACAAGCGG CTGTTCAACG TCACCCAGCT GATCGCCCAG
GGCTACGACG ACGCGCACAG CGAGTCGATC CCGATGATCG TGCGGTACTC GGAGAAGGCC
GCCAAGAAGG CCGGCGAGCC GAGCGCGCCC GACGGTGGCC GGATCACCCG GCGCCTGCCC
GGCGTCCGCG GAGCGGCCGT CAGCGCGGAC AAGAAGCAGG GCGACACGTT CTGGGAGGCC
GTCGACGACG ACACCTCCAA GGCGGGTACT CCCACGACCC AGCTGTCGGG CGACATCCAG
CGCCTCTGGC TGGACGGCAA GGTGAAGGCG CTGCTCGACA GGAGCGTGCC GCAGATCGGC
GCGCCCGAGG CCTGGGCCGC CGGGTACGAC GGCGCCGGCG TCAAGGTCGC GGTGCTGGAC
ACCGGCGCGG ACTTGAGGCA TCCCGACCTC GTCGACAGGA TCGCCGACAG CCGTAGCTTC
GTCCCGGACG AGGCGGTGCA GGACGGCCAC GGCCACGGTA CGCATGTCGC CTCCACGATC
GCCGGCTCCG GCGCGGCGTC CGGCGGCAAG TACAAGGGTG TGGCGCCCGG CGCCAAGCTC
CTCGTGGGCA AGGTGCTCGC CGACGGGGGC GCCGGCATGG AGTCGTGGAT CCTCGACGGG
ATGACCTGGG CCGCCCACTC GGGCGCGAAG GTCGTCTCGA TGAGCCTCGG CGGCCAGGAG
GGCGCCGACG GCACCGATCC GATGGCGATG GCGGTCAACC AGCTGACCGC CGAGACCGGC
GTGCTGTTCA CCATCGCCGC GGGGAACAGC GGACCGGGGG CGACCACGGT GGGCTCACCC
GGCGCAGCCG ACGCGGCCCT GACCGTCGGC GCGGTCGACT CCGCCGACGC GGTCACCGAC
TTCTCCAGCC GCGGCCCCCG CGGGGGCGAC GGCGCGCTCA AGCCGGAGAT CACCGCCCCC
GGCTTCAAGA TCGTGGCCGC CCGTGCGACC GGCACCTCGA TGGGCACGCC GGTGGACGAC
ACCTACACCA CCGCGAGCGG TACCTCCATG GCCACGCCGC ATGTGGCCGG CGCCGCCGCG
ATCCTCGCCC AGGAGCACCC GGACTGGACC GCCGCGCGGC TGAAGAGCCA GCTCATCAGC
ACGGCGAAGA CCACGGCCGG TACGCCGGTG CACTCCCAGG GCGCCGGCCG GGTCGACGTG
AGCCGCGCCG TCCGGCAGCC GGTCCACGGG CCGGGCGTGA TCGACTTCGG CCTCGCGGAC
TGGGACTCCG GCAGCGGTCC GGCCACCAAG CAGATCGACT ACGTCAACGA CGGCGACCAG
CCGGTCACCC TGGCGCTGTC GGCAAAGGGC GCGGGCGAGG ACCTGCCGGC CGGGGCGCTG
ACCCTCGGCG CCGAAACGGT GACCGTGCCG GCCCACGGCA CCGCCGCGGC GAGCGTCACC
GTTGACACGG CCGCCACGGC GGCGGGGGCA CACGGCGCGC ACGTCACCGC CACCTCCGCC
GACGGCCAGA CGGTCGTGAC GACGGCGGTC GGTTTCGTCA GGGACGTGGA GCGCTTCGAC
GTCACCCTCA AGGTGCTCGA CCGGGACGGC AAGCCGACGA CCGGCGTCGC CGTCGTACAT
GACTTCAGTC TGGGGAAGCT GGGCCTGCCG GACATGCACG TGCCCGGGGA GGACGGCACC
GTCGTGGCGC GGCTGCCCCG CGGCGAGCAC GCCTTCATGG CGGAGATCTT CCACTACAGC
GCCGACGGGA ACCGCGTCTG GGAGTACACC TACGGCGGAG AGCCCGGCAC CCTCATCGAC
TCCGACCGGA CCATCACCTT CGACGGCGGC AAGGCAGGCC CGGTCACCAT CGACACCCCG
AAGCCGAACC AGACGGACAG GCTCCGGCTG GGCCTGGCCA TCCAGAACTC GACCGACACG
AACGCGTTCG GCGTCGAGCA GTGGATGAAC GGCGAGACGA AGGTGTACTC CATCCCGTCG
CAGGTCACCA GCAGCCACTT CGAGTCGCGC GTCGGATGGT CGCTCAGCGC GCCGGAACTC
CGGGCGCAGG TCGTCGGCCG CGGTGGCGGG CCGATCGAGC CGGTGTACTT CCGAGGCAGG
GCCGGCTCGG CGCGCATCGA CGGTACGCGG GTGCTGCGGG TGGCGGACGC CGGCACCGGC
CGGCCGCAGG ACTACGCCGG GCAGGCGGTC CAGGGCAGGC TCGCGCTGGT GAAGCGCAGC
GCCGACCTCA CCCCGCAACA GCAGGTCGAC AACGCCGCGG CGGCCGGCGC CGACGCGGTC
GTCATCTACA ACGACAACCC CGGCAACTGG GGCGCCGACA ACTGGAGCGT GACGGCGACC
AAGATCCCGG CGATGACCAT CTCGGGCACG CAGGGTGCCC GGCTGGCGGA GCTGGCGGCC
CGCGGCAGGG CCAAGGTCGA GCTCACGGGC ACGGCGGTGA GCCCGTACTC CTACGAACTG
CTCAAGTATC GGCAGGGCGG CATCCCCGCC GACCAGCGGT ACCGGGTACG GACGGACGAG
CTCGCCACGG TCGAGTCGTC GTTCCACGCC TCGGTGCCCG GCACGGAGGC CGGGTACTCC
CGGCTGATGG TCTCACCGAT GCAGACCGTG GCGTACCTGC TCTTCAACCG GATCGTCATG
CCGCGCCCCC TCACCCAATA CGTCACGGCC GACGTGAAGA CCTGGGAGGC GATGCAGGTC
GGCACCGTGT GGGAGGCCGG CCAGGCCGGG CAGCAGACCG CGCCGACGGT GCTCAAGGCC
GGTCAGCGGG TGTCCCGCGA CTGGAACAAG GCGGTGGTCC GTACGGCGCT GCCAAGCGGG
CTGGACAAGT CGGCCTACCG GGACGGTGCC GTCGCCGTGG TGTACGCGGG CGGATTGTCC
GACACGGTGC CCGACCAGTG GTTCATGAGT CGTGCCTTCA CCGACAAGGA GCGGACCACG
GTGTACCGCA ACGGTGAGCT GCTCGGATCC GCACCGTCAG CGGTCGTCGC CTTCCCGGTG
GTGCCGGAGC GCGCCGAGTA CCGCCTGGTG GTCGACGCGC AGCGGGACCA GCCTTGGTGG
ACCACCTCCA CCAAGGTGAA CACCGACTGG ACGTTCCACT CCGAGGAGGC CGGCGCCGGG
ACGCCGCTGC CGGTCCTGTC GGTCGACTAC GACCTCGATG TCGACCAGGC CAACAGCGCC
TCGGTGAAGT CCGCGACCCG TGTCGGGCTC GGCCTGCGCT ACCCGAAGGG CAAGGCCGGC
CCACGGATCA CCGAGGCGAA GCTGTGGGCC TCCTACGACG ACGGCACCAC CTGGCAGCAG
GTGCGGCTCG CCGCCAAGGG TGACGCGGCC TTCACCGGCA CGATCAGGAA CCCCGCCTCC
GCGAAGGCCG GCGGGTCCGT CTCGCTGCGG GCACAGGCCA CCGACGCCGA CGGCAACACC
GTCCTGCAGA CCGTGACCCG GGCGTACCGA CTGCAGCGGT GA
 
Protein sequence
MPIRRRAALL SVSLAATLVA AGTGSAHGES PAPGDGVAGI PAPSAITARV TLITGDQVAV 
TSQAGKTVSV NVTPYNRSIS DIRTFTVGTD VYVMPKEAEA LVGSGQVDKR LFNVTQLIAQ
GYDDAHSESI PMIVRYSEKA AKKAGEPSAP DGGRITRRLP GVRGAAVSAD KKQGDTFWEA
VDDDTSKAGT PTTQLSGDIQ RLWLDGKVKA LLDRSVPQIG APEAWAAGYD GAGVKVAVLD
TGADLRHPDL VDRIADSRSF VPDEAVQDGH GHGTHVASTI AGSGAASGGK YKGVAPGAKL
LVGKVLADGG AGMESWILDG MTWAAHSGAK VVSMSLGGQE GADGTDPMAM AVNQLTAETG
VLFTIAAGNS GPGATTVGSP GAADAALTVG AVDSADAVTD FSSRGPRGGD GALKPEITAP
GFKIVAARAT GTSMGTPVDD TYTTASGTSM ATPHVAGAAA ILAQEHPDWT AARLKSQLIS
TAKTTAGTPV HSQGAGRVDV SRAVRQPVHG PGVIDFGLAD WDSGSGPATK QIDYVNDGDQ
PVTLALSAKG AGEDLPAGAL TLGAETVTVP AHGTAAASVT VDTAATAAGA HGAHVTATSA
DGQTVVTTAV GFVRDVERFD VTLKVLDRDG KPTTGVAVVH DFSLGKLGLP DMHVPGEDGT
VVARLPRGEH AFMAEIFHYS ADGNRVWEYT YGGEPGTLID SDRTITFDGG KAGPVTIDTP
KPNQTDRLRL GLAIQNSTDT NAFGVEQWMN GETKVYSIPS QVTSSHFESR VGWSLSAPEL
RAQVVGRGGG PIEPVYFRGR AGSARIDGTR VLRVADAGTG RPQDYAGQAV QGRLALVKRS
ADLTPQQQVD NAAAAGADAV VIYNDNPGNW GADNWSVTAT KIPAMTISGT QGARLAELAA
RGRAKVELTG TAVSPYSYEL LKYRQGGIPA DQRYRVRTDE LATVESSFHA SVPGTEAGYS
RLMVSPMQTV AYLLFNRIVM PRPLTQYVTA DVKTWEAMQV GTVWEAGQAG QQTAPTVLKA
GQRVSRDWNK AVVRTALPSG LDKSAYRDGA VAVVYAGGLS DTVPDQWFMS RAFTDKERTT
VYRNGELLGS APSAVVAFPV VPERAEYRLV VDAQRDQPWW TTSTKVNTDW TFHSEEAGAG
TPLPVLSVDY DLDVDQANSA SVKSATRVGL GLRYPKGKAG PRITEAKLWA SYDDGTTWQQ
VRLAAKGDAA FTGTIRNPAS AKAGGSVSLR AQATDADGNT VLQTVTRAYR LQR