Gene Sros_1707 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1707 
Symbol 
ID8664984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1822636 
End bp1826079 
Gene Length3444 bp 
Protein Length1147 aa 
Translation table11 
GC content71% 
IMG OID 
ProductZinc metalloprotease (elastase)-like protein 
Protein accessionYP_003337441 
Protein GI271963245 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.255269 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCACCA AAGTCACGTT AGGCGCCGTC GCCATATTGG CGGTCGCGGC CCTGACCACC 
TCCACGCTGG CGAATCCGGC GAACGCCGCC GCCACGGGAG GCACCGCCCC CTCGGACCCG
GTGCCGGTGA GAGTCTCCGC GGAACGCGAG CCCGCCTTCA CCCCGCCGGA CGCGCAGAGC
CGCAAGCGCG CGATCACCAC CGCCGAACGC ACCCTGGCGG CGGAGTCCGC GTCCCTGCGC
AAGGCGTCCG GAGACAGTTT CACCCGCGAC AAGGTCGTCG TGGGAACGCA CGGTCTCCAG
TATCTCCACT ACAGGCGCGC CTACCGCGGA CTGCCGGTCT ACGGCGGTGA CGTGATCGTC
ATGACCAACA GGTCCGGCGA CAGGGTGGAG AGCCTCAGCT CCGGCCAGCG GGTCAAGCTG
AACCTCGACA CCAAGGCCAC CGTCGGCGCG GAAGCCGCCT CGGCCACCGC CCGCGGCCAG
CTCACCACCG TCGAGACCGC CGCCGCCCCC GTGCTCATGG TGCACGCGGC CACCGACAGG
CCCGTGCTGG CCTGGGAGGT CTCGCTCACC GGCCGCAACG CGCGGACCCC GAGCGTCCTG
CACGTCTACG TCGACGCCAC GACCGGCGCC GTCATCGACA AGGTGGAGGA GGTCAAGGCG
GGCACCGGCA ACAGCTTCTA CAACGGCAAC CCGGTGACCA TCCAGACCTC CGGCTCCTAC
TCGATGACCG ACACCACCCG GCCGGGGCTG CGCTGCGGCG GCCAGAACGG CTCGGCCTAC
ACCGGCACCG ACGACGCCTG GGGCAACGGC CAGGGCACCA ACCTGGAGAC CGCCTGCGTC
GACGCGCTGT ACGCCGCCCA GAAGGAATGG GACATGCTGC GCGACTGGCT GGGCCGCAGC
GGCTTCAACG GCTCCGGCGG CGCCTTCCCC GCCCGCGTGG GCCTGGCGGA CGTCAACGCC
TACTGGAACG GCTCCTACAC CAACTTCGGC CACAACCAGG CCAACACCAA ACAGGCCACC
CCGATGGACG TGGTCGCCCA CGAGTACGGC CACGCGATCT TCCAGTTCTC CGGGTCGGGG
GGCGCGGGCA GCGGCAACGA GGCCGGCGGC CTGAACGAGT CGACCGGCGA CATCTTCGGC
GCGCTGACCG AGCATTTCGT CAACCACCCC GCCAACCTGG ACGAGCCGGA CTACCTGGTC
GGCGAGGAGG TCGACCTGGT CGGCCAGGGC CCGATCCGCA ACATGTACAA CCCGGGCGCG
GTGGGCGACC CCAACTGCTA CAGCTCCTCG ATCCCCAACA CCGAGGTGCA CGCGGCGGCC
GGACCGCAGA ACCACTGGTT CTACCTGCTG GCCGAGGGCA CCAACCCGGG CGGCGGCAAG
CCCTCCAGCA CGGTCTGCTC CGGCCCGTCC AACCTGACCG GCATCGGCAT CCAGAAGGCC
GGCCAGATCT TCATGGGCGG CCTGAACGCC AAGACCACTC CGTGGACGCA CGCCAAGGCG
CGCCTGCGGA CCCTGGAGGC CGCCAAGGCG CTGTTCCCCA GCAGCTGCGT GGAGTTCAAC
GCGACCAAGG CCGCCTGGGC CGCCGTCAAC GTCCCGGCGC AGGGCGGCGA GGTGACCTGC
ACCGCGACGG GCGACGACTA CGCGATCACG GTGAGCCCGT CGTCGGGGTC GGTCCAGCCC
GGCGGGTCGG CCACGGCGAC GCTGAACACC TCCGTCGTCT CGGGCAGCGC GCAGACCGTG
ACCCTGTCGG CGGCCGGCGT CCCCTCGGGC GCCACGGTGA GCTTCAACCC CGCCTCCGTC
ACGGCGGGCC AGAGCTCGAC GGTGACGCTC GCCACCTCGG CGAACACGCC GCAGGGGACC
TTCCCGATCA CCCTGAACGC CAGCGCGCCC TCCGGCGCCA AGTCGGTGAC CTACTCGCTG
ACCGTCGGCA CCGGCAACCC GCCGACCGGG GCACCGGACA TCCCCGTGGC CAACGTGACG
GCGCACCTGA ACCAGCTGCA GTCGATCGCC TCCGGCAACG GCGGCAACCG GGCCTCGGCC
ACCTCCGGCT ACACCGCCTC GCTGAACTAC ATCAAGGGCA AGCTGGACGC GGCCGGCTAC
ACCACCGCGG TGCAGAACTT CACCTACAAC GGCCAGACCC ACTCCAACCT GATCGCCAAC
TGGCCGGCCG GTCCCACCGG CCCGACGATC ATGCTCGGCA GCCACCTGGA CAGCGTCAGC
TCCGGGCCCG GCATCAACGA CAACGGCTCG GGCTCGGCGG CGCTGCTGGA GGTCGCGCTG
ACGCTGGCCG ACCGCAACCC CACGCTGGAC AAGCACGTCC GCTTCGCCTG GTGGGGCGCC
GAGGAGCTGG GCCTGCGCGG CTCGCAGCAC TACGTGTCGA ACTCCGGTGT CTCCGGCATC
GAGGCCTACC TCAACTTCGA CATGATCGCC TCGCCGAACC CGGGCTACTT CGTCTACGAC
GACGACACCG CGCTGGAGAA GGTGTTCAAG GACTACTTCG CCACGCTCAA CGTCCCGACC
GAGATCGAGA CCGCGGGTGA CGGCCGCAGC GACCACGCCC CGTTCAAGAA CGCGGGCGTC
AGAGTGGGCG GCCTGTTCAC CGGCGCCGAG ACGGCCAAGA CCTCGGCACA GGCCGCCAAG
TGGGGCGGCA CCGCGGGCCA GGCCTTCGAC CGCTGCTACC ACTCCGCCTG CGACACCACC
TCCAACATCA ACAGCACCGC GCTGGACCGC AACAGCGACG CCGTGGCCAA CGCGCTGTGG
AAGCTGGCGG TGCGCCCCGG CCCGGTCGGC GACGACTACT CGATGGCGGT GAACCCGGCG
TCCGGCACCG TCCAGGCGGG CCAGTCGGCC GACGCGACGC TGAGCACCAC GGTCACCGGC
GGCAACGCGC AGAGCGTGTC GCTGTCGGCC TCCGGCGCCC CCGCCGGGAC CACGGTGACC
TTCACCCCCT CGACCGTCAC GGCCGGCCAG ACGTCCGCGG TGCGGATCAC GACCTCCGCC
TCCACCCCGG CGGGCACCTA CACCATCGCC CTCAACGGCA CCGGGACCTC CGCCAACCGC
TCGGCGATCT ACACCCTGAC GGTCGGCGGG ACCGGCGGCG GCCGCACCTT CACCAACGAC
ACGCCCTTCG AGATCAACGA CGGCTACCAG GACTCCAGTG ACATCCCGGT CACCCTCACC
GGGTCCCCGA ACGCCACGTT CACGGTCTCG GCCGACATCG ACCACACCTG CTCGCAGGAC
CTGCGCCTGA CCCTCGTCCG GCCGAACGGG ACCTCACAGG TGCTGAAGTA CGAGTCCTAC
ACCGCCTGCA CGCCGTACAG CGGGCCGGTG CGGTTCACGG TGAACAACCC CTCCCGCTTC
GGCAACGGCA CCTGGTCCCT CGTGGTCGGC GACTACTACC AGGGCGACAC CGGCACCCTC
AACGCCTGGA GCATCACCTT CTAG
 
Protein sequence
MSTKVTLGAV AILAVAALTT STLANPANAA ATGGTAPSDP VPVRVSAERE PAFTPPDAQS 
RKRAITTAER TLAAESASLR KASGDSFTRD KVVVGTHGLQ YLHYRRAYRG LPVYGGDVIV
MTNRSGDRVE SLSSGQRVKL NLDTKATVGA EAASATARGQ LTTVETAAAP VLMVHAATDR
PVLAWEVSLT GRNARTPSVL HVYVDATTGA VIDKVEEVKA GTGNSFYNGN PVTIQTSGSY
SMTDTTRPGL RCGGQNGSAY TGTDDAWGNG QGTNLETACV DALYAAQKEW DMLRDWLGRS
GFNGSGGAFP ARVGLADVNA YWNGSYTNFG HNQANTKQAT PMDVVAHEYG HAIFQFSGSG
GAGSGNEAGG LNESTGDIFG ALTEHFVNHP ANLDEPDYLV GEEVDLVGQG PIRNMYNPGA
VGDPNCYSSS IPNTEVHAAA GPQNHWFYLL AEGTNPGGGK PSSTVCSGPS NLTGIGIQKA
GQIFMGGLNA KTTPWTHAKA RLRTLEAAKA LFPSSCVEFN ATKAAWAAVN VPAQGGEVTC
TATGDDYAIT VSPSSGSVQP GGSATATLNT SVVSGSAQTV TLSAAGVPSG ATVSFNPASV
TAGQSSTVTL ATSANTPQGT FPITLNASAP SGAKSVTYSL TVGTGNPPTG APDIPVANVT
AHLNQLQSIA SGNGGNRASA TSGYTASLNY IKGKLDAAGY TTAVQNFTYN GQTHSNLIAN
WPAGPTGPTI MLGSHLDSVS SGPGINDNGS GSAALLEVAL TLADRNPTLD KHVRFAWWGA
EELGLRGSQH YVSNSGVSGI EAYLNFDMIA SPNPGYFVYD DDTALEKVFK DYFATLNVPT
EIETAGDGRS DHAPFKNAGV RVGGLFTGAE TAKTSAQAAK WGGTAGQAFD RCYHSACDTT
SNINSTALDR NSDAVANALW KLAVRPGPVG DDYSMAVNPA SGTVQAGQSA DATLSTTVTG
GNAQSVSLSA SGAPAGTTVT FTPSTVTAGQ TSAVRITTSA STPAGTYTIA LNGTGTSANR
SAIYTLTVGG TGGGRTFTND TPFEINDGYQ DSSDIPVTLT GSPNATFTVS ADIDHTCSQD
LRLTLVRPNG TSQVLKYESY TACTPYSGPV RFTVNNPSRF GNGTWSLVVG DYYQGDTGTL
NAWSITF