Gene Sros_3786 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3786 
Symbol 
ID8667076 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4212514 
End bp4215909 
Gene Length3396 bp 
Protein Length1131 aa 
Translation table11 
GC content72% 
IMG OID 
ProductZinc metalloprotease (elastase)-like protein 
Protein accessionYP_003339450 
Protein GI271965254 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.550121 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.857915 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGACGCA GAGCTGTACT CGCGGCCGCC GTCGGCTTGG CCGTGGTCGC CGCGGTGGCA 
CCTTCCTCAC CTGCGGCCGC GAAACCCTCC CCCGGAACGG CCGGCTACCA CCGCTCCCCC
GCCGACCCGC CCCAGCTCGC CGTGGCCGCC GCCGACCAGG CAGTCGCGAG CGGCCTGGAC
GAACTGCGGA AAAGCCCGGA CGAGACCTAC CGGCGCACCG CCGTGAGCAC GGGCGCGGGC
GGCATGTACT CCGTCGCCTA CGAGCGCACG TACAAGGGAC TGCCCGTGGT GGGCGGCGAC
GCGGTCGTGA TCACCGACTC CGCCGGGAAC GTGCGCGACA CCGCGGCGGC GGCCGGCGCC
ATCCGGAACG TGCCGACCTC GGCGGCGGTG ACCGCCGCGG ACGCCAGGGC GAAGGCGAAG
ACCCGGCTGT CCCGGACCGA CGACAGCGCG GACCCGCGCC TGGTGGTCCT GGCCTGGGGG
GACAAGCCAC GGCTTGCCTG GGAGACGCTG GTCTCCGGCA TCGCCGACGG CAGACCGAGC
AAGCAGCACG TGTTCGTCGA CGCGCGCACC GGTGAGATCG CCGACTCCTA CGACGAGGTG
CGGGCGGGCA CCGGGACCGG CTACTACTAC GGCCAGGTCA CCATCGGCAC GAGCGGCTCC
GGCAGTTCGT ACTCGATGAC CGACACGACC CGCAGCGGCA TCCGCTGCGG CGGGCAGAAC
GGTTCCGCCT ACACCGGCAC CGACGACGTC TGGGGGAACG GCTCCGGGAC GAACCTGGAG
AGCGCCTGCG TGGACGCGCT CTACGCGGTT CAGAAGGAAT GGGACATGCT GCGCGACTGG
CTCGGTCGCA ACGGCATCAA CGGCAGCGGC GGCGGCTTCC CGGCCCGGGT GGGGCTGGCC
GATGTCAACG CCTACTGGAA CGGCAGCTAC ACCAACTTCG GCCGCTCGCA GGACGGCCAG
CGCCAGGCCA CGCCGATCGA CGTGGTGGGA CACGAGTTCG GCCACGCCAT CTTCCAGACC
ACGCCCGGCG GCGCCGGCTC CGGCAACGAG AACGGCGGCA TCAACGAGGC CGTCGGCGAC
GTCTTCGGGG CGCTCACCGA GGCCTACGCC AACAACCCGA ACGATCCGCC GGACTACGAG
GTCGGCGAGG AGGTGGACCT CGTCGGCGAC GGCCCGATCC GGTACATGTA CGAACCGTCC
CGGGTGGGCG ACCCCAACTG CTGGTCCTCC TCCATCCCGA GCACCGAGGT GCACGCCGCG
GCCGGCCCGT TCAACCACTG GTTCTACCTG CTGGCCGAGG GCTCCTCGCC GGGCGGCGGC
AAGCCGAACA GCCCGATCTG CTCCGGCGGC CCCTCCTCGG TGTCCGGCCT CGGCATCCAG
AAGGCCGGAA AGATCTTCAT GGGCGCGCTG ATGCGCAAGA CCTCCACCTG GAGGTACGCC
AACGCCAGGT CGGCGTCGCT GGCGGCGGCC GTGGAACTGT ACGGCGCGGG CAGCCCCGAG
TGCGCCGCCA CCAAGGCGGC CTGGAACGCG GTCAGCGTAC CGGTGCAGTC GGGCGAGCCG
ACCTGCGCCG CGACGGGCAA CGACTTCTCC GTCTCGCTGA ACCCGGCCGC GGGCTCGGTA
CAGCCCGGCC AGCAGGTCAC CTCGACCGTC GGCACGCAGA CCACCTCGGG CAGCGCGCAG
ACGGTCGCCC TCACCGCTTC CGGCCTGCCG GCGGGCGCCA CCGCGAGTTT CAGCCCGTCC
TCGGTGACCT CGGGCGGCTC CTCCACCATG GCCGTCAGCA CCGCCGGGTC GACGCCGGCC
GGGACCTACA TCGTCACGGT CACCGGCTCC GCCACCTCCG GCACGCACAC GGCGGCCTAC
ACCCTGACGG TCGGCAGCGG GCCCGCCCCG ACCGACCCGC CGGACATCAG CCTCGCGAAC
GTCAAGGCGC ACCTGCAGCA GTTCCAGAGC ATCGCCACCG CCAACGGCGG CACCCGCAGG
TCCACGGGCG CCGGCTACAC CGCGTCGGTG TCCTACATCG AGCAGAAGCT CACCGCCGCC
GGCTACACCG TGGTGCGGCA GCCCTGCACC TCCGGCTGCA CCTCCGGGGC CGGGCCGAAC
CTGATCGCCG ACTGGCCCGG CGGCGACGCC AACCAGGTCG TCATGGCCGG CGCCCATCTG
GACAGCGTCT CGGCCGGTCC GGGCATCAAC GACAACGCCT CGGGTTCGTC GGCGCTGCTG
GAGGTCGCGC TGACCCTGGC CGCGAAGAAC CCGGCGATGG CCAAGCACGT GCGGTTCGGC
TGGTGGACCG ACGAGGAGCA GGGCCTCAAC GGTTCGGAGT TCTACGTCAA CTCGCTGGGC
TCCACCGAGC GGAGCAAGAT CACGGTCTAC CACAACTATG ACATGGTGGG CTCGACCAAC
GGCGGCTACT TCATCAACAA CATCACCACC TCGGCGGCGA CGCACCTCAA GGCCTTCTAC
GACGGCCTGA ACCTCCAGCC GGAGGAGAAC ACCGAGGGGG CCAACCGCTC CGACGACGCG
TCGTTCCGCA ACGCGGGGAT CGCCACCTCC GGCGTGGCGG CCGGCGCCAG CGCCGTCAAG
ACATCGGCCC AGGCCGCCAA GTGGGGCGGT ACGGCGGGCC AGGCGTACGA CCCCTGCTAC
CACCGCGCGT GCGACACGAC GAGCAACATC AGCGACACCG TTCTGGACCG GGCCGCCGAC
GCCTCGGCGT ACGCGATCTG GAAGCTCGCG ACGGGCACCG CCACGAGCCG GGACTTCTCC
ATCTCCGCGA GCCCGTCGTC GGGGACGGTC CAGGCGGGCC AGGCGGTGAC CTCCACGGTC
GCCACCGCCA CCACGGCGGG AACCGCGCAG ACGGTCAGCC TGTCCGCGTC GGGGCTGCCC
GGCGGCGCCA CGGCCGGTTT CAACCCGGCG TCGGTGACCT CCGGCGGCTC GTCCACGCTG
ACCATCGCCA CCACGGCCAC GACGCCCCCC GGCACCTACC AGGTGACGGT CACCGGCACC
GGCGAGACCG CCACCCGCAC CTCGGTGTAC ACGCTGACCG TGCAGGGCAC CTCGTCCGGC
CGCACGTTCC GCAATGACAC CGACCACACC ATCGACGACT TCGGCACCGT CGAGAGCCCG
ATCACCTCCA CCGCGACCGG GACGGCGACG TCGCCCGTCA AGCTCACCGT CACGATCGAC
CACACCTGCG CCGAGGACCT GGAGATCTGG CTCCGCGGCC CGAACGGACG GTGGTACCTG
CTCGACAGCA GCGGCGGCTC CACCTGCACC GCGTACGGCA CGCGGACCTA CACGGTCCCG
GTCACCCAGC AGGCGGCCGG GCAGTGGCTG CTGGAGGTGA GCGACTACTA CTTCCAGGAC
ACCGGCACTC TCGACTGGTG GAGCATCACC GTCTGA
 
Protein sequence
MRRRAVLAAA VGLAVVAAVA PSSPAAAKPS PGTAGYHRSP ADPPQLAVAA ADQAVASGLD 
ELRKSPDETY RRTAVSTGAG GMYSVAYERT YKGLPVVGGD AVVITDSAGN VRDTAAAAGA
IRNVPTSAAV TAADARAKAK TRLSRTDDSA DPRLVVLAWG DKPRLAWETL VSGIADGRPS
KQHVFVDART GEIADSYDEV RAGTGTGYYY GQVTIGTSGS GSSYSMTDTT RSGIRCGGQN
GSAYTGTDDV WGNGSGTNLE SACVDALYAV QKEWDMLRDW LGRNGINGSG GGFPARVGLA
DVNAYWNGSY TNFGRSQDGQ RQATPIDVVG HEFGHAIFQT TPGGAGSGNE NGGINEAVGD
VFGALTEAYA NNPNDPPDYE VGEEVDLVGD GPIRYMYEPS RVGDPNCWSS SIPSTEVHAA
AGPFNHWFYL LAEGSSPGGG KPNSPICSGG PSSVSGLGIQ KAGKIFMGAL MRKTSTWRYA
NARSASLAAA VELYGAGSPE CAATKAAWNA VSVPVQSGEP TCAATGNDFS VSLNPAAGSV
QPGQQVTSTV GTQTTSGSAQ TVALTASGLP AGATASFSPS SVTSGGSSTM AVSTAGSTPA
GTYIVTVTGS ATSGTHTAAY TLTVGSGPAP TDPPDISLAN VKAHLQQFQS IATANGGTRR
STGAGYTASV SYIEQKLTAA GYTVVRQPCT SGCTSGAGPN LIADWPGGDA NQVVMAGAHL
DSVSAGPGIN DNASGSSALL EVALTLAAKN PAMAKHVRFG WWTDEEQGLN GSEFYVNSLG
STERSKITVY HNYDMVGSTN GGYFINNITT SAATHLKAFY DGLNLQPEEN TEGANRSDDA
SFRNAGIATS GVAAGASAVK TSAQAAKWGG TAGQAYDPCY HRACDTTSNI SDTVLDRAAD
ASAYAIWKLA TGTATSRDFS ISASPSSGTV QAGQAVTSTV ATATTAGTAQ TVSLSASGLP
GGATAGFNPA SVTSGGSSTL TIATTATTPP GTYQVTVTGT GETATRTSVY TLTVQGTSSG
RTFRNDTDHT IDDFGTVESP ITSTATGTAT SPVKLTVTID HTCAEDLEIW LRGPNGRWYL
LDSSGGSTCT AYGTRTYTVP VTQQAAGQWL LEVSDYYFQD TGTLDWWSIT V