Gene Sros_3895 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3895 
Symbol 
ID8667185 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4338931 
End bp4342164 
Gene Length3234 bp 
Protein Length1077 aa 
Translation table11 
GC content71% 
IMG OID 
ProductZinc metalloprotease (elastase)-like protein 
Protein accessionYP_003339555 
Protein GI271965359 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATCCCA AGGTCAAACT GGGCGCGGTC GCCGTACTGG CCGCCGTCGC CATGACCGCC 
ACCTCGGTGG TGAGCGCCGC CGGCGCCGCC ACAGCCCCCG CGGGCATCGT CGCCCGCACC
ATATCCGCAG ACCCACCGGC GTTCGCCCCG CCGGACCCCG AATCCCGCAA GCGCGCGATA
TCCAGCGCCG ACGGCGCCCT CGCCCTCAGG TCCGACGTCC TGTTCAAGGC GGCGGAGGAC
GACTTCACGC TGACGAACAC CGTCGCGGGA ACCCGCGGCC TGCAGTACCT GACCTACTCG
CGGACCCACC GCGGGCTGCC CGTCTACGGC GGTGACGTCG TCGTCACCAC CGACAAGACG
GGCCAGGAGG TCGGGTCCGT GGCCTCGGGC CAGCGTGCCG AGATCAAGGT CGGCGTCAAG
AGCAAGGTCG ACGCCACCAC CGCTGCCGTC ACCGCCCGCG GCGAGCTGCC GACCGTGGAG
AGCGTGAGCA CGCCCCGGCT GGTCGTGCAC GCGGCGGGCA AGCGGCCCAG GCTGGCCTGG
GAGGTGGTCG TCACCGGCGC CACGAAGCAG GCCCCGAGCG TCCTGCACGT CTTCGTGGAC
GCCCTCGACG GGTCGGTCGT GGACTCCTAC GACGACGTCC GCGCCGGAAC CGGCAACGGC
TTCTACAACG GCAACCCGGT GACCATCCAG ACCTCCGGGT CCGGTGGTTC GTACTCGATG
ACCGACACCA CCCGGCCGGG GCTGCGCTGC GGCGGCCAGA ACGGCTCGGC CTACACCGGC
ACCGACGACG CCTGGGGCAA CGGCCAGGGC ACCAACCTGG AGACCGCCTG CGTCGACGCG
CTGTACGCCG CCCAGACCGA GTGGAACATG CTGCGCGACT GGCTGGGCCG CAGTGGCTTC
AACGGCTCCG GCGGCGCCTT CCCCGCCCGC GTGGGGCTGA CGGACGTCAA CGCCTACTGG
AACGGCTCCT ACACCAACTT CGGCCGCAAC CAGGCCAACA CCCAGCAGGC CACCCCGATG
GACGTGGTCG GCCACGAGTA CGGCCACGCG ATCTTCCAGT TCTCCGGATC CGGCGGCGCG
GGCAGCGGCA ACGAGGCCGG CGGCCTGAAC GAGTCGACCG GTGACATCTT CGGCGCGCTG
ACCGAGCACT TCGCGGCCAA CGCCTCCGAC CCGCCGGACT ACCTGGTCGG CGAGGAGGTC
AACCTGGTCG GCCAGGGCCC GATCCGCAAC ATGTACAACC CTGGCGCGCT GGGCGACCCC
AACTGCTACA GCTCCTCGAT CCCCAACACC GAGGTGCACG CGGCGGCCGG ACCGCAGAAC
CACTGGTTCT ACCTGCTGGC CGAGGGCACC AACCCGGGCG GCGGCAAGCC CTCCAGCACG
GTCTGCTCCG GCCCGTCCAG CCTGACCGGC ATCGGCATCC AGAAGGCCGG CCAGATCTTC
ATGTCCGGCC TGAACAGCAA GACCACGCCG TGGACGCACG CCAAGGCCCG CTCGACCACC
GTGGCCGCGG CCAAGCAGCT CTTCCCCAAC AGCTGCGTCG AGGTCAACGC GACCAAGGCC
GCGTGGGCCG CCGTCAACGT CCCGGCGCAG AGCGGCGAGG CGCCCTGCAC GGCGAACCCC
GGCAACGACT TCTCGCTCTC GCTCAGCCCG ACCTCGGGCA GCGTGCAGGC CGGGCAGTCC
GCCACCACCA CCGTGAGGAC CACGGTGACC GGCGGTAACG CCCAGTCCAT CACGCTGCGG
GCCTCCGGCC TGCCCTCGGG CGCGACCGCG TCCTTCAGCC CGGCCACCAT CACGGCCGGC
CAGACGTCGA CGCTGACGCT GGCCACCTCG GGGAGCACCC CGTCCGGCAC CTCCTCGGTG
ACGGTTACCG CCGACGGCGC CGACGTGGAC AGGACAGCGA GCTACTCGCT GACGGTCGGT
ACCGGCAACC CGCCGGGAGC GCCGGACATC CCCGTGGCCA ACGTGACGGC GCACCTGAAC
CAGCTGCAGT CGATCGCCTC CAGCAACGGC GGCAACCGGG CCTCGGCCAC CTCCGGCTAC
ACCGCCTCGC TGAACTACAT CAAGGGCAGG CTGGACGCGG CCGGCTACAC CACCGCGGTG
CAGAACTTCA CCTACAACGG CCAGACCCAC TCCAACCTGA TCGCCAACTG GCCGGCCGGT
CCCACCGGCC CGACGATCAT GCTCGGCAGC CACCTGGACA GCGTCAGCTC CGGGCCCGGC
ATCAACGACA ACGGCTCGGG CTCGGCGGCG CTGCTGGAGG TCGCGCTGAC CCTGGCGAAC
CGCAACCCCA CGCTGGACAA GCACGTCCGC TTCGCCTGGT GGGGCGCCGA GGAGCTGGGC
CTGCGCGGCT CGCAGTACTA CGTGCAGAAC GGCGGAGCCA CCGGGGTCGA GACCTACCTC
AACTTCGACA TGATCGCCTC GCCGAACCCG GGCTACTTCG TCTACGACGA CAACCCGGCC
ATCGAGAAGA TCTTCAAGGA CTACTACGCC ACGCTCAACG TCCCGACCGA GATCGAGACC
GAGGGTGACG GCCGCAGCGA CCACGCCCCG TTCAAGAACG CGGGTGTGCC GGTCGGCGGC
GTCTTCACCG GCGCCTCCAG CGTGAAGAGC TCGGCTCAGG CCACCAAGTG GGGCGGCACC
TCGGGCCTGG CCTTCGACCG CTGCTACCAC TCCGCCTGCG ACACCACCTC CAACATCAAC
AGCACCGCGC TGGACCGCAA CGCGGACGCC ATCGCCAACG CCCTGTGGAA GCTCGCCGTG
GGCGACACCC CCACCCCGAC GGACGACTAC TCGGTCTCGG CGAGCCCGTC CTCGGCCTCG
GTGCAGCCCG GCCAGTCGGC CGGCACGACG CTCAGCACCC AGGTGACCTC CGGTAACGCC
CAGGCCATCA CGCTGAGCGC CTCCGGCCTG CCCGCCGGCG CGACCGCGTC CTTCAGCCCG
GCGAACATCA ACTCCGGCCA GTCCTCCGCG GTCACGATCG CCACCTCGGC GAGCACGCCC
ACGGGGACCT ACACCGTCAA CCTCAACGCG GACGGCGCGA GCTCCGACCG CTCGGCCACC
TTCACCCTGA CCGTCGGCGG CGGGCAGGGC GGCACCACCT GGCAGACGTG GACCCTCTAC
GCGGCCGGGG ACACCGTGAC CTACAACGGC GTCAGCTACC GATGCCTGCA GGGGCACACC
TCACTGCCCG GATGGGAGCC GCCGAACGTT CCGGCCCTGT GGCAGCAGCT CTGA
 
Protein sequence
MNPKVKLGAV AVLAAVAMTA TSVVSAAGAA TAPAGIVART ISADPPAFAP PDPESRKRAI 
SSADGALALR SDVLFKAAED DFTLTNTVAG TRGLQYLTYS RTHRGLPVYG GDVVVTTDKT
GQEVGSVASG QRAEIKVGVK SKVDATTAAV TARGELPTVE SVSTPRLVVH AAGKRPRLAW
EVVVTGATKQ APSVLHVFVD ALDGSVVDSY DDVRAGTGNG FYNGNPVTIQ TSGSGGSYSM
TDTTRPGLRC GGQNGSAYTG TDDAWGNGQG TNLETACVDA LYAAQTEWNM LRDWLGRSGF
NGSGGAFPAR VGLTDVNAYW NGSYTNFGRN QANTQQATPM DVVGHEYGHA IFQFSGSGGA
GSGNEAGGLN ESTGDIFGAL TEHFAANASD PPDYLVGEEV NLVGQGPIRN MYNPGALGDP
NCYSSSIPNT EVHAAAGPQN HWFYLLAEGT NPGGGKPSST VCSGPSSLTG IGIQKAGQIF
MSGLNSKTTP WTHAKARSTT VAAAKQLFPN SCVEVNATKA AWAAVNVPAQ SGEAPCTANP
GNDFSLSLSP TSGSVQAGQS ATTTVRTTVT GGNAQSITLR ASGLPSGATA SFSPATITAG
QTSTLTLATS GSTPSGTSSV TVTADGADVD RTASYSLTVG TGNPPGAPDI PVANVTAHLN
QLQSIASSNG GNRASATSGY TASLNYIKGR LDAAGYTTAV QNFTYNGQTH SNLIANWPAG
PTGPTIMLGS HLDSVSSGPG INDNGSGSAA LLEVALTLAN RNPTLDKHVR FAWWGAEELG
LRGSQYYVQN GGATGVETYL NFDMIASPNP GYFVYDDNPA IEKIFKDYYA TLNVPTEIET
EGDGRSDHAP FKNAGVPVGG VFTGASSVKS SAQATKWGGT SGLAFDRCYH SACDTTSNIN
STALDRNADA IANALWKLAV GDTPTPTDDY SVSASPSSAS VQPGQSAGTT LSTQVTSGNA
QAITLSASGL PAGATASFSP ANINSGQSSA VTIATSASTP TGTYTVNLNA DGASSDRSAT
FTLTVGGGQG GTTWQTWTLY AAGDTVTYNG VSYRCLQGHT SLPGWEPPNV PALWQQL