Gene Information |
Locus tag | Sros_3895 |
Symbol | |
ID | 8667185 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 4338931 |
End bp | 4342164 |
Gene Length | 3234 bp |
Protein Length | 1077 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | Zinc metalloprotease (elastase)-like protein |
Protein accession | YP_003339555 |
Protein GI | 271965359 |
COG category | |
COG ID | |
TIGRFAM ID | |
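The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes 1-based inclusive coordinates and a CDS ending in a single stop codon. The function name `cds_lengths` is hypothetical and is not part of the source database.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record itself): coordinates are 1-based
    and inclusive, and the CDS ends with one stop codon that is not counted
    in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # subtract the terminal stop codon
    return gene_len, protein_len


# Start bp and End bp as listed for Sros_3895.
gene_len, protein_len = cds_lengths(4338931, 4342164)
print(gene_len, protein_len)  # 3234 1077
```

The result agrees with the listed Gene Length (3234 bp) and Protein Length (1077 aa).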
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATCCCA AGGTCAAACT GGGCGCGGTC GCCGTACTGG CCGCCGTCGC CATGACCGCC ACCTCGGTGG TGAGCGCCGC CGGCGCCGCC ACAGCCCCCG CGGGCATCGT CGCCCGCACC ATATCCGCAG ACCCACCGGC GTTCGCCCCG CCGGACCCCG AATCCCGCAA GCGCGCGATA TCCAGCGCCG ACGGCGCCCT CGCCCTCAGG TCCGACGTCC TGTTCAAGGC GGCGGAGGAC GACTTCACGC TGACGAACAC CGTCGCGGGA ACCCGCGGCC TGCAGTACCT GACCTACTCG CGGACCCACC GCGGGCTGCC CGTCTACGGC GGTGACGTCG TCGTCACCAC CGACAAGACG GGCCAGGAGG TCGGGTCCGT GGCCTCGGGC CAGCGTGCCG AGATCAAGGT CGGCGTCAAG AGCAAGGTCG ACGCCACCAC CGCTGCCGTC ACCGCCCGCG GCGAGCTGCC GACCGTGGAG AGCGTGAGCA CGCCCCGGCT GGTCGTGCAC GCGGCGGGCA AGCGGCCCAG GCTGGCCTGG GAGGTGGTCG TCACCGGCGC CACGAAGCAG GCCCCGAGCG TCCTGCACGT CTTCGTGGAC GCCCTCGACG GGTCGGTCGT GGACTCCTAC GACGACGTCC GCGCCGGAAC CGGCAACGGC TTCTACAACG GCAACCCGGT GACCATCCAG ACCTCCGGGT CCGGTGGTTC GTACTCGATG ACCGACACCA CCCGGCCGGG GCTGCGCTGC GGCGGCCAGA ACGGCTCGGC CTACACCGGC ACCGACGACG CCTGGGGCAA CGGCCAGGGC ACCAACCTGG AGACCGCCTG CGTCGACGCG CTGTACGCCG CCCAGACCGA GTGGAACATG CTGCGCGACT GGCTGGGCCG CAGTGGCTTC AACGGCTCCG GCGGCGCCTT CCCCGCCCGC GTGGGGCTGA CGGACGTCAA CGCCTACTGG AACGGCTCCT ACACCAACTT CGGCCGCAAC CAGGCCAACA CCCAGCAGGC CACCCCGATG GACGTGGTCG GCCACGAGTA CGGCCACGCG ATCTTCCAGT TCTCCGGATC CGGCGGCGCG GGCAGCGGCA ACGAGGCCGG CGGCCTGAAC GAGTCGACCG GTGACATCTT CGGCGCGCTG ACCGAGCACT TCGCGGCCAA CGCCTCCGAC CCGCCGGACT ACCTGGTCGG CGAGGAGGTC AACCTGGTCG GCCAGGGCCC GATCCGCAAC ATGTACAACC CTGGCGCGCT GGGCGACCCC AACTGCTACA GCTCCTCGAT CCCCAACACC GAGGTGCACG CGGCGGCCGG ACCGCAGAAC CACTGGTTCT ACCTGCTGGC CGAGGGCACC AACCCGGGCG GCGGCAAGCC CTCCAGCACG GTCTGCTCCG GCCCGTCCAG CCTGACCGGC ATCGGCATCC AGAAGGCCGG CCAGATCTTC ATGTCCGGCC TGAACAGCAA GACCACGCCG TGGACGCACG CCAAGGCCCG CTCGACCACC GTGGCCGCGG CCAAGCAGCT CTTCCCCAAC AGCTGCGTCG AGGTCAACGC GACCAAGGCC GCGTGGGCCG CCGTCAACGT CCCGGCGCAG AGCGGCGAGG CGCCCTGCAC GGCGAACCCC GGCAACGACT TCTCGCTCTC GCTCAGCCCG ACCTCGGGCA GCGTGCAGGC CGGGCAGTCC GCCACCACCA CCGTGAGGAC CACGGTGACC GGCGGTAACG CCCAGTCCAT CACGCTGCGG GCCTCCGGCC TGCCCTCGGG CGCGACCGCG TCCTTCAGCC CGGCCACCAT CACGGCCGGC 
CAGACGTCGA CGCTGACGCT GGCCACCTCG GGGAGCACCC CGTCCGGCAC CTCCTCGGTG ACGGTTACCG CCGACGGCGC CGACGTGGAC AGGACAGCGA GCTACTCGCT GACGGTCGGT ACCGGCAACC CGCCGGGAGC GCCGGACATC CCCGTGGCCA ACGTGACGGC GCACCTGAAC CAGCTGCAGT CGATCGCCTC CAGCAACGGC GGCAACCGGG CCTCGGCCAC CTCCGGCTAC ACCGCCTCGC TGAACTACAT CAAGGGCAGG CTGGACGCGG CCGGCTACAC CACCGCGGTG CAGAACTTCA CCTACAACGG CCAGACCCAC TCCAACCTGA TCGCCAACTG GCCGGCCGGT CCCACCGGCC CGACGATCAT GCTCGGCAGC CACCTGGACA GCGTCAGCTC CGGGCCCGGC ATCAACGACA ACGGCTCGGG CTCGGCGGCG CTGCTGGAGG TCGCGCTGAC CCTGGCGAAC CGCAACCCCA CGCTGGACAA GCACGTCCGC TTCGCCTGGT GGGGCGCCGA GGAGCTGGGC CTGCGCGGCT CGCAGTACTA CGTGCAGAAC GGCGGAGCCA CCGGGGTCGA GACCTACCTC AACTTCGACA TGATCGCCTC GCCGAACCCG GGCTACTTCG TCTACGACGA CAACCCGGCC ATCGAGAAGA TCTTCAAGGA CTACTACGCC ACGCTCAACG TCCCGACCGA GATCGAGACC GAGGGTGACG GCCGCAGCGA CCACGCCCCG TTCAAGAACG CGGGTGTGCC GGTCGGCGGC GTCTTCACCG GCGCCTCCAG CGTGAAGAGC TCGGCTCAGG CCACCAAGTG GGGCGGCACC TCGGGCCTGG CCTTCGACCG CTGCTACCAC TCCGCCTGCG ACACCACCTC CAACATCAAC AGCACCGCGC TGGACCGCAA CGCGGACGCC ATCGCCAACG CCCTGTGGAA GCTCGCCGTG GGCGACACCC CCACCCCGAC GGACGACTAC TCGGTCTCGG CGAGCCCGTC CTCGGCCTCG GTGCAGCCCG GCCAGTCGGC CGGCACGACG CTCAGCACCC AGGTGACCTC CGGTAACGCC CAGGCCATCA CGCTGAGCGC CTCCGGCCTG CCCGCCGGCG CGACCGCGTC CTTCAGCCCG GCGAACATCA ACTCCGGCCA GTCCTCCGCG GTCACGATCG CCACCTCGGC GAGCACGCCC ACGGGGACCT ACACCGTCAA CCTCAACGCG GACGGCGCGA GCTCCGACCG CTCGGCCACC TTCACCCTGA CCGTCGGCGG CGGGCAGGGC GGCACCACCT GGCAGACGTG GACCCTCTAC GCGGCCGGGG ACACCGTGAC CTACAACGGC GTCAGCTACC GATGCCTGCA GGGGCACACC TCACTGCCCG GATGGGAGCC GCCGAACGTT CCGGCCCTGT GGCAGCAGCT CTGA
|
Protein sequence | MNPKVKLGAV AVLAAVAMTA TSVVSAAGAA TAPAGIVART ISADPPAFAP PDPESRKRAI SSADGALALR SDVLFKAAED DFTLTNTVAG TRGLQYLTYS RTHRGLPVYG GDVVVTTDKT GQEVGSVASG QRAEIKVGVK SKVDATTAAV TARGELPTVE SVSTPRLVVH AAGKRPRLAW EVVVTGATKQ APSVLHVFVD ALDGSVVDSY DDVRAGTGNG FYNGNPVTIQ TSGSGGSYSM TDTTRPGLRC GGQNGSAYTG TDDAWGNGQG TNLETACVDA LYAAQTEWNM LRDWLGRSGF NGSGGAFPAR VGLTDVNAYW NGSYTNFGRN QANTQQATPM DVVGHEYGHA IFQFSGSGGA GSGNEAGGLN ESTGDIFGAL TEHFAANASD PPDYLVGEEV NLVGQGPIRN MYNPGALGDP NCYSSSIPNT EVHAAAGPQN HWFYLLAEGT NPGGGKPSST VCSGPSSLTG IGIQKAGQIF MSGLNSKTTP WTHAKARSTT VAAAKQLFPN SCVEVNATKA AWAAVNVPAQ SGEAPCTANP GNDFSLSLSP TSGSVQAGQS ATTTVRTTVT GGNAQSITLR ASGLPSGATA SFSPATITAG QTSTLTLATS GSTPSGTSSV TVTADGADVD RTASYSLTVG TGNPPGAPDI PVANVTAHLN QLQSIASSNG GNRASATSGY TASLNYIKGR LDAAGYTTAV QNFTYNGQTH SNLIANWPAG PTGPTIMLGS HLDSVSSGPG INDNGSGSAA LLEVALTLAN RNPTLDKHVR FAWWGAEELG LRGSQYYVQN GGATGVETYL NFDMIASPNP GYFVYDDNPA IEKIFKDYYA TLNVPTEIET EGDGRSDHAP FKNAGVPVGG VFTGASSVKS SAQATKWGGT SGLAFDRCYH SACDTTSNIN STALDRNADA IANALWKLAV GDTPTPTDDY SVSASPSSAS VQPGQSAGTT LSTQVTSGNA QAITLSASGL PAGATASFSP ANINSGQSSA VTIATSASTP TGTYTVNLNA DGASSDRSAT FTLTVGGGQG GTTWQTWTLY AAGDTVTYNG VSYRCLQGHT SLPGWEPPNV PALWQQL
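The gene sequence begins with GTG, yet the protein begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG is an accepted start codon and is read as methionine at the initiation position. The sketch below illustrates this on the first 30 bases of the gene. It is a self-contained approximation that assumes the sequence is already given in coding orientation, as it is here. The helper names `clean`, `translate_cds` and `gc_fraction` are hypothetical.

```python
import itertools

# Codon-to-amino-acid assignments in the conventional TCAG ordering.
# Table 11 shares these assignments with the standard code; it differs
# in the set of permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the whitespace used to group bases in blocks of ten; uppercase."""
    return "".join(seq.split()).upper()


def translate_cds(seq):
    """Translate a coding-orientation CDS, reading the start codon as M."""
    seq = clean(seq)
    if seq[:3] not in START_CODONS:
        raise ValueError("sequence does not begin with a table 11 start codon")
    protein = ["M"]
    for i in range(3, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
prefix = "GTGAATCCCA AGGTCAAACT GGGCGCGGTC"
print(translate_cds(prefix))  # MNPKVKLGAV
```

Passing the full gene sequence to `gc_fraction` gives the value to compare against the GC content field in the Gene Information table.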