Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_3516 |
Symbol | |
ID | 8666804 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 3891922 |
End bp | 3894480 |
Gene Length | 2559 bp |
Protein Length | 852 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | Microbial collagenase |
Protein accession | YP_003339195 |
Protein GI | 271964999 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGATTCA CACCGTCCCT GCGCGCGTTC GGAAAACATC TGCCCGCACT GCTCGCACCC GTGCTGGGCC TGGGACTCGT CCTCCAGCCG CTGGTGGCCT CGCCCGCCGC TGCCGCCACC GCTCCGCAGC CCCCGTCCGT CGGGGTGCGC GCCGCCGACG ACGGATCGCA GCACGTCCAG GCGTCCCCCC TGGCGGTCAA GGAGCGCCCG CCGCTGTCGG CCTCGAAGGA CGCGCTCCGG CGCGACCTCG ACAAACCCAC GCCCACGCCG GAACATCCGA GGCCCTCGGC GAAGACCGAC AAATCCGCCA AGTCGACCGC CGCGGCGGCC GCGTGCAGCG CGAGCGACTT CACCAGCCGC TCCGGAAGCG CCCTCGTCCA GCAGATCAAG GCCTCGACCA CCGAGTGCGT CAACACCCTG TTCAACCTGA CCGGCGGCGA CGCCTACTAC GCGTTCCGCG AGTCGCAGAT GACCAGCGTC GCCTACGCGC TGCGCGACAA CGCCGCCTCC TACCAGGGAG ACAACAGCAC GAGCACCGCC CAGCTCGTGC TCTACCTGCG CGCCGGCTAC TACGTGCAGT GGTACCACCC CTCCACCGTG GGCACCTACG GGCCCTCCCT GAAGACCGCC ATCCAGTCCG GACTCGACGG ATTCTTCGGC AACTCCCGGT CCTCGACCGT CAGCGACGCC AACGGCGAGA TCCTCGCCGA GGCCGTCACC CTGATCGACA GCGCCCAGGA GAACGACCGC TACCTCCACG TCGTCAAGCG GCTGCTCAAC GGCTACAACA GCTCCTACGA CGCCAGCTGG TGGATGCTCA ACGCGGTCAA CAACGTCTAC ACGGTGCTCT TCCGCGGCCA CGAGGTTCCG GCGTTCGTCA CCCTGGTCAA GTCCGACCCG AGCGTCATCG ACACCCTCAA CACCTTCGCG CTGAACCACC TCGACCTGCT CGGCACCGAC CGTGACTACC TGACCGCCAA CGCCGGCCGG GAGATCGGGC GCTTCCTCCA GCACTCCACG CTGCGGGCCA AGGTCCGGCC GATGGCCAAG GACCTGCTCA CCCGTAGCAA CATCACCGGG ACGACCGCCC CGCTGTGGGT CGGCGTAGCC GAGATGACCG ACGCCTACGA CAAGGCCAAC TGCGCCTACT TCAACACCTG CAACCTCCAG CAGGTCCTCG CGCAGACCAT CCTGCCCGTC AACCACACCT GCAGCCCCAG CATCAAGATC CGCGCCCAGG CGATGACCTC CGCGCAGCTC GCCGACAGCT GCACCAGCCT CATCAACCAG GACGCGTACT TCCACAACGT GGCCAAGGAC GGCAACCGCC CGGTCGCCGA CGACAACAAC ACCACGATCG AAGTCTGCAT CTTCGACTCG AGCACCGACT ACCAGACCTA CGCCGGTGCG ATGTTCGGCA TCAGCACCAA CAACGGCGGG ATGTACCTGG AGGGCAACCC GGCCGCCGCG GGCAACCAGC CGCGCTTCAT CGCCTACGAG GCCGAATGGG TACGGCCCGC CTTCCAGGTC TGGAACCTGA ACCACGAGTA CACCCACTAC CTCGACGGCC GGTTCAACAT GTACGGCGAC TTCGAGGACG GCGTCACCAC CCCGACCATC TGGTGGATCG AGGGCTTCGC CGAGTACATC TCCTACTCCT ACCGCAAGGA GACCTACGAC GCCGCGATCA CCGAGGCGGC CAGGCAGACC TACGCGCTCA GCACGCTGTG GGACACCACC TACGACAACG ACACGACCCG CATCTACCGC TGGGGCTACC TGGCGGTGCG GTACATGCTC GAGAAGCACC CCGCTGACGT CGCCACCGTC CTCGGCCACT ACCGGACCGG CGGCTGGAAC TCCGCCCGGA ACTTCCTGAC CTCGACGATC GGCACCCGCT ACGACAGTGA CTGGCGGACC TGGCTGACGG CGTGCGCGTC GGGCGGCTGC GCGGGCGGAG GCGGCGGCAA CCAGGCGCCG AGCGCGAACT TCACCTTCAC CGTCAACGGC CTCGCCACCA CCTTCACCGA CACCTCGACC GACTCGGACG GGACGATCGC GTCGCGCCAG TGGAACTTCG GCGACGGCAC CTCGTCGACC TCGGCCAACC CCTCGCACAC CTACACCACC GCGGGCACCT ACACGGTGCA GCTGACCGTC ACCGACAACG CCGGCGCCAC CGCCACCGCG GGCAAGCAGG TCGTGGTGAC CTCCGGTGGG TCCTCCGCCC CCGAATGCAC CGCCGGCAGG ACCGACGAGC TCGGCCGGAA CTGCAAGCGC AGCAACCTGT CAGCCGCCGC CGGCGAGTAC AAGTACCTCT ACCTGTACGT CCCGGCCGGC ACCGCCAAGC TGACCATCAC CTCCTCCGGC GGCACCGGAA ACGCCAACCT CTACTACAGC CCCACCTCCT GGGCCACCAC GTCGACCCAC ACCCAGCGGT CGACCAACGC GGGCAACGGC GAGACCCTGA CCATCACCAA CCCCCCGACC GGCTACAACT ACATCAGCCT CCACGCGGCA CAGGCCTTCG CCGGAGCCGC GCTCCAGGTC GACTACTGA
|
Protein sequence | MRFTPSLRAF GKHLPALLAP VLGLGLVLQP LVASPAAAAT APQPPSVGVR AADDGSQHVQ ASPLAVKERP PLSASKDALR RDLDKPTPTP EHPRPSAKTD KSAKSTAAAA ACSASDFTSR SGSALVQQIK ASTTECVNTL FNLTGGDAYY AFRESQMTSV AYALRDNAAS YQGDNSTSTA QLVLYLRAGY YVQWYHPSTV GTYGPSLKTA IQSGLDGFFG NSRSSTVSDA NGEILAEAVT LIDSAQENDR YLHVVKRLLN GYNSSYDASW WMLNAVNNVY TVLFRGHEVP AFVTLVKSDP SVIDTLNTFA LNHLDLLGTD RDYLTANAGR EIGRFLQHST LRAKVRPMAK DLLTRSNITG TTAPLWVGVA EMTDAYDKAN CAYFNTCNLQ QVLAQTILPV NHTCSPSIKI RAQAMTSAQL ADSCTSLINQ DAYFHNVAKD GNRPVADDNN TTIEVCIFDS STDYQTYAGA MFGISTNNGG MYLEGNPAAA GNQPRFIAYE AEWVRPAFQV WNLNHEYTHY LDGRFNMYGD FEDGVTTPTI WWIEGFAEYI SYSYRKETYD AAITEAARQT YALSTLWDTT YDNDTTRIYR WGYLAVRYML EKHPADVATV LGHYRTGGWN SARNFLTSTI GTRYDSDWRT WLTACASGGC AGGGGGNQAP SANFTFTVNG LATTFTDTST DSDGTIASRQ WNFGDGTSST SANPSHTYTT AGTYTVQLTV TDNAGATATA GKQVVVTSGG SSAPECTAGR TDELGRNCKR SNLSAAAGEY KYLYLYVPAG TAKLTITSSG GTGNANLYYS PTSWATTSTH TQRSTNAGNG ETLTITNPPT GYNYISLHAA QAFAGAALQV DY
|
| |