Gene Sros_3516 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3516 
Symbol 
ID8666804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3891922 
End bp3894480 
Gene Length2559 bp 
Protein Length852 aa 
Translation table11 
GC content68% 
IMG OID 
ProductMicrobial collagenase 
Protein accessionYP_003339195 
Protein GI271964999 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGATTCA CACCGTCCCT GCGCGCGTTC GGAAAACATC TGCCCGCACT GCTCGCACCC 
GTGCTGGGCC TGGGACTCGT CCTCCAGCCG CTGGTGGCCT CGCCCGCCGC TGCCGCCACC
GCTCCGCAGC CCCCGTCCGT CGGGGTGCGC GCCGCCGACG ACGGATCGCA GCACGTCCAG
GCGTCCCCCC TGGCGGTCAA GGAGCGCCCG CCGCTGTCGG CCTCGAAGGA CGCGCTCCGG
CGCGACCTCG ACAAACCCAC GCCCACGCCG GAACATCCGA GGCCCTCGGC GAAGACCGAC
AAATCCGCCA AGTCGACCGC CGCGGCGGCC GCGTGCAGCG CGAGCGACTT CACCAGCCGC
TCCGGAAGCG CCCTCGTCCA GCAGATCAAG GCCTCGACCA CCGAGTGCGT CAACACCCTG
TTCAACCTGA CCGGCGGCGA CGCCTACTAC GCGTTCCGCG AGTCGCAGAT GACCAGCGTC
GCCTACGCGC TGCGCGACAA CGCCGCCTCC TACCAGGGAG ACAACAGCAC GAGCACCGCC
CAGCTCGTGC TCTACCTGCG CGCCGGCTAC TACGTGCAGT GGTACCACCC CTCCACCGTG
GGCACCTACG GGCCCTCCCT GAAGACCGCC ATCCAGTCCG GACTCGACGG ATTCTTCGGC
AACTCCCGGT CCTCGACCGT CAGCGACGCC AACGGCGAGA TCCTCGCCGA GGCCGTCACC
CTGATCGACA GCGCCCAGGA GAACGACCGC TACCTCCACG TCGTCAAGCG GCTGCTCAAC
GGCTACAACA GCTCCTACGA CGCCAGCTGG TGGATGCTCA ACGCGGTCAA CAACGTCTAC
ACGGTGCTCT TCCGCGGCCA CGAGGTTCCG GCGTTCGTCA CCCTGGTCAA GTCCGACCCG
AGCGTCATCG ACACCCTCAA CACCTTCGCG CTGAACCACC TCGACCTGCT CGGCACCGAC
CGTGACTACC TGACCGCCAA CGCCGGCCGG GAGATCGGGC GCTTCCTCCA GCACTCCACG
CTGCGGGCCA AGGTCCGGCC GATGGCCAAG GACCTGCTCA CCCGTAGCAA CATCACCGGG
ACGACCGCCC CGCTGTGGGT CGGCGTAGCC GAGATGACCG ACGCCTACGA CAAGGCCAAC
TGCGCCTACT TCAACACCTG CAACCTCCAG CAGGTCCTCG CGCAGACCAT CCTGCCCGTC
AACCACACCT GCAGCCCCAG CATCAAGATC CGCGCCCAGG CGATGACCTC CGCGCAGCTC
GCCGACAGCT GCACCAGCCT CATCAACCAG GACGCGTACT TCCACAACGT GGCCAAGGAC
GGCAACCGCC CGGTCGCCGA CGACAACAAC ACCACGATCG AAGTCTGCAT CTTCGACTCG
AGCACCGACT ACCAGACCTA CGCCGGTGCG ATGTTCGGCA TCAGCACCAA CAACGGCGGG
ATGTACCTGG AGGGCAACCC GGCCGCCGCG GGCAACCAGC CGCGCTTCAT CGCCTACGAG
GCCGAATGGG TACGGCCCGC CTTCCAGGTC TGGAACCTGA ACCACGAGTA CACCCACTAC
CTCGACGGCC GGTTCAACAT GTACGGCGAC TTCGAGGACG GCGTCACCAC CCCGACCATC
TGGTGGATCG AGGGCTTCGC CGAGTACATC TCCTACTCCT ACCGCAAGGA GACCTACGAC
GCCGCGATCA CCGAGGCGGC CAGGCAGACC TACGCGCTCA GCACGCTGTG GGACACCACC
TACGACAACG ACACGACCCG CATCTACCGC TGGGGCTACC TGGCGGTGCG GTACATGCTC
GAGAAGCACC CCGCTGACGT CGCCACCGTC CTCGGCCACT ACCGGACCGG CGGCTGGAAC
TCCGCCCGGA ACTTCCTGAC CTCGACGATC GGCACCCGCT ACGACAGTGA CTGGCGGACC
TGGCTGACGG CGTGCGCGTC GGGCGGCTGC GCGGGCGGAG GCGGCGGCAA CCAGGCGCCG
AGCGCGAACT TCACCTTCAC CGTCAACGGC CTCGCCACCA CCTTCACCGA CACCTCGACC
GACTCGGACG GGACGATCGC GTCGCGCCAG TGGAACTTCG GCGACGGCAC CTCGTCGACC
TCGGCCAACC CCTCGCACAC CTACACCACC GCGGGCACCT ACACGGTGCA GCTGACCGTC
ACCGACAACG CCGGCGCCAC CGCCACCGCG GGCAAGCAGG TCGTGGTGAC CTCCGGTGGG
TCCTCCGCCC CCGAATGCAC CGCCGGCAGG ACCGACGAGC TCGGCCGGAA CTGCAAGCGC
AGCAACCTGT CAGCCGCCGC CGGCGAGTAC AAGTACCTCT ACCTGTACGT CCCGGCCGGC
ACCGCCAAGC TGACCATCAC CTCCTCCGGC GGCACCGGAA ACGCCAACCT CTACTACAGC
CCCACCTCCT GGGCCACCAC GTCGACCCAC ACCCAGCGGT CGACCAACGC GGGCAACGGC
GAGACCCTGA CCATCACCAA CCCCCCGACC GGCTACAACT ACATCAGCCT CCACGCGGCA
CAGGCCTTCG CCGGAGCCGC GCTCCAGGTC GACTACTGA
 
Protein sequence
MRFTPSLRAF GKHLPALLAP VLGLGLVLQP LVASPAAAAT APQPPSVGVR AADDGSQHVQ 
ASPLAVKERP PLSASKDALR RDLDKPTPTP EHPRPSAKTD KSAKSTAAAA ACSASDFTSR
SGSALVQQIK ASTTECVNTL FNLTGGDAYY AFRESQMTSV AYALRDNAAS YQGDNSTSTA
QLVLYLRAGY YVQWYHPSTV GTYGPSLKTA IQSGLDGFFG NSRSSTVSDA NGEILAEAVT
LIDSAQENDR YLHVVKRLLN GYNSSYDASW WMLNAVNNVY TVLFRGHEVP AFVTLVKSDP
SVIDTLNTFA LNHLDLLGTD RDYLTANAGR EIGRFLQHST LRAKVRPMAK DLLTRSNITG
TTAPLWVGVA EMTDAYDKAN CAYFNTCNLQ QVLAQTILPV NHTCSPSIKI RAQAMTSAQL
ADSCTSLINQ DAYFHNVAKD GNRPVADDNN TTIEVCIFDS STDYQTYAGA MFGISTNNGG
MYLEGNPAAA GNQPRFIAYE AEWVRPAFQV WNLNHEYTHY LDGRFNMYGD FEDGVTTPTI
WWIEGFAEYI SYSYRKETYD AAITEAARQT YALSTLWDTT YDNDTTRIYR WGYLAVRYML
EKHPADVATV LGHYRTGGWN SARNFLTSTI GTRYDSDWRT WLTACASGGC AGGGGGNQAP
SANFTFTVNG LATTFTDTST DSDGTIASRQ WNFGDGTSST SANPSHTYTT AGTYTVQLTV
TDNAGATATA GKQVVVTSGG SSAPECTAGR TDELGRNCKR SNLSAAAGEY KYLYLYVPAG
TAKLTITSSG GTGNANLYYS PTSWATTSTH TQRSTNAGNG ETLTITNPPT GYNYISLHAA
QAFAGAALQV DY