Gene Sros_3161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3161 
Symbol 
ID8666449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3441991 
End bp3443979 
Gene Length1989 bp 
Protein Length662 aa 
Translation table11 
GC content75% 
IMG OID 
Productpeptidase S9, prolyl oligopeptidase active site region 
Protein accessionYP_003338849 
Protein GI271964653 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.464305 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAACG ACTCTCCCAT CCGCCCCGTC GACGTGGCCA GGGTGGACGA CCGTCCGCTC 
TGGGTGGAGA TCCTCGGGGA GGAGGTGTGG TGGGACGAGC CTCGTCCGCA TGAGGGCGGG
CGCAGGTGCG TGGTCCGGCG CGGGCCGGAC GGCGTGCCCC GTGACGCGAT CCCGCAGGGC
TGGAACTCCC GCAACCGCCT CATCGAGTAC GGCGGGCGCT CCTGGCGGCC CCTGCCGGAC
GGCGGGGTCG TGTTCACCAA CTGGGCCGAC CAGCGCATCT ACCTGTACGG CGGGCAGCCG
GCCCCCGGCT CCGGCGAGGC GGCCGGCGGA CCGGCCCGCG CCGGCGACGG CGACAGTGGC
GGCGACGGCG GGCAGCCGGT CCCCCTCACC CCCGACGACG GGGCCCGCTA CGGCGACCTG
TACCTGCCGC CGGGGCTGCG GGAGGTCTGG GCGGTCCGCG AGACCCATCC CGCCCCGGAG
ACCACCGCGC CCGGGACCGG CGCCGAGCCC GCTCCCCGGT CCCTGCCCTG CCGGGAGCTG
GTGGCGGTGC CGCTGGACGG CGGGCCGGTG CGGCTCATCG TCCGCGCGCA GCACTTCCTG
ACCAACCCCC GCCTCTCCCC CGACGGCACG CACATCGCCT GGATCGGCTG GGACCACCCC
GCCATGCCCT GGGACGGCAC CGAGCTGTGC GTGGCGCCCC TGGACGCGCT CGGCTCGGCC
GGGCCGTACC GGGTCGTCGC CGGCGGCCCG GAGGAGTCGG TGATCCAGGC CGAGTGGCGC
GACGACGGCG CGCTCTACGC CCTCACCGAC CCGGACGGCT GGTGGAACCT GCATCTGGTC
CCCCTCGACG GCTCCCCCGC CCGCAACCTG GCACCGCTCC AGGAGGACTG CGGCGACGCC
GTATGGCGGC TCGGCAACAC CTGGTTCTCG CTCGCCGGCG ACAGGATCGT CCTCGTGCAC
GGCACCCCGG ACCGCCGCCG CCTCGGCGTG CTGGACCCGG CCACCGGCGA GATGACCGAT
CTCGACGCCC CGCCCACCTA CTGGAACCCG ACGGTCTCCA CCGGCGGGGA CCTGGTGGCA
GGCGTGGCGG CCTCGCCGTA CACGCCGTTC GAGGTCGTCA CGGTGGACCT GCGCACCGGC
GCGCACGCCG TGCTGTCCCC GGAGAAGGAG CTGCCCGACC GCGACGTGCT GCCCGACCCC
GAGGCGGTGA CCTTCGACGG CGTGCACGCG CACCTGTACC CGCCGCGCGG CGTCACCGGC
CCCGCCCCCT ATGTGATCTT CGTGCACGGC GGGCCGACCA GCGCCAGCAC GATGGTCCTC
GACGTGGAGA TCGCCTACTT CACCAGCCGG GGCATCGGCG TGGCCGACGT GAACTACGGC
GGTTCCACCG GCTACGGCCG CGCCTACCGG GAGCGGCTGC GCCACCAGTG GGGCGTGGTG
GACGTGCGCG ACTGCGAGAC GGTGGCGCAC GGCCTGATCG CCGCCGGGCG GGCGCACCCG
TCGAAGGTTG CGATCCGGGG CGGCAGCGCG GGCGGCTGGA CGTCGGTGGC CGCGCTGGTG
CACAGCAAGG TGTTCCGCGG CGCGGTGGCC CACTACGCCA TCACCGACCC GGAGGGCTGG
GCGGCCGAGA CCCACGACTT CGAGTCGCGC TACCTCGACG GGCTGATCGG CCCGCTCCCC
GAGACCCGGC AGCGCTACCT GGACCGTTCC CCCACCCTGC ACGCCGCGAA CGCGTCGGGT
CCCGCGCTGC TCATGCACGG CCTGGAGGAC GCGATCGTCG ACCCGGTCCA GGCGGAGCGG
TTCGCCGCCG CGCTGGAGCG CGAGGGCACC CCGTGGGCCT ACCTGACCTT CCCCGGCGAG
CAGCACGGCT GGCGCCGGGA GGAGACCATC ATCGCCGCGA TGGAGGCCGA ACTGGCCTTC
TACGGCCTGA TCTTCGGTTT CCCGACGCCC GAGGTGCCGC CTCTGACCCT GCGGGGTATG
CACCAATGA
 
Protein sequence
MLNDSPIRPV DVARVDDRPL WVEILGEEVW WDEPRPHEGG RRCVVRRGPD GVPRDAIPQG 
WNSRNRLIEY GGRSWRPLPD GGVVFTNWAD QRIYLYGGQP APGSGEAAGG PARAGDGDSG
GDGGQPVPLT PDDGARYGDL YLPPGLREVW AVRETHPAPE TTAPGTGAEP APRSLPCREL
VAVPLDGGPV RLIVRAQHFL TNPRLSPDGT HIAWIGWDHP AMPWDGTELC VAPLDALGSA
GPYRVVAGGP EESVIQAEWR DDGALYALTD PDGWWNLHLV PLDGSPARNL APLQEDCGDA
VWRLGNTWFS LAGDRIVLVH GTPDRRRLGV LDPATGEMTD LDAPPTYWNP TVSTGGDLVA
GVAASPYTPF EVVTVDLRTG AHAVLSPEKE LPDRDVLPDP EAVTFDGVHA HLYPPRGVTG
PAPYVIFVHG GPTSASTMVL DVEIAYFTSR GIGVADVNYG GSTGYGRAYR ERLRHQWGVV
DVRDCETVAH GLIAAGRAHP SKVAIRGGSA GGWTSVAALV HSKVFRGAVA HYAITDPEGW
AAETHDFESR YLDGLIGPLP ETRQRYLDRS PTLHAANASG PALLMHGLED AIVDPVQAER
FAAALEREGT PWAYLTFPGE QHGWRREETI IAAMEAELAF YGLIFGFPTP EVPPLTLRGM
HQ