Gene Sros_6440 details

Gene Information

Locus tag: Sros_6440
Symbol:
ID: 8669749
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 7050299
End bp: 7052299
Gene Length: 2001 bp
Protein Length: 666 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: Prolyl oligopeptidase
Protein accession: YP_003341897
Protein GI: 271967701
COG category:
COG ID:
TIGRFAM ID:
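
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS of the given length.

    Assumes the length is a multiple of 3 and that the final codon
    is a stop codon, which does not contribute a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(7050299, 7052299)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)  # 2001 666
```

Under these assumptions the computed values agree with the Gene Length (2001 bp) and Protein Length (666 aa) fields.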


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.575381
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.082879
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCTACC CTCCCGCCGA GCGGCAGCCG ATCGTCGACC ACCTCCACGG CCGTGCCGTA 
CCCGATCCCT ACCGGTGGCT GGAGGATCCG TCGAGCCCGG CGACGCGGAG CTGGCTGGCC
GCCCAGGACG AGCTGTGGCG CGGCCACGCC GCCGCGCTCG CCGGGCGCGA CGGATGGCAC
GCCCGGGTCT CCGAGCTCAC CGGCGTCGGA ACGGTCAGCC CGCCGGTCTG GCGCGGGGAG
CGCCGGTTCT TCGTGCGCCG TACGGCCCGG CAGGAGCACG CGGTGCTCTA CACCGCCACC
CCCCGCCAGG CCGAGCAGGC GCTGGTCGAC CCGGTGGAGC TCGACCCCAC CGGCCTGACC
ACGCTGGACC AGTGGCAGCC CGGCCCGGAC GGCCGGCTGC TGGCCTACCA GCTGTCCAGG
AGCGGTGACG AGCGGTCGGA GCTGTATGTC ATGGACGTCG GCACCCGGCA GATCGTCGAC
GGCCCCATCG ACCGCTGCCG CTACGCGCCC GTGGCGTGGC TGCCGGACGG GAAGGCGTTC
TTCTACGTCC GCTCCCGCAG GGTGTGCCTG CACCGGCTCG GCACCCCGGC CGGGGATGAC
GTGGTGATCT GCGGGACCGA CCGGTCCTAC GGCCTGGGGA TCAGCCACGA CGGCCGCTGG
CTGACGGTGT CGGCGGCGGC CGGCGGCGGC AACGACCTCT GGCTGGCCGA CCTGGCCGCG
TCCTCCCCGG AGCGCCCCGC GCTGCGGGTG ATCCAGGAGG GGACGGACGC GGTGACGGCG
CCGGCCGTCG GCCCCGACGG GCGCCTGTAC CTCCTCACCA CCATGGGCGC GCCCCGGGGC
CGCCTGTGCG TCGCCGACCC GGCGCGTCCC GAGCCCGAGC ACTGGCTCGA CCTGGTCGGA
CCGGATCCGG AGGCGGTGAT CGGCGACTTC GCGATCCTCG GCGACTCGGT GCTCCTGGTC
GGCTGGACCC GGCACGCGGT CAGCGAGATC AGCGTCCACG ACCTGGACGG CGGCGAGCCG
CTCGGCCGGG TGCCGCTGCC CGGTCTCGGC TCGGCCGGGC GGATGTCCGT GCGCCCCGGA
GACGGGCACG AGGTCTGGTT CACCTACACC GACAGCGTCA CCCCCGGCAG CGTGCACCGC
TACGACGCGC GCACCGGCCG GACCACGCTC TGGGCCGCCG CGCCCGGCGC CGCCGAGGTG
CCGGAGCTGT CGGCCCGCCG GATCGTCTAC CCCTCGGCGG ACGGGACGCC GGTGCGGATG
GTGGTGCTGG CACGCCCGGG GACCGGTCCC CGGCCGACGA TCCTGTACGG CTACGGCGGG
TTCGGGCTCT CGCTGACCCC CTCCTACTCC AGCTACATCC TGCCCTGGGT GGAGGCCGGA
GGGGTCTTCG TCCTCGCCCA GCTGCGCGGC GGCGGCGAGG AGGGCGCGCA GTGGCACCGC
GCCGGGATGC TCGACGGCAA GCAGAACGTC TTCGACGACT TCGTGGCGGC GGCGGAACGG
CTCATCGCCG ACGGGTGGAC CACCTCCGCG CAGCTGGCCG CCTGCGGCGA GTCGAACGGC
GGGCTGCTGG TCGGAGCCGC GGTGACGCAG CGGCCGGACC TGTTCGCCGC GGCGGTCTGC
TCGGCCCCGC TGCTCGACAT GGTCCGCTAC GAGCGTTCCG GCCTCGGCCC GTCATGGCGC
TCGGAGTACG GCTCGGCCTC CGACCCGGAG CAGCTGGGCT GGCTGCTGGG CTACTCCCCC
TACCACCGGG TCAGGGACGG GGTCGACTAT CCGGCGACGC TGCTGACCGC CTTCGGCGGC
GACTCCCGCG TCGACCCCTT CCACGCCCGC AAGATGTGCG CGGCGCTGCA GGGGGCGACC
TCGGGCTCCC GGCCGATCCT GCTGCGCCAC GAGAGCGACG TCGGGCACGG GGCACGGGCC
ACCAGCCGGG CGGTCGGGCT GGCGGCCGAC ATGCTCGCCT TCCTGGCCGC GCACACCGGT
CTCACGAAGG TCCCGCGGTG A
 
Protein sequence
MIYPPAERQP IVDHLHGRAV PDPYRWLEDP SSPATRSWLA AQDELWRGHA AALAGRDGWH 
ARVSELTGVG TVSPPVWRGE RRFFVRRTAR QEHAVLYTAT PRQAEQALVD PVELDPTGLT
TLDQWQPGPD GRLLAYQLSR SGDERSELYV MDVGTRQIVD GPIDRCRYAP VAWLPDGKAF
FYVRSRRVCL HRLGTPAGDD VVICGTDRSY GLGISHDGRW LTVSAAAGGG NDLWLADLAA
SSPERPALRV IQEGTDAVTA PAVGPDGRLY LLTTMGAPRG RLCVADPARP EPEHWLDLVG
PDPEAVIGDF AILGDSVLLV GWTRHAVSEI SVHDLDGGEP LGRVPLPGLG SAGRMSVRPG
DGHEVWFTYT DSVTPGSVHR YDARTGRTTL WAAAPGAAEV PELSARRIVY PSADGTPVRM
VVLARPGTGP RPTILYGYGG FGLSLTPSYS SYILPWVEAG GVFVLAQLRG GGEEGAQWHR
AGMLDGKQNV FDDFVAAAER LIADGWTTSA QLAACGESNG GLLVGAAVTQ RPDLFAAAVC
SAPLLDMVRY ERSGLGPSWR SEYGSASDPE QLGWLLGYSP YHRVRDGVDY PATLLTAFGG
DSRVDPFHAR KMCAALQGAT SGSRPILLRH ESDVGHGARA TSRAVGLAAD MLAFLAAHTG
LTKVPR
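
The gene and protein sequences above can be cross-checked with a short script. This is a minimal sketch rather than the method used to produce the record. The helper names `clean`, `gc_content` and `translate` are hypothetical. The codon table is the standard codon-to-amino-acid assignment, which translation table 11 shares; table 11 differs from the standard table only in its permitted start codons, and those are not modelled here. Only the first line of the gene sequence is used as input; the full sequence can be pasted in the same way.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record above.
first_line = (
    "ATGATCTACC CTCCCGCCGA GCGGCAGCCG "
    "ATCGTCGACC ACCTCCACGG CCGTGCCGTA"
)
print(translate(first_line))  # MIYPPAERQPIVDHLHGRAV
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Calling `gc_content` on the full gene sequence gives a value to compare against the GC content field.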