Gene Sros_4277 details

Gene Information

Locus tag: Sros_4277
Symbol:
ID: 8667571
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4760696
End bp: 4762813
Gene Length: 2118 bp
Protein Length: 705 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: Prolyl oligopeptidase
Protein accession: YP_003339913
Protein GI: 271965717
COG category:
COG ID:
TIGRFAM ID:
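
The coordinates and lengths listed above agree with each other if the start and end positions are read as 1-based inclusive coordinates and the CDS is taken to end in a single stop codon. Both of these are assumptions, not something the record states. A minimal Python sketch of that check follows; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4760696, 4762813)
print(length, protein_length(length))  # prints: 2118 705
```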


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCATGC GGATGAGCGA CATCAGCGGC CCCCCGATCG CCCGAGTGGA CGTGGTGCGC 
GACGTTCACC ACGGCATCAC GCTCGACGAT GCCTACCGGT GGATGGAGGC GGACGGCGAG
GAGTTCCACC GCTGGCTCGA CGGCCAGGCC AGCCATGCCC GCGAGCAGCT GGACACCCTG
CCCCGCCGGG AGGAGCTGCT GACCCGGATC CGCGAGCTGG GCGGCGCGCT CCCGCAGTAC
TCGGACTTCA CGCTGGCGGG CGACAGGATC TTCTACCTGG CGCGCGAACC GGGCGCCAGC
GTACTCGTGC TGATGGTGCG TGAGCCCGAG GGAGCGAGCA GGGTCCTGTT CGACCCGGAC
ACCGTGCCCG GCGAGGCGCA CCACGCCATC GCCTGGTACG TGCCCTCCCC CGACGGCCGC
CACATCGCCT GCGCCGTCTT CCCGTCCGGG TCGGAGGAGA GCACGATCCA GGTCATTGAC
GTCGGCTCCA GTGCCATCCT GGAGACGATC CCGCCCACGA CGCGCTTTGC CTTCCTGAGC
TGGCTGGAGG ACGGCCGGTC GTTCGTCTTC CACCGCTTCG TGGCTCCTCC GGCGGGCGCT
CCGCCGGAGG AGCGCCGCCT GGACAGCCGT TCCTTCCTGC ACCGGCTGGG CACCGACCCG
GCGCGGGATG AGGTGGTGCT GGCGCGCGGG CTCAACCCCC ACGTCGAGCT GACCCCGCAG
GATCGGCCCT TCCTTGTGGT GCCGCCGCGC GGTGAGTGGA TGCTGGCGAT CGTCTCCCAC
GGCGCGCTCG GCCCCTGGAC CAGCGAGCAG TTGAGCGATT GCTCGCTGTA CGTGGCGCCA
CGAGCCGCGC TCGCCGATCC GGCCACCTGC CCGTGGCGGA AGGTAGCGGG CGCGGCAGAC
GGTGTGACCG CCTTCGCCAC CAGCCACGAC ACCCTCTATC TGGTCTCCCA TCGGGACGCG
CCCCGCTCCC ACGTGCTGGC CGTTCCGTTC GCCGTACCGG ATCTGTCTCG AGCGCGGGTG
GTCGTGCCCG CGGGTGAGCG CGTGGTGGAG GCGGTGCGGG TCGTCGGCGA CCGGCTGCTG
GTGCGCGATC TCGACGGTGG CGTCCATCGG GTACGGCATG CGCCCCTGCC CGGCGGCGAG
CCCGCCGACC TCCCGCTGCC GGTAGAGGGC GCCATCTGGG AGTTGACCAC CCACCCGGAA
CGGCCCGAGG CCCTGCTGCT GACGGCCGGT TGGACAGACG CGCCGCGGCT CTACCGCTAC
GACGGCGAAA CCCTCGAGGA CACCGGGCTG GCGCCCCGCT CACCGGTCGA CTTCGGCGAC
GTGCACGCCC GCACCCTGCA TGTGCCCGCG AGGGACGGAA CACGGATCCC GCTCACCGTC
ATCCACCACA AGGACCTGGA GCTGGACGGC GACAACCCGG TCCTGCTCAC CGGCTATGGC
TCCCATGGCA TGTCCGAGCT GCCTGAATTC CGTCCCGAGA TGCTCGCCTG GTACGAGCGC
GGCGGCGTCT ACGCCGTGGC GCACCTGCGC GGTGGCGGCG CCTATGGCCG GGAGTGGCAC
GAGGCCGGGC GTGGCCTGCG CAAGGAGGCC ACCATCACGG ACTTCATCGA CTGCGCCGAG
CACCTCATCG CCCTGGGCTA CACCCGGCAG GGACGGCTGG CGGGCGAGGG CGTCAGCGCC
GGCGGCATCC CCACTGGCGG CGCGCTGGTG CGCCGGCCGG AGCTGTGGGC TGCCATGGCG
ATGCAGGTGC CGACGGTCAA CGCCACACGG ACCGAGTTCA GTGAGAACGG GCCGATCAAC
GTGCCGGAGT TCGGCAGCGT CACCACTGAG GATGGCCTGC GCGGTCTGCT GATCGCCGAT
GCCTACCTGC GGGTCGAGGA CAGCGTGCCG TACCCGGCGG TGCTGCTGAC CACCGGACGG
CGCGACGCGC GGGTGCCGCC GTGGCAGCCG GCGAAGCTGG CCGCCCGCCT GCAGGCGGCC
AGCGCCTCCG GCCGGCCGGT ACTGCTGCGT GTTGAGGAGC ACGGCGGTCA CGGCGCCGGC
TCGACCCGCG AGCAGGAACA TGCCCTGCTC GCTGATGTGC TGGCCTTCCT GTTGCACGCT
TTCGAGGCGT CACGATGA
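
The gene sequence is displayed in space-separated blocks of 10 bases. A minimal Python sketch of stripping that layout and computing the length and GC fraction is given here. The helper names are hypothetical, and the formula is an assumption: the record does not say how its GC content figure was derived, so (G + C) / total bases is used.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, assumed here to be (G + C) / length."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed row of the gene sequence; paste all rows for the full gene.
first_row = "ATGATCATGC GGATGAGCGA CATCAGCGGC CCCCCGATCG CCCGAGTGGA CGTGGTGCGC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 2))
```

Applied to the full sequence, the cleaned length can be compared with the Gene Length field and the GC fraction with the GC content field.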
 
Protein sequence
MIMRMSDISG PPIARVDVVR DVHHGITLDD AYRWMEADGE EFHRWLDGQA SHAREQLDTL 
PRREELLTRI RELGGALPQY SDFTLAGDRI FYLAREPGAS VLVLMVREPE GASRVLFDPD
TVPGEAHHAI AWYVPSPDGR HIACAVFPSG SEESTIQVID VGSSAILETI PPTTRFAFLS
WLEDGRSFVF HRFVAPPAGA PPEERRLDSR SFLHRLGTDP ARDEVVLARG LNPHVELTPQ
DRPFLVVPPR GEWMLAIVSH GALGPWTSEQ LSDCSLYVAP RAALADPATC PWRKVAGAAD
GVTAFATSHD TLYLVSHRDA PRSHVLAVPF AVPDLSRARV VVPAGERVVE AVRVVGDRLL
VRDLDGGVHR VRHAPLPGGE PADLPLPVEG AIWELTTHPE RPEALLLTAG WTDAPRLYRY
DGETLEDTGL APRSPVDFGD VHARTLHVPA RDGTRIPLTV IHHKDLELDG DNPVLLTGYG
SHGMSELPEF RPEMLAWYER GGVYAVAHLR GGGAYGREWH EAGRGLRKEA TITDFIDCAE
HLIALGYTRQ GRLAGEGVSA GGIPTGGALV RRPELWAAMA MQVPTVNATR TEFSENGPIN
VPEFGSVTTE DGLRGLLIAD AYLRVEDSVP YPAVLLTTGR RDARVPPWQP AKLAARLQAA
SASGRPVLLR VEEHGGHGAG STREQEHALL ADVLAFLLHA FEASR