Gene Sros_2219 details


Gene Information

Locus tag: Sros_2219
Symbol:
ID: 8665501
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 2399029
End bp: 2400909
Gene Length: 1881 bp
Protein Length: 626 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: peptidase S9, prolyl oligopeptidase active site region
Protein accession: YP_003337944
Protein GI: 271963748
COG category:
COG ID:
TIGRFAM ID:
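
The start, end, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below is not part of the record. It assumes 1-based inclusive coordinates and that the final codon is a stop codon excluded from the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2399029
end_bp = 2400909
gene_length_bp = 1881
protein_length_aa = 626

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codons = gene_length_bp // 3
expected_aa = codons - 1

print(span == gene_length_bp)            # True
print(gene_length_bp % 3 == 0)           # True
print(expected_aa == protein_length_aa)  # True
```

Under these assumptions all three checks agree with the record: the span is 1881 bp, which is 627 codons, or 626 residues plus one stop codon.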


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.194335
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCCCG AGCAGGTCGT CCTGCCGTAC GCCACCTGGC CTTCCCCGAT CTCCAGCACC 
GGGGTGGCCA GGTCCGGGCT GCGCCTGGGA TTCCCGACGG TGGTCGGCGA AGAGGTGTGG
TGGACCGAGG ACCGCCCCGC GGAGAGCGGG CGGACCACGA TCGCGCACCG GGCCGCCGAC
GGCACGCACC GCGAGCTGCT GTCGGCGCCG TGGAGCGCCA GGACCCGCGT CCACGAGTAC
GGCGGGCGCT CCTACGCGGT GGTTCCCGGC GGGGGCGTGG TGTTCGCCAA CCTCGCCGAC
CAGCGGCTCT ACCTGCTGCC CCCCTGCGCC GAACCCCGGC CGATCACGCC GAGGCCCGAC
CGGGAGTCCG GGCTCCGCTA CGCGGACATG ATCGTCCACG ACGGGCAGGT CTGGTGCGTC
CAGGAGCGGC ACCACGACGG CGGCGGGATC AGCCGTTCGA TCGTGTCCGT CCCGCTGGAC
GGCGGGGACG TGCCGCGGGA GCGGGTCGGC GGGAGCGACT TCTACGCCTG TCTCGCCCTC
TCCCCCGACG GCGAGCACCT GGCCTACATC TGCTGGGACC ACCCCCGCAT GCCGTGGAAC
GGCACCGAGC TGCGGATCAC CCGGCTGGCC GACGGCACCT CCTGGACGGT CGGGGGCGGG
CCCTCCGAGT CGGTGCTCGC CCCGCAGTGG CGCGACGACC GGCATCTCTA TCTGGTCTCC
GACCGGTCGG GCTGGTGGAA CCTCTACCAG ATCGGCATCG ACGGCACCTC GCCCCGGGCG
CTCCACCCGG TGGAGGAGGA GTTCGCCGGA CCGCTGTGGC AGCTCGGCGG CCCGCCGTAC
CGGGTGCTGG CCGACGGGCG GATCGCGGTC CTGCACGGGC GGGGGGACAT GCGGCTGGGC
GTCCTCGACC CGGACAGCGG CGTGCTGACC GACCTGGACG TGCCCTACGA CGGCTGGGAG
CAGGTCCTCG CGTCCGACGG GCGCGTCCTG GCCGGGATCG GATACAGCGC GACGGTGCCC
CGGTCGATCG TCCGCGTGGA CACCGCGACC GGGCGGGCGG AGGAGCTCCG CCGTGACGTC
GACGAGCTGC CCGACCTCGC CTACCTGCCG CTCGCCCGGA CCGTGGAGAT CGAGGGCCGC
TCCGGCCGCC GGGTCCACGC GTTCGTCCAT CCGCCGTCGA ACCCGCAGGC CCGGGGCGAC
GGCGCCCCGC CCTACGTGGT GTTCGTCCAC GGTGGCCCCA CCGGGCGCAG CACCGGCGCC
CTCGACCTGG AGAAGGCGTT CTTCACCAGC CGGGGCATCG GCGTGCTCGA CCTCAACTAC
GGCGGTTCCA CCGGCTACGG CCGCGCCTAC CGCGACCGGC TGCGCGGCCA GTGGGGCGTG
GTCGACGTGG AGGACTCGGT CGCCGCCGCC GAATGGCTGG CCGCCGAGGG CCTGGCCGAC
CCGGAGCGGA TCGCGATCCG GGGCGGGAGC GCCGGCGGCT GGACGGTCAT GGCCGCCTGC
TGCGCGTCCG AGGTGTTCGC CGGCGGGGTC TCCTACTACG GTGTGAGCGC GCTCGCCTCG
TTCGTCGCGA CCACCCACGA CTTCGAGTCC CGCTACATCG AGTGGCTGGT GGGCCCCGAG
GATCCCGCCC TGTACAGCTC GCGCGAGCCG CTCGGCCAGG TCGCCGGGGT GAGCTGTCCC
ATGCTCCTCC TGCAGGGGCT GTCCGACCCG GTGGTCCCCG CCGCCCAGTC TCAGGCCTTC
GCCGACGCCC TCGCCGAACG CGGCGTGCCG TGCACCTACC TCACGTTCGA GGGCGAGGCC
CACGGCTTCC GCCGCGCCGA GACCCGCAGC GCGGCTCTGG CCACCGAGCT CGCCTTCTAC
CAGCAGATCT TCCGGAGCTG A
 
Protein sequence
MSPEQVVLPY ATWPSPISST GVARSGLRLG FPTVVGEEVW WTEDRPAESG RTTIAHRAAD 
GTHRELLSAP WSARTRVHEY GGRSYAVVPG GGVVFANLAD QRLYLLPPCA EPRPITPRPD
RESGLRYADM IVHDGQVWCV QERHHDGGGI SRSIVSVPLD GGDVPRERVG GSDFYACLAL
SPDGEHLAYI CWDHPRMPWN GTELRITRLA DGTSWTVGGG PSESVLAPQW RDDRHLYLVS
DRSGWWNLYQ IGIDGTSPRA LHPVEEEFAG PLWQLGGPPY RVLADGRIAV LHGRGDMRLG
VLDPDSGVLT DLDVPYDGWE QVLASDGRVL AGIGYSATVP RSIVRVDTAT GRAEELRRDV
DELPDLAYLP LARTVEIEGR SGRRVHAFVH PPSNPQARGD GAPPYVVFVH GGPTGRSTGA
LDLEKAFFTS RGIGVLDLNY GGSTGYGRAY RDRLRGQWGV VDVEDSVAAA EWLAAEGLAD
PERIAIRGGS AGGWTVMAAC CASEVFAGGV SYYGVSALAS FVATTHDFES RYIEWLVGPE
DPALYSSREP LGQVAGVSCP MLLLQGLSDP VVPAAQSQAF ADALAERGVP CTYLTFEGEA
HGFRRAETRS AALATELAFY QQIFRS
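
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not part of the record, of how the block formatting might be stripped and the GC fraction computed. The helper names `clean` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used to keep the example short.

```python
def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the record.
# Paste the full sequence instead to compute the gene-wide GC fraction.
first_line = "ATGTCCCCCG AGCAGGTCGT CCTGCCGTAC GCCACCTGGC CTTCCCCGAT CTCCAGCACC"
last_line = "CAGCAGATCT TCCGGAGCTG A"

head = clean(first_line)
tail = clean(last_line)

print(head[:3])   # ATG (start codon)
print(tail[-3:])  # TGA (stop codon)
print(round(gc_fraction(head), 2))  # GC fraction of the first 60 bases only
```

Running `gc_fraction` on the full cleaned gene sequence gives a value that can be compared with the GC content field in the record.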