Gene Sros_8948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8948 
Symbol 
ID8672290 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp9891927 
End bp9894014 
Gene Length2088 bp 
Protein Length695 aa 
Translation table11 
GC content68% 
IMG OID 
ProductOligopeptidase B 
Protein accessionYP_003344323 
Protein GI271970127 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCAGCC AGACGCCGCC TACGGCGAAG AAGGTTCCCA GCGAGCGCGT GCACCACGGG 
GACACCGTGG TCGATGACTA CGCGTGGCTG ACGGTCAAGG ACGACCCGGA GACGGTCGCA
TATCTGGAGG CGGAGAACGC CTACACGCAG CAGGCCACCT CCCACCTCAA GCCTCTGCAG
GAAACGATCT TCCAGGAGAT CAAGGACCGG ACCCTGGAGA CGGACCTCTC GGTCCCCACC
CGCAAGAACG GCTGGTGGTA CTACTCCCGG ACCGAGGAGG GCAAGCAGTA CGGCATTCAG
TGCCGGGTGG CCGCCGACGG TGAGACTCCC CCGGAGCTGA AGGCGGGCGA GTCGCTCCCC
GGCGAGCAGA TCCTGCTCGA CGGGAACGAA CTGGCCGGGG AGAGCGACTT CTTCTCCATC
GGCACCAGCG CGGTCAGCCC CGGCGGCGAC CTGCTGGCCT ACTCCGTCAA CTTCACCGGC
GACGAGCGCT TCACCCTGAA GGTGAAGGAC CTCACCACGG GGGAGACCCT CCCCGACGAG
ATCCCCGGCA TCTTCTACGG CGGGGCGTGG TCGGCCGACG GCACCGCGTT CTTCTACACG
ACGGTCGACG AGGCCTGGAG GCCCAACCGG GTCCACCGGC ACACGATCGG CACCCCGGCC
GAGGAGGACG TGCTCGTCCA CGAGGAAGGT GACGAGCGGT TCTGGATCGG CATCGGGCTC
AGCCGGAGCG AGCGCTACCT CGTGCTCTCC GCCGGAAGCA AGATCACCAG CGAGGTCCGC
ATCCTTGAGG CGGACGACCC GGCGGGTGAG TTCCGGCTCG TCCGGCCGCG CGAGACCGGT
GTCGAGTACG GCATCGACCA CGCCGGCGAC CACTTCCTGG TCCTGCACAA CCGGAACGCG
GAGAACTTCG AGCTGGCCAC CGCGCCGCTC GACGCCCCGG GCGACTGGAC CCCGCTGATC
GAGCACCGCG AGGACACCCG GCTGCTCGAC GTCGACGCCT TCCAGAGCCA CACCGTCGTG
CACTTCCGCC GGGACGGCCT CACCGGCATC CGCATCCTGC CCCGGGACGC CGGGACGTAT
CGGAGCGAGC GTCTCGACTC CGCTTACGAG ATCTCCTTCC CTGAGCCGAT CTACGACGTC
TCGCCGGCGG GTAACCCGGA GTTCACCACC GAACGGCTCC GCCTCGGCTA CACCAGCATG
ATCACTCCGC CCTCGGTCTA CGACTACGAC CTGCGGGCGC GCGAGCTGAT CCTGCTGAAG
CAGAAGGTCG TGCTCGGCGG CTACGACCCG GCCGACTACG AGCAGTTCCG CGAGTGGGCC
ACCGCCGCCG ACGGCACCCG GGTCCCCATC TCGATCGTGG CCAGGAAGGG CACCGCGCTG
CCCGCGCCCA CCGTGCTGTA CGGCTACGGC AGCTACGAGA TCTCGATCGA CCCGTCGTTC
TCGGTGGCCC GGCTGAGCCT GCTCGACAGG GGCTTCGTCT TCGCCATCGC GCACGTCAGG
GGCGGCGGTG AGATGGGACG CCGCTGGTAC GAGGACGGCA AGTTCCAGAA GAAGAAGAAC
ACCTTCACCG ACTTCGTGGC CGCCGCCGAG CACCTCAAGG CGGCGAACCG GTCGAGCGCG
ATCATCGCCA GGGGCGGTTC GGCCGGCGGC CTGCTGATGG GCGCGGTCAC CAACCTCTCG
CCGGAGACCT TCGCGGGCGT GGTCGCCGAG GTGCCGTTCG TGGACGCGCT GAACACGATC
CTCGATCCGT CGCTGCCCCT GACGGTCATC GAGTGGGACG AGTGGGGCGA CCCGCTGCAC
AACGCGGACG TGTACGAGTA CATGAAGTCC TACTCGCCCT ACGAGAACGT GGACGACCGG
GTCTACCCGC CGATCCTCGC CATCACCAGC CTCAACGACA CCCGTGTCTT CTACCACGAG
CCCGCCAAGT GGATCGCGAA GCTCCGCGAG GTCGCGGACG GCGGCCCCTT CCTCCTCAAG
ACGGAGATGG GTGCCGGGCA CGGCGGCCGC AGCGGCAGGT ACGACTCCTG GCGCGAGGAG
GCCCTCACCC TCTCCTGGAT CCTCGACACG GCCAAGGTGA CCCGATGA
 
Protein sequence
MSSQTPPTAK KVPSERVHHG DTVVDDYAWL TVKDDPETVA YLEAENAYTQ QATSHLKPLQ 
ETIFQEIKDR TLETDLSVPT RKNGWWYYSR TEEGKQYGIQ CRVAADGETP PELKAGESLP
GEQILLDGNE LAGESDFFSI GTSAVSPGGD LLAYSVNFTG DERFTLKVKD LTTGETLPDE
IPGIFYGGAW SADGTAFFYT TVDEAWRPNR VHRHTIGTPA EEDVLVHEEG DERFWIGIGL
SRSERYLVLS AGSKITSEVR ILEADDPAGE FRLVRPRETG VEYGIDHAGD HFLVLHNRNA
ENFELATAPL DAPGDWTPLI EHREDTRLLD VDAFQSHTVV HFRRDGLTGI RILPRDAGTY
RSERLDSAYE ISFPEPIYDV SPAGNPEFTT ERLRLGYTSM ITPPSVYDYD LRARELILLK
QKVVLGGYDP ADYEQFREWA TAADGTRVPI SIVARKGTAL PAPTVLYGYG SYEISIDPSF
SVARLSLLDR GFVFAIAHVR GGGEMGRRWY EDGKFQKKKN TFTDFVAAAE HLKAANRSSA
IIARGGSAGG LLMGAVTNLS PETFAGVVAE VPFVDALNTI LDPSLPLTVI EWDEWGDPLH
NADVYEYMKS YSPYENVDDR VYPPILAITS LNDTRVFYHE PAKWIAKLRE VADGGPFLLK
TEMGAGHGGR SGRYDSWREE ALTLSWILDT AKVTR