Gene Sros_8172 details

Gene Information

Locus tag: Sros_8172
Symbol:
ID: 8671500
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 9012324
End bp: 9014306
Gene Length: 1983 bp
Protein Length: 660 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: Subtilisin-like protein serine protease-like protein
Protein accession: YP_003343566
Protein GI: 271969370
COG category:
COG ID:
TIGRFAM ID:
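
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 9012324
end_bp = 9014306

gene_length = end_bp - start_bp + 1         # inclusive span in base pairs
codons, remainder = divmod(gene_length, 3)  # a CDS should be a whole number of codons
protein_length = codons - 1                 # assumption: last codon is the stop codon

print(gene_length, remainder, protein_length)  # 1983 0 660
```

Under these assumptions the computed values agree with the Gene Length (1983 bp) and Protein Length (660 aa) fields.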


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGACTTC GAGCCCTCGT GGCCATGGCG ATCGTGCTCA CCGTCGCCCC TGGCGTCCCC 
GCCCTCGCCG AGGAGCCGCC ACCACCCCCT CCGGCGGCCA CCGCGGAGAC ACCGGCCCCC
ACCGGGGAGA CACCGGCTCC CACCGGGGAG ACACCGGCTC CGGCGGAACA GCCGGCGGCG
CCCAAGCTGG AACCCGGGCT GGCCGCCGAC GGCGACGGGG CCCGGGTCAT CGTCGAGGTC
ACCGCGCCGG CGGAGGCCGC ACCCGTCGCG GGCCAGGCCC AGGACCTGCC CGGCGCCGAG
GTCGTCCTCC AGCCGCCGGA CACCTCGTTC ATCGTGGTCG AGGGCACGAG CGAGTCGCTG
GCCGCGCTGG CCCAGGACCC CCGGGTGGTG TCGGTCCGCC GGGACCGCGC CTACTCCCCC
GTCTCGCTGG CTTCGGGTCT GAAGCTGATC GGGGCGGACC AGGCCCAGGC CGAGGGGGCC
ACCGGCGAGG GCAAGATGAT CGCGGTCATC GACACCGGGA TCGACCGGGA CCATCCCGCG
CTGGCGGGCA AGGTGGTGGA GGAGGCCTGC TTCTCCGCGA CGGACGGCGG CGCGAAGTCC
CTGTGCCCCG GCGGGGCTGA CACGCAGACC GGTCAGGGTT CCGCCGACGC GAAGACCCCC
ATGTGCGTGG AGGGCGCCGT CAACCTGTGC GACCACGGCA CCCACGTGGC GGGCATCGCC
CACGCCGTCG CACCCGGCGC GGACATCGCG GCGATCCAGG TCTTCAGCCG CATCGACGAC
TGCGAAGGCG GGGAGGCCTG CCTGAGCGCC TACGAGTCCA CGATCCTGCT CGCCCTCGAC
CACGTCGCCA AGCTGAAGGA CTCCCACCCC GGCCTCGTCG CGGTGAACCT CAGCCTCGGT
GGCGGCCTCT ATGAGGGGGC GTGCGACGGC GCACCCGAGA TCGGGGCGAT GAAGCAGAAG
ATCGAGACCC TGCGCGCCAA GGGCGTGGTG ACCGTCGCCG CCGCGGGGAA CGAGGGGATG
TCCGGTGCCG GCGCTCCGGG ATGCATCTCC GGCGCGGTGA CCGTGGGCGC GACCGGTGAC
GACGACCGCG TCCCCGAGTG GTCCAACTAC GGCTCCGTGC TCGACCTCTT CGCCCCCGGT
GTCGAGATCG ACTCCGCGGT GCCGAACGGC GGCACCGCGG TCTACAGCGG GACCTCGATG
TCGACCCCGC ACGTCACGGG CGCGCTGGCG GTGCTGGCCG GGAAGTCGGC GGACGTCACC
CCGGACGCGC TTGTCGGCAA GCTCACCGCG GCGGGCCGCC CCATCGTCTA CGACGGAGTG
ACCACGCCCC GCCTCGACCT GTACGGCGCG CTCACCGGCC GCGCGCCCTC GCCCCCCGCC
ACCCAGGATC CCGCCCCCGG GGACGGCTCC ACACCGGACG ACCCCGGCGA CGACCCCGAT
CCCAACCCGA GCTCCGGCCC GTCCCCGGTC CCCGACCCGG TGGACCCGCC CGCGCCCAGC
CCGGTCCCGC TGCCCACGGT GACCGTCACC GTGACCGTGA CCGCCACGCC CACCACCGCG
CCGGTGGTGT GCAGCCGGGG CAAGGCCGCC AAGACCCTGA CCGCCGCCGG ATGGGCCACG
GAGATGACCC GGGGCAAGGG GGAGCTCTCG GACGAGACGC TCAGCTGTTA CCTGCGCCTG
GTCGCGAAGG CCAGCGACGT CTTCCCCGAG CTCACCCGCG CCTCCACGCC GGGTACGGCC
TACCGGGTGC TCAAGCCCGC CAAGAAGACG AAGCTCACCC AGAAGATCAA GATGGAGAGC
GAGCTGCTGG CCGCCTGGCT GAACTGGGCG CACGGCGGGG TCAACTTCAC CGCGAAGATC
AGCAGGTCGA CCACGGTCAG GGACACTCTC ATCGCCGCGG AGAGGCAGCG GCTGAAGGGC
TCCTCCCTCT CCGAATACAC CTCAATACTG AAAAAGCATG TGAACGCCCG CCGTATCGCA
TAA
 
Protein sequence
MRLRALVAMA IVLTVAPGVP ALAEEPPPPP PAATAETPAP TGETPAPTGE TPAPAEQPAA 
PKLEPGLAAD GDGARVIVEV TAPAEAAPVA GQAQDLPGAE VVLQPPDTSF IVVEGTSESL
AALAQDPRVV SVRRDRAYSP VSLASGLKLI GADQAQAEGA TGEGKMIAVI DTGIDRDHPA
LAGKVVEEAC FSATDGGAKS LCPGGADTQT GQGSADAKTP MCVEGAVNLC DHGTHVAGIA
HAVAPGADIA AIQVFSRIDD CEGGEACLSA YESTILLALD HVAKLKDSHP GLVAVNLSLG
GGLYEGACDG APEIGAMKQK IETLRAKGVV TVAAAGNEGM SGAGAPGCIS GAVTVGATGD
DDRVPEWSNY GSVLDLFAPG VEIDSAVPNG GTAVYSGTSM STPHVTGALA VLAGKSADVT
PDALVGKLTA AGRPIVYDGV TTPRLDLYGA LTGRAPSPPA TQDPAPGDGS TPDDPGDDPD
PNPSSGPSPV PDPVDPPAPS PVPLPTVTVT VTVTATPTTA PVVCSRGKAA KTLTAAGWAT
EMTRGKGELS DETLSCYLRL VAKASDVFPE LTRASTPGTA YRVLKPAKKT KLTQKIKMES
ELLAAWLNWA HGGVNFTAKI SRSTTVRDTL IAAERQRLKG SSLSEYTSIL KKHVNARRIA
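
The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11, GTG can act as a start codon, and an initiating start codon is read as methionine; an internal GTG still encodes valine. The sketch below illustrates this on the first 21 bases of the gene sequence. The function name, the partial codon table, and the reduced start-codon set are assumptions made for illustration, not a complete implementation of table 11.

```python
# Subset of the start codons allowed by translation table 11
# (assumption: deliberately not exhaustive).
START_CODONS = {"ATG", "GTG", "TTG"}

# Partial codon table covering only the codons used in this illustration.
CODON_TABLE = {"GTG": "V", "CGA": "R", "CTT": "L", "GCC": "A", "CTC": "L"}

def translate_prefix(dna: str) -> str:
    """Translate complete codons; an initiating start codon is read as Met."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = []
    for index, codon in enumerate(codons):
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # start codon initiates with methionine
        else:
            residues.append(CODON_TABLE[codon])
    return "".join(residues)

# First 21 bases of the gene sequence above, spaces removed.
prefix = "GTGCGACTTCGAGCCCTCGTG"
print(translate_prefix(prefix))  # MRLRALV
```

The result matches the first seven residues of the protein sequence: the leading GTG yields M, and the GTG at codon 7 yields V.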