Gene Sros_3301 details

Gene Information

Locus tag: Sros_3301
Symbol:
ID: 8666589
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3595107
End bp: 3596330
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: putative serine protease
Protein accession: YP_003338983
Protein GI: 271964787
COG category:
COG ID:
TIGRFAM ID:
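
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The short Python sketch below checks the arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the record does not state either, and the variable names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 3595107
end_bp = 3596330

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1224 0 407
```

A remainder of 0 confirms that the gene length is a whole number of codons.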


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.44867
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCAAGA CGCTCCTCAC AGCTCTCTTC GCAAGTTCCC TCGCTATGAC GGCGATTCCG 
GCCACGGCCG CGATCGCCGC CGCACCGGAC CCTGTCAGCA ACTACATCGT CGTCCTGCAG
GACGGCACGG ACCCCTCGGC CTTCGCCGGC ACCCAGGCAC GTTCCCGTGG CGCCGCGGTG
GACAAGATCT TCAACCACGC GCTCCGCGGC TACTCGGCGA AGATGAGCGC CACCGCGGCC
GCCGCCGTCG CCCGCGACCC GCAGGTGCAG TTCGTGCAGC CTGACGGGGT GGTGTCGATC
AGCGCCCAGA CGCTGCCCAC AGGGGTCAAC CGGGTCGACG CCGAGCTCAG CCCCACCGCC
GCCATCAACG GCGTGGACAC GCGGGTCAAC GTCGACGTGG CGATCATCGA CACCGGCATC
CAGCTCACCC ACCCCGACCT GAACGTCTAC ACCGCGGGGG CCAAGAACTG CAACACCGGC
ACGAGCGCCA ACGACGGCCA CGGCCACGGA ACACACGTGG CGGGCACGGT CGGGGCGCTG
GACAACACCA GCGCCGTCGT CGGCGTGGCA CCTGGCGCCC GCCTGTGGCC GGTGCGCGTG
CTGAACAACA GCGGCGGCGG CAGCTGGTCG CAGGTGATCT GCGGCATCGA CTACGTCACC
GCCCACGCCT CCGAGATCGA GGTCGCGAAC ATGAGCCTCG GCGGCCTCGG CGCCGACGAC
GGCAACTGCG GCAACACCAA CAACGACGCC ATGCACCGGG CGATCTGCGC CGCCGTCGCG
GCGGGCGTGA CCTTCGTGGT CGCGGCCGGC AACGAGACCG ACAACGCGGC CAACCACGTG
CCCGCGGCGT ACGACGAGGT CATAACGGTC AGCGCGCTGG CCGACTTCAA CGGGCTTCCC
GGTGGCGGGG CGGCGTCCAC CTGCCGCAGC GACGTCGACG ACACGTTCGC CAGCTTCTCC
AACTACGGCG CCGACGTGGA CATCATCGCC CCGGGCGTGT GCATCCTGTC CACCTGGAGG
AGCAGCGGCA CCAGCACCAT CTCGGGCACC TCGATGGCCA GCCCGCACGT TGCCGGTGGA
GCGGCCCTCT ACAAGGCCAC GCATCCGGCG GCGACGCCGG CGGCGGTGAA GTCCGCGCTC
CAGGCGGCGG GCACCACCAA CTGGAACAAC GCCGACGACC CTGACGGCAT CAAGGAGAAG
CTGCTCAACG TCGCCACCTT CTGA
 
Protein sequence
MRKTLLTALF ASSLAMTAIP ATAAIAAAPD PVSNYIVVLQ DGTDPSAFAG TQARSRGAAV 
DKIFNHALRG YSAKMSATAA AAVARDPQVQ FVQPDGVVSI SAQTLPTGVN RVDAELSPTA
AINGVDTRVN VDVAIIDTGI QLTHPDLNVY TAGAKNCNTG TSANDGHGHG THVAGTVGAL
DNTSAVVGVA PGARLWPVRV LNNSGGGSWS QVICGIDYVT AHASEIEVAN MSLGGLGADD
GNCGNTNNDA MHRAICAAVA AGVTFVVAAG NETDNAANHV PAAYDEVITV SALADFNGLP
GGGAASTCRS DVDDTFASFS NYGADVDIIA PGVCILSTWR SSGTSTISGT SMASPHVAGG
AALYKATHPA ATPAAVKSAL QAAGTTNWNN ADDPDGIKEK LLNVATF
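
The gene and protein sequences above can be cross-checked by translating the DNA and by computing its GC content. The following is a minimal Python sketch, not the database's own method. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs only in the set of permitted start codons, which does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (the layout NCBI uses).
# Table 11 has the same codon-to-residue assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks that group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence above (both start in frame).
first_line = "ATGCGCAAGA CGCTCCTCAC AGCTCTCTTC GCAAGTTCCC TCGCTATGAC GGCGATTCCG"
last_line = "CTGCTCAACG TCGCCACCTT CTGA"

print(translate(first_line))  # MRKTLLTALFASSLAMTAIP
print(translate(last_line))   # LLNVATF
```

Both results match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare against the 70% GC content reported in the record.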