Gene Sros_3412 details


Gene Information

Locus tag: Sros_3412
Symbol:
ID: 8666700
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3749564
End bp: 3750925
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_003339092
Protein GI: 271964896
COG category:
COG ID:
TIGRFAM ID:
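
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3749564
END_BP = 3750925
GENE_LENGTH_BP = 1362
PROTEIN_LENGTH_AA = 453


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: 3750925 - 3749564 + 1 = 1362 bp, and 1362 / 3 - 1 = 453 aa.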


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.731274
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGGTCC TGATCCAGCT CCGCCCCTCC CCGGACGTCG TCGCGGCGGT GGCCGATCCC 
GATGTGACCG CGACCACGGC CGACGTCGCC GACGGCCTGC CCGGGGTCGT CCTGGACCCC
TCCTTCACGC CGGTCGCCGT GCCGCGGCCG GTGCCCGCCG CCGCCGACGG CGACCCCCTG
TCCCTGAACC AGTCGCTGAC CTTCTCCCTC GCCGCCGGCG ACGCCTCGGT CATGGTCCGC
GGGGAGATCT CCGACGACGA GCTGTCCACC CGGGTCACCC TGCTGCCCAC CCTCCGCAAC
GACGTCGTGG GGGTGTTCGC CGACCCGGTG ATCGAGTCGA ACCTCACCTG CGGCGGCGAC
GCCCCGGTGG GCGACTGGCA CGACGTGGAG CGGCTGCTGC ACGTCGCCGA GCTGCACGCG
GAGGGCCTGG ACGGCTCAGG CGTGGCGCTG GCGGTGCTGG ACACCGGCAT CAACGCCGCG
CACGTGGCCC GCCACCTCGG CCGGGACCTG CTGCTGGACA AGGAACGGAG CTGGAACCCC
GACGGGGTGA CCGGCAGGCC GGGCGAGTTC GAGGTCGACC ACGGCACGAT GTGCGCGTTC
GACACGCTGA TCGCCGCCCC GCGGGCCACG CTGATCGACA TTCCCGTGCT GCTCTCCCGG
CGCCCCGGCG GTTCGGCCCT CGACGGCCTG CTGTCGGACG CGGTGGCGGC CTTCGCCCAC
CTGCGCACCG TCCTCGAGGC CCAGCCCGCG GAGACACGGT CCCTGGTGGT CAGCAACAGC
TGGGGCTCCT TCTCCCCCCG CTGGGACTTC CCCGTCGGCC ATCCCGGCAA CTACTCCGAC
AACCCGGCCC ACCCGTTCAA CCTGATCGTC GCCAGCCTGG AGCAGGCGGG CGCCGACGTG
CTGTTCGCCG CCGGCAACTG CGGGCGTGAC TGCAGGGACG GCCGGTGCGC GTATCCGAAC
CGGCCGATCG CGGGCGCCAA CTCCCACCCG GGCGTGCTGT CCATCGGCGG CGTGGACACC
GGCGGGCAGC GGGTCGGATA CTCCTCCCAG GGCCCCGGCC GCCTCACCCT CCGCAAGCCC
GACATCTGCT CCTACACCCA CTTCTCCGGT TCCAAGGCCT TCGGCGCCGG CGAGCCCGAC
TCGGGCACCT CGGCCGCCTG CCCCGTGGCC GCCGGCCTGG TCGCGGCCAT CCGCACCCGG
TGGCCCGTCT CCGCCCTCTC CCCCGCCCAG CTGCGCACCC TGCTCCGCCG CACCGCCGAC
GACCGCAGCG ACATCGGGTT CGACTACGAC TACGGCTACG GCGTCACCGA CACACCGGGG
GTGCTCGCGT CGTTGCGGCG CAGGGCGATG CGCGTAGCGT GA
 
Protein sequence
MRVLIQLRPS PDVVAAVADP DVTATTADVA DGLPGVVLDP SFTPVAVPRP VPAAADGDPL 
SLNQSLTFSL AAGDASVMVR GEISDDELST RVTLLPTLRN DVVGVFADPV IESNLTCGGD
APVGDWHDVE RLLHVAELHA EGLDGSGVAL AVLDTGINAA HVARHLGRDL LLDKERSWNP
DGVTGRPGEF EVDHGTMCAF DTLIAAPRAT LIDIPVLLSR RPGGSALDGL LSDAVAAFAH
LRTVLEAQPA ETRSLVVSNS WGSFSPRWDF PVGHPGNYSD NPAHPFNLIV ASLEQAGADV
LFAAGNCGRD CRDGRCAYPN RPIAGANSHP GVLSIGGVDT GGQRVGYSSQ GPGRLTLRKP
DICSYTHFSG SKAFGAGEPD SGTSAACPVA AGLVAAIRTR WPVSALSPAQ LRTLLRRTAD
DRSDIGFDYD YGYGVTDTPG VLASLRRRAM RVA
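
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, assuming the sequence is pasted as text with the 10-base blocks separated by whitespace. It uses the codon assignments of NCBI translation table 11, does not handle the alternative start codons that table allows (this gene starts with ATG), and the helper names are illustrative.

```python
BASES = "TCAG"
# Amino acids for translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGCGGGTCC TGATCCAGCT CCGCCCCTCC"))
```

Applied to the first 30 bases of the gene sequence, `translate` gives `MRVLIQLRPS`, which matches the start of the protein sequence. Passing the full gene sequence to `gc_content` is one way to compare against the GC content field.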