Gene Strop_0039 details

Gene Information

Locus tag: Strop_0039
Symbol:
ID: 5056470
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 45230
End bp: 46789
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 68%
IMG OID: 640472304
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_001156902
Protein GI: 145592605
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
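
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (both assumptions are consistent with the values listed, but are not stated by the record). The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


length = gene_length_bp(45230, 46789)
print(length)                     # 1560
print(protein_length_aa(length))  # 519
```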


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTCTAT CCCGAAGGTC CGTGTTCGTT GGACTAGCCA CGATGGCCAT GGTGGCGTCC 
GCGACCCCCG CCATGGCCGC TGAACCAGTT GGTGAGATCC GTAGTGCGGG GGGCGCTACG
GCCGTAACCG ACAGCTACAT CGTGGTCTTC AAGGACAACG TGGTCAGCCG TGCCACGGTC
GAGACGTCGG TTGATCGTTT GGTGGACCGG CACGGTGGCC AGGTAAGCCG GACGTACAGC
ACCGCGCTCC GCGGAGCCGA ACTGCGGGTG GATGCCGGTG CCGCCGCCCG AATCGCGGCC
GACCCGGCGG TGGCGTACGT GGAGCAGAAC CACCGGGTGT CGATCACCGA CACCCAGACC
AACCCCCCGT CCTGGGGTTT GGACCGAGTT GACCAGCGCG ATCTGCCGCT GGACAACTCC
TACACCTATC CGAACACCGC CAGTGACGTG AACATCTACA TCCTCGACAC CGGCATCCGC
ACCACCCACC AGGACTTCGG CGGCAGGGCC ACCTGGGGCA CCAACACCGC CGACAACAAC
GATACCGACT GCAACGGGCA CGGCACGCAC GTCGCCGGCA CCGCTGCCGG CACGGCGCAC
GGCATCGCCA AGGAGGCCAA CCTGGTGGCG GTGAAGGTGC TGGACTGCGC GGGCAACGGC
ACCTTCGCCG GGGTCGTGGC CGGCGTCGAC TGGGTGACCG CGAACGCGGT CCAGCCCGCG
GTGGCGAACA TGAGCCTCGG TGGCGGTGCG AACAGCGCGC TGGACAACGC GGTGAGCAAC
TCGATCGACT CCGGTGTCAC CTACGCGCTG GCGGCGGGCA ACAGCAGCGC CAACGCCTGT
AACTACTCAC CGGCCCGTAC CCCGGACGCG ATCACCGTCG GGTCTACGAC CAGCACTGAT
GGACTGTCCT GGTTCTCCAA CATCGGCACC TGTCTGGACA TCTTCGCGCC GGGCTCGTCG
ATCACCGCGC CGTGGATCAC CAGTGACACC AGCACGAACA CGATCAGCGG CACGTCGATG
GCATCGCCGC ATGTCGCGGG TGCCGCGGCG TTGGTCCTGT CGGCCAACCC CTCGTACACC
CCGCAGCAGA TTCGGGACGA GCTAGTCGAC AACGCCACCG ACGGCGCGAT CGGCTCCCCC
GGCAGCGGCT CGCCGAACAA GCTCCTCTAC GTCGGTGACG GCGGCACCAC GCCTCCGCCG
CCTCCGCCGC CGGGCTGCTC CGGCACCAAC GACACCGACG TGGCGATCCC GGACGCCGGT
TCCGCGGTGA CCAGCTCGAT CACCATCGCC GGCTGCGACC GGGACGCCGC CGCCACCTCG
ACCGTGGCCG TGGACATTCC CCACACCTGG CGGGGTGACC TCGTCATCGA CCTGATCGCG
CCGGACGGCT CGTCCTACCG GCTGAAGACC AACAACCTGT CCGACTCCGC CGACAACGTC
AACGAGACCT ACACGGTGAA CCTCTCCAGC GAGGCAGCCG ACGGCACCTG GCAGCTCCAG
GTCCGCGATG TCTACCGCCA GGACACCGGC TACATCGACA CCTGGACCCT GACGGTCTGA
 
Protein sequence
MGLSRRSVFV GLATMAMVAS ATPAMAAEPV GEIRSAGGAT AVTDSYIVVF KDNVVSRATV 
ETSVDRLVDR HGGQVSRTYS TALRGAELRV DAGAAARIAA DPAVAYVEQN HRVSITDTQT
NPPSWGLDRV DQRDLPLDNS YTYPNTASDV NIYILDTGIR TTHQDFGGRA TWGTNTADNN
DTDCNGHGTH VAGTAAGTAH GIAKEANLVA VKVLDCAGNG TFAGVVAGVD WVTANAVQPA
VANMSLGGGA NSALDNAVSN SIDSGVTYAL AAGNSSANAC NYSPARTPDA ITVGSTTSTD
GLSWFSNIGT CLDIFAPGSS ITAPWITSDT STNTISGTSM ASPHVAGAAA LVLSANPSYT
PQQIRDELVD NATDGAIGSP GSGSPNKLLY VGDGGTTPPP PPPPGCSGTN DTDVAIPDAG
SAVTSSITIA GCDRDAAATS TVAVDIPHTW RGDLVIDLIA PDGSSYRLKT NNLSDSADNV
NETYTVNLSS EAADGTWQLQ VRDVYRQDTG YIDTWTLTV