Gene Strop_0471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_0471 
Symbol 
ID5056910 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp533303 
End bp534604 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content66% 
IMG OID640472744 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001157334 
Protein GI145593037 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0717859 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCTCG CCCCCCGCCC GGCAACCACC ATCGCCGCCG CAACCCTCGG CGCCATAGTG 
GCCCTCGGCG TGTCAGTCCT GCCGGCACCC GCCGCTGACC GAAAGGACGC CTGGCACCTC
GACGCCCTCG AACTAGCCGA CATGCACAAG ATCACCCAAG GCGAAGGAAT CACCGTCGCC
GTCATCGACT CCGGTGTAGA CGCCACCCAC CCCGACCTCA AAAACAACGT ACTACCCGGC
ATAGACTTCT TCGACGAACA GGCCAGCGGC CACGAGGACC GAAGCGGGCA CGGCACCGCG
ATGGCTTCCC TAATCGCCGG GCACGGACAC GGTCCCGGCG GCCATGAAGG AGTGCTTGGC
GTCGCTCCCA AAGCCAAAAT CCTGCCAATC ACCGTGCGAG CACCCGAAGG CAAGTCGAAC
TACTCGCTGG AAGCAATCGC GCTAGGAATC CACTGGGCGA TCGAGCAGGA CGTGGACGTC
ATCAACATCT CCCTGGGCGG CCCACAAAAC GTCGAACTCA GCCAGGCCGT GGAACGGGCA
TATCAGAACA ACGTGATCGT CGTCGCTGGT GTCGGCAACA AGGAAAATGC GGCAATCGGC
AGCCCGGCAA ACCTGCCCGG CTCCCTCGCG GTAACCGGCA CCGACCGCGA CGGAATGCCC
AGCAGCATCG CCTCACTCCC CGCCGCGCAG ACCCACCTCG CCGCACCCGC CGAGGACCTC
TACCAAGCAG TACCCGGCGG CGGGTACGCA ACAATCACCG GAAACAGCGG CGCCACCGCG
CTCGTCTCCG GCGCGGTCGC CCTGGTCAAG TCGAAGTACC CGAACCTCAA CTCACTGGAT
CTGTTCAAAC GGATGGTGGA AACCACCCGC GACGCCGGTA AACCAGGCAA GGACTTCGAC
TATGGCTGGG GCCAACTCGA CCTACGCGCA GCCCTGACCG GCGAACCAGA CGGCCGCGCC
AGCCGAACCC AGGCCCCCAG TCAAGAGCCC CAGCTCGACC CGACCCTCGC CCGAGCTCGG
GCGCAGGGGC CGGCGGGGGA AAACGAAGCC CTCGTAATGG CGCTCTCGAT CGCGTTCATC
CTCGTCGTCC TCGGCCTCAT CGCGACGCCC GTGCTACTCC TGCTGCGATA CCGACGCCGA
CGACGAGCCA CAACACCCGC GATGGAAGCC GCCACCGCGA CCCGGACATC CGCCACCGGG
CTCGCGTCGC CACCCACCGC AACAACCAAC CCATCAGAGA GCGGACCACC AGCCCACCCC
ACCGATCCAG CGGACGACTC ACCCTGGCGC CGCCCCGACT GA
 
Protein sequence
MRLAPRPATT IAAATLGAIV ALGVSVLPAP AADRKDAWHL DALELADMHK ITQGEGITVA 
VIDSGVDATH PDLKNNVLPG IDFFDEQASG HEDRSGHGTA MASLIAGHGH GPGGHEGVLG
VAPKAKILPI TVRAPEGKSN YSLEAIALGI HWAIEQDVDV INISLGGPQN VELSQAVERA
YQNNVIVVAG VGNKENAAIG SPANLPGSLA VTGTDRDGMP SSIASLPAAQ THLAAPAEDL
YQAVPGGGYA TITGNSGATA LVSGAVALVK SKYPNLNSLD LFKRMVETTR DAGKPGKDFD
YGWGQLDLRA ALTGEPDGRA SRTQAPSQEP QLDPTLARAR AQGPAGENEA LVMALSIAFI
LVVLGLIATP VLLLLRYRRR RRATTPAMEA ATATRTSATG LASPPTATTN PSESGPPAHP
TDPADDSPWR RPD