Gene Strop_3040 details

Gene Information

Locus tag: Strop_3040
Symbol:
ID: 5059504
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 3470672
End bp: 3471850
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 68%
IMG OID: 640475290
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_001159855
Protein GI: 145595558
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
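
The coordinate and length fields above are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its convention) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
START_BP, END_BP = 3470672, 3471850

length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints: 1179 392
```

Both results agree with the Gene Length and Protein Length fields listed in the record.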


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.667791
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.694624
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTCCGT CCTCAGAACG CGTCAACCGA TTCGCTGCCC GCCGTCGGGC GCGGCTGGAG 
GGCCTTCCAA AGCTCGCGCA GACCGCTGTC GGCGGTAGCG CCCGGGCGTG GTTCGTCGCT
GACGAGTTAC TGGTCGTCGA CGACAGTCGC CGGGATGCCG AACGCTATCT TGGACGTGCT
CGGGCCGCGC AGCCTGACGC CGGCGACGAG GAGTTACTGC CCGGCCTGCG TCGCTATCGG
GCGCCGGGGC TGGACGTACC GGCTGCGGTT CGCGCGCTGC GCTCGGGTCG TCCGGCCGGC
AACCAGGTGG TCAGCCCGAA CCATGTCTTT CTGTCCAGTC CGTTCAACCA CGGAGGTCCG
TTCGGGCCGC CGGCGCCCGT AGCCGCATCG ACGTTCAAGA TGCCGGCCGA GACCGATCGG
GTCGCGGTAT CCATCGTTGA CACCGGGTTC TGGACCGAGA CCCCCCTTCC GGCCGACTAC
CTCGCCTCGG ACGGTGTGGA GGTGGAGACG GAAACCGATG TCGATGAAGA CGGGCTGCTC
GACGGCGACG TGGGGCACGC CAACTTCATC GGTGGTGTCA TCGCGAATCA TACGGACCGG
GCAATGTTAC GGGTGATCCG GACATTGGAT ACTTTTGGTG TCTGCACGGA GGATCAGTTG
ATCGCCTCGC TGGGCCGGCT GCACCCGGAC ACCAAGGTGA TCAACCTTTC CCTCGGTGGC
TTCACCGCCG ACGGATCCGC GCCGCTCGGC GTACGCGCGG CGTTGGGGCA GGCCCTGTCC
GGGATCGACC GGGTGGTGGT CGCTGCTGCC GGCAACGACG GCAACCGCAG CGACCCGTTC
TGGCCCGCAG CGTTCGCCAA TGCCGGCGAG TCGTGGAGTG GGCAGGTACT GGCGGTCGCC
GCGCACGACG GCAGCGACCT GTGCTCCTGG AGCAACGCTG GACCGTGGGT CAGCGTCGTC
GCGCCGGGTG AGGACGTTCG AAGCACGTAC ATCGACCACG CTCTGTTCCC AGAGGGGTGG
GCGCAATGGA GCGGAACGTC GTTCGCCGCG CCGCGAGTGG CTGCCGAACT CTCCGCGCGG
ATCGACTCGG AGGTCGGCGC GGTGGCCGCT GCCAACCAGC TAATGGCCGA TCTGAGGGCG
TCCAACCAGC GGTTTGGAGG CCACCTCGGG CTGATCTGA
 
Protein sequence
MPPSSERVNR FAARRRARLE GLPKLAQTAV GGSARAWFVA DELLVVDDSR RDAERYLGRA 
RAAQPDAGDE ELLPGLRRYR APGLDVPAAV RALRSGRPAG NQVVSPNHVF LSSPFNHGGP
FGPPAPVAAS TFKMPAETDR VAVSIVDTGF WTETPLPADY LASDGVEVET ETDVDEDGLL
DGDVGHANFI GGVIANHTDR AMLRVIRTLD TFGVCTEDQL IASLGRLHPD TKVINLSLGG
FTADGSAPLG VRAALGQALS GIDRVVVAAA GNDGNRSDPF WPAAFANAGE SWSGQVLAVA
AHDGSDLCSW SNAGPWVSVV APGEDVRSTY IDHALFPEGW AQWSGTSFAA PRVAAELSAR
IDSEVGAVAA ANQLMADLRA SNQRFGGHLG LI
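
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is an illustration under stated assumptions, not the method used to produce this record: it builds a codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which this sketch ignores). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # codons starting with T
    "LLLLPPPPHHQQRRRR"   # codons starting with C
    "IIIMTTTTNNKKSSRR"   # codons starting with A
    "VVVVAAAADDEEGGGG"   # codons starting with G
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the 10-base block spacing
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 39 bases of the gene sequence above.
first_bases = "ATGCCTCCGT CCTCAGAACG CGTCAACCGA"
last_bases = "TCCAACCAGC GGTTTGGAGG CCACCTCGGG CTGATCTGA"

print(translate(first_bases))  # prints: MPPSSERVNR
print(translate(last_bases))   # prints: SNQRFGGHLGLI
```

The two outputs match the first ten and last twelve residues of the protein sequence listed above. The last 39 bases are in frame because the gene length, 1179 bp, is a multiple of 3.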