Gene Strop_3684 details


Gene Information

Locus tag: Strop_3684
Symbol:
ID: 5060160
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 4223965
End bp: 4225521
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 72%
IMG OID: 640475940
Product: peptidase S1 and S6, chymotrypsin/Hap
Protein accession: YP_001160493
Protein GI: 145596196
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
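
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4223965
end_bp = 4225521

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length (1557 bp) and Protein Length (518 aa) fields listed in the record.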


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.358044
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACT ACGAGTCCGA CCCGCAGCGT CGGCAGGCCC CCGGGGACGC CGAGCCGTCG 
CACCCCACTG TCGACCTGTC TCGCCTCGAG CGCGCCCAGT CCGACTCTTC GGCCGCTACC
TCCGGTGCGG ACAGTAGCTT CGCGCCGCCT GCCGACACCA CGACCGGCCC CACTGAGGCG
TTGCCGGTCT CGGCTTCGGG CGCCTCGCCG TCGGCTGACC CCACTGAGGC GTTGCCGGTC
TCGGCTTCGG GCGCCTCGCC GTCGGCTGAC CCCACTGAGG CGCTCGCCCC CCAGCTCGGC
GCCGCGCCCG TCGCCGGCCC GGCCGGTGGC CACCCCTACC CACCCGGCTA CTCGCCACAA
CACCCCGGTG CTCCCTGGTA CGGGCCGCGG TCCACCGCCT GGAGCGGAGG GCAGCCAGGC
GGGTACGGAG CCCCGTTCTA CCCGGGTCAG CCGGCGCAGC TGGCCGGACA GCCCGCGCCA
CCGTGGGCGG CGCCGCAAAC CGGTCCGCAT CCGGGCAGCC GGATCGCGAA GTTCATCGGC
GCGGGCGTCG CGGTGCTCGC CCTGATGTTC GGCTCCGGTG TCGCCGGCGG CGCGCTCGCA
CTCGCCCTGA ATGACGGCTC CGGCGTCACG CGCACCTACT CCGCGGCCCC GATCATCGAC
AGCGCCGACC TGCCGCGGAT CGCCGCCGCG GTGCAGCCCA GCGTGGTGTC GATCGGCACC
GACAGCGGCG GGGGCTCGGG CGTGATCCTC ACCGCCGACG GATATGTGCT GACCAACAAC
CACGTGATCG CCACGGCCAG CGGCGACACC GTGCTGGTGA CCTTCGCCGA CGGCGAGACG
GCGTCGGCGG AGATCACCGG CACCGACCCC AAGACCGACC TGGCGGTGGT GAAGGCCGCC
GGGGTCAGCG ACCTGACGCC GGCGGAATTC GGCGACAGCG ACGCGATGCA GGTCGGCGAC
CAGGTTCTCG CCCTCGGTAG TCCACTGGGC CTGCAGGGGT CGGTGACCGC CGGCATCCTC
AGCGCGCGGG ACCGCACCAT CCAGGCCGGC AGCTCGGAGC AGGACCCGAC GGCGGGGGTC
ACCTCGATCT CGGGGCTGTT GCAGACCGAC GCGCCGATCA ACCCGGGCAA CTCCGGTGGG
GCGCTGGTCA ACACCCGGGG CGAGGTGATC GGGATCAACA CCGCGATCGC CACCAGTGGC
CAGGGCAGCA CCGGCAACAT CGGGGTCGGG TTCGCCATCC CCAGCAACAA GGCCGAGGAC
GTCGCCGAGA AGCTGCAACG GGGTGAGAAG GTCAGCCATC CCACCCTCGG TGTCAGCGTC
ACCGCCGCCG AGGGCGGCGG TGCCCTGGTG GCCGCGGTCC TCCCCGACAG CGCTGCCGAG
CGGGCGGGCT TCCAGCAGGG CGACGTCATC ACTCGGTTCG GCGACAAGGT GATCGCTGAC
TCCGAGGATC TGGTCGCCGT GGTCCAGGCC GGCAAGGTGG GCGACCGGGT GGATGTGACA
TACAAGCGCA ACAATGTTGA AGCGACCGCA ACCGTGACGC TCGCCGAAGC GTCCTAA
 
Protein sequence
MTDYESDPQR RQAPGDAEPS HPTVDLSRLE RAQSDSSAAT SGADSSFAPP ADTTTGPTEA 
LPVSASGASP SADPTEALPV SASGASPSAD PTEALAPQLG AAPVAGPAGG HPYPPGYSPQ
HPGAPWYGPR STAWSGGQPG GYGAPFYPGQ PAQLAGQPAP PWAAPQTGPH PGSRIAKFIG
AGVAVLALMF GSGVAGGALA LALNDGSGVT RTYSAAPIID SADLPRIAAA VQPSVVSIGT
DSGGGSGVIL TADGYVLTNN HVIATASGDT VLVTFADGET ASAEITGTDP KTDLAVVKAA
GVSDLTPAEF GDSDAMQVGD QVLALGSPLG LQGSVTAGIL SARDRTIQAG SSEQDPTAGV
TSISGLLQTD APINPGNSGG ALVNTRGEVI GINTAIATSG QGSTGNIGVG FAIPSNKAED
VAEKLQRGEK VSHPTLGVSV TAAEGGGALV AAVLPDSAAE RAGFQQGDVI TRFGDKVIAD
SEDLVAVVQA GKVGDRVDVT YKRNNVEATA TVTLAEAS
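
The sequences in this record are displayed in space-separated blocks of 10 residues, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the function names `clean_sequence` and `gc_content` are hypothetical helpers, not part of any database tool, and only the first displayed row of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-residue display blocks."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed row of the gene sequence, copied with its block spacing.
first_row = clean_sequence(
    "ATGACCGACT ACGAGTCCGA CCCGCAGCGT CGGCAGGCCC CCGGGGACGC CGAGCCGTCG"
)

print(len(first_row))                    # number of bases in this row
print(round(gc_content(first_row), 2))   # GC fraction of this row only
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_content` would give a value to compare against the GC content field of the record.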