Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_3684 |
Symbol | |
ID | 5060160 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 4223965 |
End bp | 4225521 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640475940 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_001160493 |
Protein GI | 145596196 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.358044 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACT ACGAGTCCGA CCCGCAGCGT CGGCAGGCCC CCGGGGACGC CGAGCCGTCG CACCCCACTG TCGACCTGTC TCGCCTCGAG CGCGCCCAGT CCGACTCTTC GGCCGCTACC TCCGGTGCGG ACAGTAGCTT CGCGCCGCCT GCCGACACCA CGACCGGCCC CACTGAGGCG TTGCCGGTCT CGGCTTCGGG CGCCTCGCCG TCGGCTGACC CCACTGAGGC GTTGCCGGTC TCGGCTTCGG GCGCCTCGCC GTCGGCTGAC CCCACTGAGG CGCTCGCCCC CCAGCTCGGC GCCGCGCCCG TCGCCGGCCC GGCCGGTGGC CACCCCTACC CACCCGGCTA CTCGCCACAA CACCCCGGTG CTCCCTGGTA CGGGCCGCGG TCCACCGCCT GGAGCGGAGG GCAGCCAGGC GGGTACGGAG CCCCGTTCTA CCCGGGTCAG CCGGCGCAGC TGGCCGGACA GCCCGCGCCA CCGTGGGCGG CGCCGCAAAC CGGTCCGCAT CCGGGCAGCC GGATCGCGAA GTTCATCGGC GCGGGCGTCG CGGTGCTCGC CCTGATGTTC GGCTCCGGTG TCGCCGGCGG CGCGCTCGCA CTCGCCCTGA ATGACGGCTC CGGCGTCACG CGCACCTACT CCGCGGCCCC GATCATCGAC AGCGCCGACC TGCCGCGGAT CGCCGCCGCG GTGCAGCCCA GCGTGGTGTC GATCGGCACC GACAGCGGCG GGGGCTCGGG CGTGATCCTC ACCGCCGACG GATATGTGCT GACCAACAAC CACGTGATCG CCACGGCCAG CGGCGACACC GTGCTGGTGA CCTTCGCCGA CGGCGAGACG GCGTCGGCGG AGATCACCGG CACCGACCCC AAGACCGACC TGGCGGTGGT GAAGGCCGCC GGGGTCAGCG ACCTGACGCC GGCGGAATTC GGCGACAGCG ACGCGATGCA GGTCGGCGAC CAGGTTCTCG CCCTCGGTAG TCCACTGGGC CTGCAGGGGT CGGTGACCGC CGGCATCCTC AGCGCGCGGG ACCGCACCAT CCAGGCCGGC AGCTCGGAGC AGGACCCGAC GGCGGGGGTC ACCTCGATCT CGGGGCTGTT GCAGACCGAC GCGCCGATCA ACCCGGGCAA CTCCGGTGGG GCGCTGGTCA ACACCCGGGG CGAGGTGATC GGGATCAACA CCGCGATCGC CACCAGTGGC CAGGGCAGCA CCGGCAACAT CGGGGTCGGG TTCGCCATCC CCAGCAACAA GGCCGAGGAC GTCGCCGAGA AGCTGCAACG GGGTGAGAAG GTCAGCCATC CCACCCTCGG TGTCAGCGTC ACCGCCGCCG AGGGCGGCGG TGCCCTGGTG GCCGCGGTCC TCCCCGACAG CGCTGCCGAG CGGGCGGGCT TCCAGCAGGG CGACGTCATC ACTCGGTTCG GCGACAAGGT GATCGCTGAC TCCGAGGATC TGGTCGCCGT GGTCCAGGCC GGCAAGGTGG GCGACCGGGT GGATGTGACA TACAAGCGCA ACAATGTTGA AGCGACCGCA ACCGTGACGC TCGCCGAAGC GTCCTAA
|
Protein sequence | MTDYESDPQR RQAPGDAEPS HPTVDLSRLE RAQSDSSAAT SGADSSFAPP ADTTTGPTEA LPVSASGASP SADPTEALPV SASGASPSAD PTEALAPQLG AAPVAGPAGG HPYPPGYSPQ HPGAPWYGPR STAWSGGQPG GYGAPFYPGQ PAQLAGQPAP PWAAPQTGPH PGSRIAKFIG AGVAVLALMF GSGVAGGALA LALNDGSGVT RTYSAAPIID SADLPRIAAA VQPSVVSIGT DSGGGSGVIL TADGYVLTNN HVIATASGDT VLVTFADGET ASAEITGTDP KTDLAVVKAA GVSDLTPAEF GDSDAMQVGD QVLALGSPLG LQGSVTAGIL SARDRTIQAG SSEQDPTAGV TSISGLLQTD APINPGNSGG ALVNTRGEVI GINTAIATSG QGSTGNIGVG FAIPSNKAED VAEKLQRGEK VSHPTLGVSV TAAEGGGALV AAVLPDSAAE RAGFQQGDVI TRFGDKVIAD SEDLVAVVQA GKVGDRVDVT YKRNNVEATA TVTLAEAS
|
| |