Gene Information |
Locus tag | Sare_4062 |
Symbol | |
ID | 5704145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4619094 |
End bp | 4620668 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641273488 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_001538843 |
Protein GI | 159039590 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
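The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon. The function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information table.
start_bp = 4619094
end_bp = 4620668
reported_gene_length = 1575      # bp
reported_protein_length = 524    # aa


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residue count of a complete CDS: number of codons minus one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```

Under these assumptions, the computed values agree with the reported Gene Length and Protein Length.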
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.286322 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACT ACGAGTCCGA CCCACAGCGC CGGCAGGCCC CCGGGGACGC CGAGCCGTCG CACCCCACCG TCGATCTGCC CCGCCTCGAG CGCGCCCAGT CCGACTCCTC GGCCTCGGTC TCCGACGCGG CTTCCCCAGC GGGCCCCGCC GCGGCTGCGG CTTCCCCGGC GAGCCCCGCC GCGCCGTCCG TCGGCGCCCC GACCGACCCC ACTGACGCCC GGTCCGGGTC GGACGAGCCC GCTGTCGAGA ACCCCGTACC GGCCGCAAAC GCATCGCCGT CCGCCGACCC CTCCGCGCCG CGATATCCCC AGTTCGGTGT CGGGCCTGCC GGCGGGCACG GTGGGCACCC GCACCCACCC GGTTATCCGC AACACCCGAA CCCGGCCACC CTCTGGTATG GGCAGCAGAG CGCCGGCTGG AGCGGGGGGC AACCCGGCGG GTACGGCCAG CCCTATCAGC CGGGCCAGCC GGCGCACCTG GCCGGGGCGT CCATGCCGCC GTGGGCGGCG CCGCAGAGCG GCCCGCGCTC CGGCGGCCGG GTGGCGAAGT TCGTCGGCGC TGGCGTTGCG GTGGTCGCCC TGATGTTCGG CTCTGGTGTT GCCGGCGGCG CACTCGCGCT CGCCCTCGCC GGCGACTCCG GCATCACCCG AACCTACTCG GCGGCCCCGA TCATCGATGG TGCCGACCTG CCGCGCATCG CCGCCGCGGT GCAGCCCAGT GTGGTGTCGA TCGGCACCGG CAACGGCGAG GGCTCGGGTG TGATCCTCAG CACCGACGGC TACGTGCTGA CCAACAACCA TGTGATCGCC TCGGCGAGCG GCGGCACCGT GCTGGTGACC TTCGCCGACG GCGAGACGGC GCAGGCGAAG ATCGTCGGCA CCGACCCGAA GACCGACTTG GCCGTCGTCA AGGCATCCGG GGTCAGCGAC CTGACGCCGG CGACGTTCGG CGACAGCGAC GCGATGCAGG TCGGCGACCA GGTCCTCGCC CTGGGTAGCC CGTTGGGCTT GCAGGGGTCG GTGACCGCGG GCATTCTCAG CGCCCGGGAC CGCACCATCC AGGCCGGTGG CTCGCCGCAG GACCCGCGCC AGGGGGTCAC CTCGATCTCC GGGTTGCTGC AGACGGATGC GCCGATCAAC CCCGGTAACT CCGGTGGCGC GCTGGTCAAC ACCCGGGGAG AGGTGATCGG GATCAACACG GCGATCGCCA CCAGCGGTCA GGGCAGCACC GGCAACATCG GGGTCGGTTT CGCCATCCCC AGCAACAAGG CCAATGACGT CGCCGGGAAG CTGCAACGGG GGGAGAAGGT CTCTCACCCC ACCCTCGGTG TCAGCGTCAC CACCGCCGAC GGCGGTGGCG CCCTGGTGGC CGAGGTCCTC CCCGACAGTG CCGCCGAGCG GGCCGGCCTC CAGCGCGGTG ACGTCATCAC CCGGTTTGGT GACAAGGCGA TCGACGGCTC CGACGACCTG GTCGCCGAAG TCCAGGCCGG CAAGGTGGGT GACCGGGTCG ATGTGACGTA CAAACGCAAC AATGCCGAAA CGACGGCAAC CGTGACGCTC GCCGAAGCGT CCTAG
|
Protein sequence | MTDYESDPQR RQAPGDAEPS HPTVDLPRLE RAQSDSSASV SDAASPAGPA AAAASPASPA APSVGAPTDP TDARSGSDEP AVENPVPAAN ASPSADPSAP RYPQFGVGPA GGHGGHPHPP GYPQHPNPAT LWYGQQSAGW SGGQPGGYGQ PYQPGQPAHL AGASMPPWAA PQSGPRSGGR VAKFVGAGVA VVALMFGSGV AGGALALALA GDSGITRTYS AAPIIDGADL PRIAAAVQPS VVSIGTGNGE GSGVILSTDG YVLTNNHVIA SASGGTVLVT FADGETAQAK IVGTDPKTDL AVVKASGVSD LTPATFGDSD AMQVGDQVLA LGSPLGLQGS VTAGILSARD RTIQAGGSPQ DPRQGVTSIS GLLQTDAPIN PGNSGGALVN TRGEVIGINT AIATSGQGST GNIGVGFAIP SNKANDVAGK LQRGEKVSHP TLGVSVTTAD GGGALVAEVL PDSAAERAGL QRGDVITRFG DKAIDGSDDL VAEVQAGKVG DRVDVTYKRN NAETTATVTL AEAS
|
| |
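The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons with translation table 11. This is a minimal sketch, not the database's own pipeline: the codon table is built from the NCBI amino acid string that tables 1 and 11 share, and the helper name `translate` is hypothetical. It does not handle the alternative start codons that table 11 permits, which is harmless here because this gene starts with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); amino acid string shared by
# translation tables 1 and 11. '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence listed above.
head = "ATGACCGACT ACGAGTCCGA CCCACAGCGC"
tail = "GCCGAAGCGT CCTAG"

print(translate(head))  # MTDYESDPQR
print(translate(tail))  # AEAS
```

The two outputs match the first ten and last four residues of the listed protein sequence. Because the sequence is given 5' to 3' as the coding strand, no reverse complement is needed even though the gene lies on the minus strand.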