Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0938 |
Symbol | |
ID | 5708049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1058778 |
End bp | 1060016 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641270456 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001535844 |
Protein GI | 159036591 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.980042 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCATC CCCGTCGGAT GGTGACGCCA CGGCGCCGAT CGGCCGGAAC CAGCGCCATC GTCTTCGTCT TTGCGCTCGT GCATGTGCTC TGTTTCGCCG GAACTGCGGG AGCGGATCCA CTCGGGGACG GGCTGCCGCC GCTTCGGCAG CAGTCCGACG GTTGTGTGCC GGCCTCCGAC GTCCGTATCC GTGACGTTCC CTGGGCCCAG CGACGCCTGA CGCCCGAGCG GGTCTGGCCG CTGACCCGTG GGGCTGGCCA GGTGGTAGCG GTGATCGACT CCGGTGTGGC CCGGGTGCCC CAACTCGCCG GCGGACGGCG CGACCAGGTG GAGATCATCG GCGGCCGGAC CGGCGTGGAC GACGACTGCC CCGGGCACGG CACGTTCGTC GCTGGTCTGA TCGCAGCCCG CCCGGCGAGG GACACGGGCT TCAGCGGAGT CGCCCCGGCG AGTACGATCC TGCCGATCCG GCAGACGCGC AACGGACGCG ACGGCACCGC CGACGGTCTG GCCAAGGCCA TCCGGGTGGC CGCCGACCAG GGGGCGGACA TCATCAACGT CTCCTCGGCG TCGCTGTTTC CCGACGACAC GCTGCGCCGG GCCGTCGAGT ACGCCACCAG CAAGGACGTG CTGATCGTGG CGGCGGTCGC CAACGAGCTC GGCAACGGCA ACGCCCACCC GTACCCCGCC GCGTACCCGC AGGTTCTCGC GGTCGGAGCG ATCGGCTCCG ACGGTGCCGC CGCCGACTTC TCCGGCGCCG GAGAGTTCGT CGACCTGGTG GCTCCGGGAA GCAGCATCGT CAGTGTCGGG CCGCGCGGTG GCGGTCACCT GACCGCCACC GGCACCAGCT ACGCCGCACC GCTGGTCGCC GGCGCCGCCG CACTCGTGCG GGCCTACCAT CCGCAGTTGA CAGCCGCGCA GGTAAAACAC CGGCTGCAGG TGACCGCCGA CCCGCCGAGC AGTACGGTGC CCGACCCGCG ACTTGGTTGG GGTGTCGTCA ACCCGTACGC GGCGGTGACG TCCATCCTGC CGAACGAGGC CGGTGCCACG CCGGCTGTCG CTCCGCCGGC CACTGTCTCA GGCCCGACGT GGCCGAGCGG CGGCCTCTCG GGCCGCCGGT CGGCGTTCAT CATCACGGTG GTTGCCACCG TGCTGGTTGC CGCGGTGGTG GTGGCCCGGG CGGTCGTGCC GCGCGGACGG CGGCGGCGCT GGCGGCCGGC CGGATGGACG GGCCGGTGA
|
Protein sequence | MSHPRRMVTP RRRSAGTSAI VFVFALVHVL CFAGTAGADP LGDGLPPLRQ QSDGCVPASD VRIRDVPWAQ RRLTPERVWP LTRGAGQVVA VIDSGVARVP QLAGGRRDQV EIIGGRTGVD DDCPGHGTFV AGLIAARPAR DTGFSGVAPA STILPIRQTR NGRDGTADGL AKAIRVAADQ GADIINVSSA SLFPDDTLRR AVEYATSKDV LIVAAVANEL GNGNAHPYPA AYPQVLAVGA IGSDGAAADF SGAGEFVDLV APGSSIVSVG PRGGGHLTAT GTSYAAPLVA GAAALVRAYH PQLTAAQVKH RLQVTADPPS STVPDPRLGW GVVNPYAAVT SILPNEAGAT PAVAPPATVS GPTWPSGGLS GRRSAFIITV VATVLVAAVV VARAVVPRGR RRRWRPAGWT GR
|
| |