Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4440 |
Symbol | |
ID | 5705918 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5017358 |
End bp | 5018554 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641273856 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001539205 |
Protein GI | 159039952 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.223031 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00412922 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTACGAA GCCGGTCCCG CCCCCTCCTC GCCGCTGTTT CGGCGGCGAT GCTGACCGCC GTGGTGTCGT CGTTGACCGC CGCCCCCGCA GGCGCGACAC CGCGGTGCGC GTCCCCCCTC GCCCCGGCTC CCCCGATCGC CACCATGCCG TGGCCGCAGC AACGGTACGC GTCGGAACAG CTACTGTCCC TCGCCACCGG GGCAGGGGTG ACCATCGCGG TGGTCGACTC CGGGGTGGAC CGCTCGCACC CACAACTCGC CGGTCGAGTT CTCCCCGGCG CGGACCTGCT CGATCCCGGC GGAGACGGCA CCCGGGACTG CGCCGGACAC GGCACGGGCG TGGCGAGCAT CATCGTGGCC GCCCGCCACG ATGGGGTGGC CTTCCACGGC CTCGCGCCGC AGGCCCGGAT CCTGCCGGTA CGCGTCAGCG AACAGCAGGT GGTCGACGGG CGACAGTCCG GACGCACGGT GGGCGTGGCC GACTTCGCCG CGGCGATCCG ATGGGCGGTC GACAACGGCG CCGAGGTGCT CAACCTCTCC GTCGTGCTGC ACGTCGACGC CCCGGCCGTG CGGGCGGCCA TCGCCCACGC GCTGGCCCGG GACGTGGTCG TGGTGGCAGC CGCCGGCAAC CTCCACGACC AGGGCGACCC CCGCTCCTAC CCGGCCGCGT ACGACGGGGT GCTGGGGGTG GGCGCGATCG GTGCGGACGG GGTGCGCGCC TCCTTCTCCC AGGACGGTCC GGAGGTGGAT CTGGTAGCGC CCGGTGCCGA CGTGGTGACC GCCGCGCCCG GCCAGGGGCA CCACCGGGCC GAGGGCACCA GCTACGCGGC GCCCTTCGTG GCGGCCACCG CCGCCCTGCT GCGCGGGCAC CGGCCGGAGC TGACGGCGGA GCAGGTGGTA CGACGAATCC TGGTCAGCAC CGATCCCGCC CCCGGAGGCG GATACGGCGC GGGCGTGCTG AACCCGTATC GGGCGCTCAC CGAGAGCGGG GGTGCGGCGG CGCCGGCCCG ACCGGCCACC GCGCTGCTCG ACGACCGGGC CGACCCGGAC CGGATCGCCG AACAGGCCCG TCGGGCGGCG GCCCAGGATA GGGCGCTGGT GGTGGCCCTG GTGGGCGGGG CGTTGGTCAC GGTGGCGGTG CTGCTCGCCC TCGTGCTGCC GCGCGGCATC CGCCGTCGCT GGCGGCCTCC GGCGTGA
|
Protein sequence | MLRSRSRPLL AAVSAAMLTA VVSSLTAAPA GATPRCASPL APAPPIATMP WPQQRYASEQ LLSLATGAGV TIAVVDSGVD RSHPQLAGRV LPGADLLDPG GDGTRDCAGH GTGVASIIVA ARHDGVAFHG LAPQARILPV RVSEQQVVDG RQSGRTVGVA DFAAAIRWAV DNGAEVLNLS VVLHVDAPAV RAAIAHALAR DVVVVAAAGN LHDQGDPRSY PAAYDGVLGV GAIGADGVRA SFSQDGPEVD LVAPGADVVT AAPGQGHHRA EGTSYAAPFV AATAALLRGH RPELTAEQVV RRILVSTDPA PGGGYGAGVL NPYRALTESG GAAAPARPAT ALLDDRADPD RIAEQARRAA AQDRALVVAL VGGALVTVAV LLALVLPRGI RRRWRPPA
|
| |