Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4754 |
Symbol | |
ID | 5705345 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5383073 |
End bp | 5384410 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641274152 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001539498 |
Protein GI | 159040245 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00404816 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000540965 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGTGG TGCGGCGTGG TGGCAGGCCC ACTGCCCGTC GGGGCGCCGG TCGGGCATGG CTCGGCCTGA CGGCCATCGC CGCCTCGGTC GCCGTGCCGT TCGCCAGCGA GCCCCACGCC GGGCAGGTCG CCGCCGCGGT CCCGGGCCGT CCGGCTGACA CGGTCACCAC CGCTGACCCC GTCTATTCCG CCGGCAAGAC CCTGGTGGGT CGCTCGGAGC AGTCCGACCA GGTCCGCGAC ACTCAGTGGC AGCTCGAATT GCTGCGAGCC CGGACCGCCT GGCGCATCTC GACCGGGGCC GGAGTGGTGG TGGCGGTGAT CGACTCCGGT GTGGATGGCG CGCACCCGGA CCTGGCCGGC CGCGTCTTGC CCGGTCTCGA CCTGGTCGGG CCGTCGGGCG CCGCCGGCCC CGATCCGGTG GGTCACGGCA CGACCGTGGC CGGGCTGATC GCCGGCCGGC GCGACGACGG CCGTGGAGTC GTGGGCCTCG CACCCGATGC CCGGATCCTG CCGGTACGCG TCCTTGACGC GGAGAATCGC TACGACGACG CGTTGATCGT CGCGAAGGGC GTGCGCTGGG CCGTTGACAA CGGCGCCCGT GTGATCAATC TGTCGCTGGG CGGCAGCAGT GCCAGCCCGG CGCTGGCCGC CGCGCTGGAC TACGCGTTCG CCCGGGACGT GGTGGTCGTC GCCTGCACCG GCAACGTGAG TACCTCGACC ACCAGCACCG TGTGGTATCC GGCCCGGGAG CCGGGCGTGG TCGCGGTTTC CGGGCTGGAC CGGGACAGTG AGAACCTGTG GTCGGGCGCG ATCACCGGTC GCCAGACGGT ACTCACCGCG CCCGCCACCG GTCTGGTGGG CGCCCGATCG CCGAAGGGGT ACTGGCGGGT GCAGGGCACG AGTTTCGGCA CGCCGCTGGT CGCGGCCACC GCCGCGTTGC TCCGCGCCCG GTACCCGGAA ATGCCCGCCG GCGACGTGAT CAATCGAATG CTGGTCACGG CGCGGGACAT CGGCGCCGCC GGACGGGACG ACCGCTTCGG GTACGGGCTG GTCGACCCGG TCGCGGCACT GACCGCCGAG GTGACACCGG TCGGGCGGAA CCCATTGGAC GACCAGTCCT CGCCGGGGGT GGTCGGCTTC GGGCCGGCGC CGGAAACGGC GGACGCGAAT GCCGACAGTG GCGTGCTCGA CGTCGCACCC CGGCTGTGGG ATCGGCGATC GCAGCAACCA CCTGGCACCG GGCCCGACTC CACGTCGGAG CGGATCTGGA CCGGGACAAC GCGCCCGGTC GCCCTGTTCA CCGGTAGCGC CCGAGCGGCC CGTAGGTTCC GCCGGTAG
|
Protein sequence | MTVVRRGGRP TARRGAGRAW LGLTAIAASV AVPFASEPHA GQVAAAVPGR PADTVTTADP VYSAGKTLVG RSEQSDQVRD TQWQLELLRA RTAWRISTGA GVVVAVIDSG VDGAHPDLAG RVLPGLDLVG PSGAAGPDPV GHGTTVAGLI AGRRDDGRGV VGLAPDARIL PVRVLDAENR YDDALIVAKG VRWAVDNGAR VINLSLGGSS ASPALAAALD YAFARDVVVV ACTGNVSTST TSTVWYPARE PGVVAVSGLD RDSENLWSGA ITGRQTVLTA PATGLVGARS PKGYWRVQGT SFGTPLVAAT AALLRARYPE MPAGDVINRM LVTARDIGAA GRDDRFGYGL VDPVAALTAE VTPVGRNPLD DQSSPGVVGF GPAPETADAN ADSGVLDVAP RLWDRRSQQP PGTGPDSTSE RIWTGTTRPV ALFTGSARAA RRFRR
|
| |