Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0044 |
Symbol | |
ID | 5707324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 51562 |
End bp | 53118 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641269569 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001534971 |
Protein GI | 159035718 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.61029 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCTAT CCCGCAGGTC CGTATTCGTC GGACTGGCCA CGATGGCCAT GGTGGCGTCC GCAACCCCCG CCATGGCCGC CGAGCCGGTC GGCACGATCA GAAGTGCCGG CGGCGCCACC GCCGTCGCCG ACAGCTACAT CGTGGTCTTC AAGGACAGCC GGGTCAGCCG TGGCGCTGTC GAGCAGTCCG TCGACCGCCT GCTTGACCGG CACGGTGGCC AGATGGCCCG GATGTACACC GCAGCACTCC GCGGAGCAGA GGTGCGGGTG GACGCCAGCG CCGCCGCCCG AATCGCGGCC GACCCGGCCG TAGCCTACGT CGAGCAGAAC CACACCGTCT CGATCGCCGG TACCCAGCCC AACCCCCCGT CCTGGGGCCT GGACCGAGTC GACCAGCGAA ACCTGCCGCT GGACAGTTCC TACACGTACC CGAACACCGC CAGTGACGTG ACCGCCTACA TCATCGACAC CGGAATCCGC ACCACTCACA CGGACTTCGG TGGTCGGGCC ACGTGGGGCA CCAACACGGC CGACAACAAC GACACCGACT GCAACGGGCA CGGCACGCAC GTCGCCGGCA CCGTCGGTGG CTCGGCGTAC GGCATCGCCA AGGAAGCCAA ACTGGTCGCG GTCAAGGTGC TGAACTGCGC CGGCAGCGGC AGCTACGCCG GGGTCATCGC CGGCGTCGAC TGGGTCACCG CGAACGCGGA CAAGCCGGCC GTGGCGAACA TGAGCCTCGG TGGCGGTGCG AACAGCTCGG TGGACAACGC GGTGACCAAC TCGATCAACT CCGGTGTCAC CTACGCGCTG GCGGCGGGGA ACAGCAACGC CAACGCCTGT AACTACTCGC CGGCCCGTAC CCCTGCGGCG ATCACCGTCG GGTCGACGAC CAGCACCGAC GGACGGTCCT GGTTCTCCAA CTACGGCACC TGCCTGGACC TCTTCGCACC GGGCTCGTCG ATCACCGCGC CGTGGAACGA CAGCGACAAC GGCACGAACA CGATCAGCGG CACGTCGATG GCCTCGCCGC ACGCCGCGGG TGCCGCGGCG CTGGTCCTCT CGGCCAACCC GTCGTACACC CCGCAACAGA TTCGGGACGC TCTGGTCGAC AACGCCACGG ACAACGTGGT GGGCGGCCCG GGCAGTGGCT CGCCGAACAA GCTCCTCTAC ATCGGTGACG GCGGCACCCC GCCGCCCCCG CCCCCGCCCG GCTGCACCGG CACCAACGAC ACCGACGTAG CGATCCCGGA CGCCGGTGCC GCGGTGACCA GCTCGATCAC CATCACCGAC TGTGACGGAA ACGCCTCGGC GGCCTCGACC GTGGCAGTGG ACATCCCCCA CACCTGGCGT GGTGACCTCG TGATCGACCT GATCGCGCCG GACGGCTCGT CCTACCGGCT CAAGACCAAC AACCTGTCCG ACTCCGCCGA CAACGTCAAC GAGACCTACA CGGTGAACCT CTCCAGCGAG GTAGCGGACG GCACCTGGAA ACTCCAGGTC CAGGACGTCT ACCGCGCGGA CACCGGCTAC ATCAACACCT GGACCCTGAC GGTCTGA
|
Protein sequence | MGLSRRSVFV GLATMAMVAS ATPAMAAEPV GTIRSAGGAT AVADSYIVVF KDSRVSRGAV EQSVDRLLDR HGGQMARMYT AALRGAEVRV DASAAARIAA DPAVAYVEQN HTVSIAGTQP NPPSWGLDRV DQRNLPLDSS YTYPNTASDV TAYIIDTGIR TTHTDFGGRA TWGTNTADNN DTDCNGHGTH VAGTVGGSAY GIAKEAKLVA VKVLNCAGSG SYAGVIAGVD WVTANADKPA VANMSLGGGA NSSVDNAVTN SINSGVTYAL AAGNSNANAC NYSPARTPAA ITVGSTTSTD GRSWFSNYGT CLDLFAPGSS ITAPWNDSDN GTNTISGTSM ASPHAAGAAA LVLSANPSYT PQQIRDALVD NATDNVVGGP GSGSPNKLLY IGDGGTPPPP PPPGCTGTND TDVAIPDAGA AVTSSITITD CDGNASAAST VAVDIPHTWR GDLVIDLIAP DGSSYRLKTN NLSDSADNVN ETYTVNLSSE VADGTWKLQV QDVYRADTGY INTWTLTV
|
| |