Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0587 |
Symbol | |
ID | 5703717 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 666565 |
End bp | 668436 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641270112 |
Product | peptidase S9 prolyl oligopeptidase |
Protein accession | YP_001535506 |
Protein GI | 159036253 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGATCC AGGCCCCGTA CGGCAGCTGG CCGTCCGGCT GGCAGGCGGC CGACGCGTCA CGCGGCCACT CGGTCGTGGA CTGGGTTGGT TTCGCCGGCA GCGAGGTGTG GTGGGTGGCC GCCGATGCGG GTGACGGTCG CAACCACCTG GTACGTCCAG GTGCGGACGG CCGCCCCGAA GACGTGCTGC CGGGCGATTG GGACGTGCGG ACCGCGTTCA TGGAGTACGG CGGGCGGCCC TGGGTGTTTC TCGGTACCGG CGGCGCGGTC TTCGTCCACT GGTCGGACCA GCGGGTCTAC CGCTGGACGC CCGAGGCCGC CGTGCGGCCG CTGAGCCCAC GCTCGGACCG GTACCGGTAC TGCGACTTCG CGGTACGGGG CGACGAGGTG TGGTGTGTGC GGGAGACGAC CGGCGGTGAG GTCCGGCGTG ACCTCGTCGC GCTGCCGCTG GATGGTTCGG CGCGAATCCG GGTGCTGGCC GCCACCCACG ACTTCCTGTC CGGCCCACGG ATCTCGCCGG ACGGAGGCCG CGTCGCCTGG CTGGGCTGGA ACCATCCGGA CATGCCGTGG ACGCGTACCG CGGTGATGGT GGCGAACGTC GACCCGGACG GCTCGCTGGT CGGCCTGCGC CGGCTGGCGA CCGGTGCCGA CGAGTCGGTG ACCCAGATCG AGTGGACCAG CGACGGCGCG GCGCTGCTCG TGGTGAGCGA CCGGAGCGGC TGGTGGAACG TCCACGAGGT GAGCGGGGAC GGACGGTGGC GGGCCCGGTG CCCGCGCGCC GAGGAGTTCG GCGAGGCCCT GTGGCGGATC GGCGCCAGCA CCTGCGCCGC CCTCACCGGC GGAGGCCTCG CGGCGGCACA CGGCACGGGC GTCCGCCGGT TGGGCCTGTG TGACGCCGAC GGCGGCCTGG TCGATGTGGA CGACGGCTTC ACGGACTGGC GGTCCGTGGT CAGCGACGGG CGGCGGGTGG CAGCCGTGGC GGCCGGACCC CGCAGGTCGC GCTCGGTCGT GCTCGTCGAG CAGGGGTGCA CCCGTGTGCT CTGGTCCAGT CCCGGTGCTC TGGCCAGCTA CGCCTCCGTA CCGATGCTCC GCACCTACCA GGGCGTCCAT GCGCACGTGT ACGAACCCCA CCACCCCGGG TACGCGGGTC CGCCCGGTGA GCCGCCGCCG TACATCGTCC AGGCGCACGG CGGCCCAACC AGTCGCGGCG TGCCGGTGGC CGACGCGGTG ACCACGTACT TCACCAGCCG GGGGATCGGT GTGGTGGATG TCCAGTATGG CGGTTCCACC GGCTACGGGC GCGCCTACCG GGACCGGCTC CGGCATCGCT GGGGTGAGGT TGACGCTCGG GACTGCGCGA CCGTCGCCCG TGGCCTTGTC GCCGAGGGAC GGGCCGACCC AAGCCGGATC GCCCTCCGCG GCGCCAGCGC TGGTGGGTGG ACCGCACTGC GGTCGCTGAT CGACGACCCC GACCTCTACC AGGCGGCCGT GGTCTACTTC CCCGTCCTGG ACGCCCGTTC CTGGGCGAAG TCGACGCACG ACTTCGAGTC GCGGTACGCG GAGTGGCTGA TCGGCCCGTG GCCACAGGAG CGTGGCCGCT ACGAGTCCCG TTCGCCGGCC GCCGCGGTGG AACGGATCCG GACCCCGCTG CTGCTGATGC AGGGTGCCCG GGACGCGATC TGCGTGCCGG AACAGGCGGA CCAGTTCGCC AGGTACCTTG CCTCGATCTC CGTGCCGATA CGCTACCTGC GCTTCCATGC CGAGGCGCAC GGCTTCCGGC AAGCCGACAC CGTCGCCCGG TGCCTGAACG CCGAGCTCGA CCTGTACGCC AAGGCGTTGC GTTTTCCGCT GCCGGTCGAA GCGCGAGCAT GA
|
Protein sequence | MPIQAPYGSW PSGWQAADAS RGHSVVDWVG FAGSEVWWVA ADAGDGRNHL VRPGADGRPE DVLPGDWDVR TAFMEYGGRP WVFLGTGGAV FVHWSDQRVY RWTPEAAVRP LSPRSDRYRY CDFAVRGDEV WCVRETTGGE VRRDLVALPL DGSARIRVLA ATHDFLSGPR ISPDGGRVAW LGWNHPDMPW TRTAVMVANV DPDGSLVGLR RLATGADESV TQIEWTSDGA ALLVVSDRSG WWNVHEVSGD GRWRARCPRA EEFGEALWRI GASTCAALTG GGLAAAHGTG VRRLGLCDAD GGLVDVDDGF TDWRSVVSDG RRVAAVAAGP RRSRSVVLVE QGCTRVLWSS PGALASYASV PMLRTYQGVH AHVYEPHHPG YAGPPGEPPP YIVQAHGGPT SRGVPVADAV TTYFTSRGIG VVDVQYGGST GYGRAYRDRL RHRWGEVDAR DCATVARGLV AEGRADPSRI ALRGASAGGW TALRSLIDDP DLYQAAVVYF PVLDARSWAK STHDFESRYA EWLIGPWPQE RGRYESRSPA AAVERIRTPL LLMQGARDAI CVPEQADQFA RYLASISVPI RYLRFHAEAH GFRQADTVAR CLNAELDLYA KALRFPLPVE ARA
|
| |