Gene Sare_0587 details

Gene Information

Locus tag: Sare_0587
Symbol:
ID: 5703717
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 666565
End bp: 668436
Gene Length: 1872 bp
Protein Length: 623 aa
Translation table: 11
GC content: 73%
IMG OID: 641270112
Product: peptidase S9 prolyl oligopeptidase
Protein accession: YP_001535506
Protein GI: 159036253
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
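
The coordinate and length fields above are related by simple arithmetic, which can be sketched as follows. This is a minimal consistency check, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length_bp = gene_length(666565, 668436)   # 1872
length_aa = protein_length(length_bp)     # 623
```

Under these assumptions the computed values agree with the Gene Length (1872 bp) and Protein Length (623 aa) fields.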


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGATCC AGGCCCCGTA CGGCAGCTGG CCGTCCGGCT GGCAGGCGGC CGACGCGTCA 
CGCGGCCACT CGGTCGTGGA CTGGGTTGGT TTCGCCGGCA GCGAGGTGTG GTGGGTGGCC
GCCGATGCGG GTGACGGTCG CAACCACCTG GTACGTCCAG GTGCGGACGG CCGCCCCGAA
GACGTGCTGC CGGGCGATTG GGACGTGCGG ACCGCGTTCA TGGAGTACGG CGGGCGGCCC
TGGGTGTTTC TCGGTACCGG CGGCGCGGTC TTCGTCCACT GGTCGGACCA GCGGGTCTAC
CGCTGGACGC CCGAGGCCGC CGTGCGGCCG CTGAGCCCAC GCTCGGACCG GTACCGGTAC
TGCGACTTCG CGGTACGGGG CGACGAGGTG TGGTGTGTGC GGGAGACGAC CGGCGGTGAG
GTCCGGCGTG ACCTCGTCGC GCTGCCGCTG GATGGTTCGG CGCGAATCCG GGTGCTGGCC
GCCACCCACG ACTTCCTGTC CGGCCCACGG ATCTCGCCGG ACGGAGGCCG CGTCGCCTGG
CTGGGCTGGA ACCATCCGGA CATGCCGTGG ACGCGTACCG CGGTGATGGT GGCGAACGTC
GACCCGGACG GCTCGCTGGT CGGCCTGCGC CGGCTGGCGA CCGGTGCCGA CGAGTCGGTG
ACCCAGATCG AGTGGACCAG CGACGGCGCG GCGCTGCTCG TGGTGAGCGA CCGGAGCGGC
TGGTGGAACG TCCACGAGGT GAGCGGGGAC GGACGGTGGC GGGCCCGGTG CCCGCGCGCC
GAGGAGTTCG GCGAGGCCCT GTGGCGGATC GGCGCCAGCA CCTGCGCCGC CCTCACCGGC
GGAGGCCTCG CGGCGGCACA CGGCACGGGC GTCCGCCGGT TGGGCCTGTG TGACGCCGAC
GGCGGCCTGG TCGATGTGGA CGACGGCTTC ACGGACTGGC GGTCCGTGGT CAGCGACGGG
CGGCGGGTGG CAGCCGTGGC GGCCGGACCC CGCAGGTCGC GCTCGGTCGT GCTCGTCGAG
CAGGGGTGCA CCCGTGTGCT CTGGTCCAGT CCCGGTGCTC TGGCCAGCTA CGCCTCCGTA
CCGATGCTCC GCACCTACCA GGGCGTCCAT GCGCACGTGT ACGAACCCCA CCACCCCGGG
TACGCGGGTC CGCCCGGTGA GCCGCCGCCG TACATCGTCC AGGCGCACGG CGGCCCAACC
AGTCGCGGCG TGCCGGTGGC CGACGCGGTG ACCACGTACT TCACCAGCCG GGGGATCGGT
GTGGTGGATG TCCAGTATGG CGGTTCCACC GGCTACGGGC GCGCCTACCG GGACCGGCTC
CGGCATCGCT GGGGTGAGGT TGACGCTCGG GACTGCGCGA CCGTCGCCCG TGGCCTTGTC
GCCGAGGGAC GGGCCGACCC AAGCCGGATC GCCCTCCGCG GCGCCAGCGC TGGTGGGTGG
ACCGCACTGC GGTCGCTGAT CGACGACCCC GACCTCTACC AGGCGGCCGT GGTCTACTTC
CCCGTCCTGG ACGCCCGTTC CTGGGCGAAG TCGACGCACG ACTTCGAGTC GCGGTACGCG
GAGTGGCTGA TCGGCCCGTG GCCACAGGAG CGTGGCCGCT ACGAGTCCCG TTCGCCGGCC
GCCGCGGTGG AACGGATCCG GACCCCGCTG CTGCTGATGC AGGGTGCCCG GGACGCGATC
TGCGTGCCGG AACAGGCGGA CCAGTTCGCC AGGTACCTTG CCTCGATCTC CGTGCCGATA
CGCTACCTGC GCTTCCATGC CGAGGCGCAC GGCTTCCGGC AAGCCGACAC CGTCGCCCGG
TGCCTGAACG CCGAGCTCGA CCTGTACGCC AAGGCGTTGC GTTTTCCGCT GCCGGTCGAA
GCGCGAGCAT GA
 
Protein sequence
MPIQAPYGSW PSGWQAADAS RGHSVVDWVG FAGSEVWWVA ADAGDGRNHL VRPGADGRPE 
DVLPGDWDVR TAFMEYGGRP WVFLGTGGAV FVHWSDQRVY RWTPEAAVRP LSPRSDRYRY
CDFAVRGDEV WCVRETTGGE VRRDLVALPL DGSARIRVLA ATHDFLSGPR ISPDGGRVAW
LGWNHPDMPW TRTAVMVANV DPDGSLVGLR RLATGADESV TQIEWTSDGA ALLVVSDRSG
WWNVHEVSGD GRWRARCPRA EEFGEALWRI GASTCAALTG GGLAAAHGTG VRRLGLCDAD
GGLVDVDDGF TDWRSVVSDG RRVAAVAAGP RRSRSVVLVE QGCTRVLWSS PGALASYASV
PMLRTYQGVH AHVYEPHHPG YAGPPGEPPP YIVQAHGGPT SRGVPVADAV TTYFTSRGIG
VVDVQYGGST GYGRAYRDRL RHRWGEVDAR DCATVARGLV AEGRADPSRI ALRGASAGGW
TALRSLIDDP DLYQAAVVYF PVLDARSWAK STHDFESRYA EWLIGPWPQE RGRYESRSPA
AAVERIRTPL LLMQGARDAI CVPEQADQFA RYLASISVPI RYLRFHAEAH GFRQADTVAR
CLNAELDLYA KALRFPLPVE ARA
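
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code (the tables differ in the start codons they permit), and it does not handle alternative start codons. The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (shared by tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 21 bases and last 12 bases of the gene sequence above.
print(translate("ATGCCGATCC AGGCCCCGTA C"))  # MPIQAPY
print(translate("GCGCGAGCAT GA"))            # ARA
```

The two outputs match the first seven and last three residues of the protein sequence; passing the full gene sequence to `translate` and `gc_content` would allow the whole record to be checked the same way.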