Gene Sare_1960 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1960 
Symbol 
ID5705207 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2253979 
End bp2255811 
Gene Length1833 bp 
Protein Length610 aa 
Translation table11 
GC content66% 
IMG OID641271465 
Productpeptidase S9 prolyl oligopeptidase 
Protein accessionYP_001536836 
Protein GI159037583 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.317631 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.295208 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGGCTGG GCGACAGAAT CGCTGAAGTG TCGTTGGACG ACGCCGTTCG CGCCTCCCTT 
AGCTATGACC AGCTACGTGC CGGTGATGAC GGCCTGTACT GGCTGGAGAC TCACCCCGAA
GGCGGGGGTA CTACGGTAAC GCGTTGGCGG TCCGAGGAGG GCAAGCGTGT TGTCAGTCCG
GCGGGCTTCG ATGTCACCAG TATTGTTCAC GCGTACGGGG GTGGATCGTT CGCTGTCGTC
GAAGGGACTC TGTGGTGTGT CGGCGGAGAC GGCCTCTATC GGAGCAGCCT TCACCGCGAT
GAACCCGAAA TGGTCAGCGG TGGATGCTTC GGCGACCTGA CGATCAGCAA CGGTGAGTTG
CTGGCTGTGC GGGAGTCGGG GCAGGGTGAC GAACTCGTCG CGGCGTCGTC TTCCGGCCCT
CCGCAGGTGC GGACTCTTGT CTCGTCGCCA GGGTTTCTCG GTGCGCCACG GCCAAGTCCG
GGATTGCTGG CGTGGACCGC GTGGAGTGAG CGGGACATGC CCTGGGACGC CTGCGAGGTG
TGGGTGGCGT CCTACTCGCC GCGGGGATCG GTCGAAGGCG CAGTGAGACT AGCCGGCGGT
CCGGAGGAGT CTGCGGTGGA ACCGCAGTGG GGACCCGACG GGGCGTTGTA TTTCGTGTCG
GACCGTAACG GCTGGTGGAA CCTCTACCGA TGGGACGGCC ACCGGGTGGA GGCGGTGGCA
CCGCTTGCGG GCGAGTGCGC CGCAGATCCG TGGGAACTGG GCTACGCGTC GTATGGGTTC
CTTGACGGCG GTCGGATCGT CATAGCGGTG CAGGAGGGGC CGCGACACCG CCTCGTCGTC
GTCGAGGCTA GCGGCTCTGT TCACCCGGTC GATCTGCCGT ACACGTCGAT CAAGCCGTAC
CTAGCTGTTC AAGGGACGAC GGTGGCATTG ATCGGATCCT CACCGACCGT AGCCCCGCAG
GTGGCGCTCG TGGACTTGGC TGACATCGTC CCCCAAGTGG TGGTGCTTGC CCGCTCGGAA
CACGGCGCAC TCGACGGGGC AAGCGTTTCC ACGCCGACGG AGTTACGTGT TCGAGTGGCC
AGTGGTCGGG AGGTCCTTGC CCTGGTGTAT CCACCGACGA GCTCGACGAC GGATTGGCAG
GCGCCAGTGA TCGTGCGGGC ACATCCTGGG CCCACCGACT CCTGCTTGCT ACGGCTGGAT
TGGCAGGCGC AGTTCTTCAC CAGCCGCGGG TTCGCCGTCG TTGATGTCGA CTACCTGGGT
AGCACCGGGT ACGGCCGGAT GTTCCGGGAA TCGCTCTACG GTCGGTGGGG CCTGGACGAT
GTCGATGACT GCGCCGCGGT GGCGGATCAC CTGCTATCGA CGGGACGGGC GCTGCCCGGA
CAGGTATTCA TCCGCGGTGC CAGCGCAGGC GGATACACCG CGTTGCAGGC GGTGGCACAG
GACACCCCGT TCGCCGCCGC CACCGCCGTG TCCGCGATTG TGGATCCTGA CCGGTGGGCG
GAAACGGTAC CCCGATTCCA GCGACCGCAC GCGATGCGGC TGCGCGGGGG CGCCGGCCCG
GTCCGTGCCG CCGCAATCCA ACGGCCGGTG CTCCTCATTC ACGGCACCGC AGATGAAGTA
GCCGTAGCCG AGGATATTCG CGAACTCGCC GACGAGTTGA CATCTGCAGA TAGAGCGGCT
GGGCTTTTGC TTCTTCCTGA GGTCGGCCAC TATGTGGCGT CATCCCATCG AGCAGGCGCG
GCGCTGAAGG CCGAGCTGGC TCACTACCGC TCGGTGATGG TTGATGGAGC AACGGTCAGC
GGGGGCTACA CCGCTGCCAA CGGTAGCCGG TGA
 
Protein sequence
MRLGDRIAEV SLDDAVRASL SYDQLRAGDD GLYWLETHPE GGGTTVTRWR SEEGKRVVSP 
AGFDVTSIVH AYGGGSFAVV EGTLWCVGGD GLYRSSLHRD EPEMVSGGCF GDLTISNGEL
LAVRESGQGD ELVAASSSGP PQVRTLVSSP GFLGAPRPSP GLLAWTAWSE RDMPWDACEV
WVASYSPRGS VEGAVRLAGG PEESAVEPQW GPDGALYFVS DRNGWWNLYR WDGHRVEAVA
PLAGECAADP WELGYASYGF LDGGRIVIAV QEGPRHRLVV VEASGSVHPV DLPYTSIKPY
LAVQGTTVAL IGSSPTVAPQ VALVDLADIV PQVVVLARSE HGALDGASVS TPTELRVRVA
SGREVLALVY PPTSSTTDWQ APVIVRAHPG PTDSCLLRLD WQAQFFTSRG FAVVDVDYLG
STGYGRMFRE SLYGRWGLDD VDDCAAVADH LLSTGRALPG QVFIRGASAG GYTALQAVAQ
DTPFAAATAV SAIVDPDRWA ETVPRFQRPH AMRLRGGAGP VRAAAIQRPV LLIHGTADEV
AVAEDIRELA DELTSADRAA GLLLLPEVGH YVASSHRAGA ALKAELAHYR SVMVDGATVS
GGYTAANGSR