Gene Sala_1978 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1978 
Symbol 
ID4081022 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2085053 
End bp2087212 
Gene Length2160 bp 
Protein Length719 aa 
Translation table11 
GC content65% 
IMG OID638010354 
Productprolyl oligopeptidase 
Protein accessionYP_617022 
Protein GI103487461 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1505] Serine proteases of the peptidase family S9A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.370537 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCCA AACGCCTGTC CGCGCCGCTC GCGCTGGTCG CCCTTGCCTT GACCCCCACC 
GCAGCACAGG CGGCCGCGGC CGCGGCCGCA TCGGCGCCCG CCGCCGCGCT CGCCTATCCC
GACACGGCGC GCGGCGATAC GGTCGATCCG CAGTTCGGCG TCGACGTCGC CGACCCCTAT
CGCTGGCTGG AGGACGACGT CCGCGTCAAT CCGGAGGTTG CGGCGTGGGT CGAAGCGCAG
AACAGGGTGA CCGACGCCTA TCTCGACACG CTGCCCGGTC GCGACGCCTT CCGGGCGCGG
ATGACTGAGC TGTACGATTA TGAACGCTTC GGCCTGCCGA CCAAGGCGGG CGCGCGCTAT
TTCTACACGC GCAACGACGG GCTCCAGCCG CAGTCGGTGC TCTATGTCCG CGAAGGGTTG
AAAGGCGAGG GCCGCGTGCT CATCGACCCC AATCTGTGGG CCAGGGACGG TGCGACCGCG
CTCGCCGAAT GGGAACCGTC GGAGGATGGC AAATATCTTC TCTATGCGGT GCAGGACGGC
GGCACCGACT GGCGCATCGT GCGCGTCAAG GATGTCGCGA CGGGGCAGGA CCTGCCCGAC
GAGGTGCGCT GGGTGAAGTT TTCGGCGCTC GACTGGGCAA AGGACGGCAG CGGCTTTTAC
TATTCGCGCT TCCCGGAGCC AAAGGAGGGC GAAGCCTTCC AGTCGCTCAA CGAAAATCAC
GCCGTCTATT TCCACCGCCT CGGCACGCCG CAAAGCGCCG ATGTGCTGAT CCACGCGACG
CCCGACAAGC CCAAGCTCAA CAACAGCGCA CTCGTCACCG ACGATGGCGA CTATCTGCTT
GTCGTCTCGT CCGAAGGGAC CGACGAACGC TATGGCCTGA CGCTGCATCC GCTCGGCAGG
CCGGGGGCGA AGCCGATCGT CCTTGTCGAC GATTATGCGA ACAACTGGGA ATATGTGACC
AACGCGGGAA CGCGCTTCAC TTTCCTCACC AACAAGGGCG CGCCGCGCGG CCGCCTCGTT
TCGTTCGACA TCCGCAAGCC GGACAAACTC ACCGAACTCG TCGCCGAAAA CCCCGCCACG
CTCGTCGGCG CCTCGCGCGT CGGCGACCGC ATCATCCTCT CCTATCTTGG CGACGCCAAG
TCGGAAGCGC GCATGGTCGC ACTGAACGGC GAGCCGATCG CGAACATCAA CCTCGCCGAC
ATCGGCGCGG CGTCGGGGTT CGGCGGCAAG TCGAGCGACC CCGAAACCTT CTATGCCTTT
TCCAGCTTTG CGCGGCCGAC GACCATCTAT CGCTTCGACA CCGAAACCGG AAATAGCGAG
ATTTTCGCCG AACCCAGGCT GACCTTCAAC CCTGCCGATT TCAGCGTCGA GCAACGCTTC
TATAAATCAA AGGACGGCAC CGAAGTGCCG ATGTTCCTCG TGATGAAAAA GGGCCTCGAC
CGCAGCAAGG GCTCGCCGAC GCTGCTTTAC GGCTATGGCG GCTTCAACGT CTCGCTGACC
CCAGGCTTTT CGCCGACGCG GCTCGCGTGG GTCGACAAGG GCGGCGTGCT CGCGATCGCG
AACCTGCGGG GCGGCGGCGA ATATGGCAAG GCGTGGCACG ACGCCGGCCG CCTTGCGAAC
AAGCAGAATG TCTTCGACGA TTTCATCGCC GCGGGCGAAT ATCTGATCGC CGAGGGCATC
ACCGGCAAGG GTCAGCTTGC GATCGAGGGC GGATCGAACG GCGGCCTGCT CGTCGGCGCC
GTCACCAACC AGCGCCCCGA CCTGTTCGCC GCGGCGCTGC CTGCGGTCGG CGTGATGGAC
ATGCTGCGCT TCGACCGCTT CACTGCGGGT CGTTACTGGG TCGACGATTA TGGCTATCCG
TCGAAGGAGG CCGATTTCCG GAACCTGCTC AGCTATTCGC CCTACCACAA TATCCGCAGC
GGCGTGGCCT ATCCGGCGGT GCTGGTGACG ACCGCCGACA CCGACGACCG CGTCGTGCCG
GGGCACAGTT TCAAATATAC CGCCGCGCTC CAGCACGCGA AGGCGGGCAG CAAGCCGCAC
CTCATCCGCA TCGAAACGCG CGCGGGCCAT GGCAGCGGCA AGCCGACCGA CAAGATCATC
GCCGAGGCCG CCGACAAATA TGCCTTTGCG GCGAAATGGA CCGGGCTGGA CGTCGAATAG
 
Protein sequence
MPAKRLSAPL ALVALALTPT AAQAAAAAAA SAPAAALAYP DTARGDTVDP QFGVDVADPY 
RWLEDDVRVN PEVAAWVEAQ NRVTDAYLDT LPGRDAFRAR MTELYDYERF GLPTKAGARY
FYTRNDGLQP QSVLYVREGL KGEGRVLIDP NLWARDGATA LAEWEPSEDG KYLLYAVQDG
GTDWRIVRVK DVATGQDLPD EVRWVKFSAL DWAKDGSGFY YSRFPEPKEG EAFQSLNENH
AVYFHRLGTP QSADVLIHAT PDKPKLNNSA LVTDDGDYLL VVSSEGTDER YGLTLHPLGR
PGAKPIVLVD DYANNWEYVT NAGTRFTFLT NKGAPRGRLV SFDIRKPDKL TELVAENPAT
LVGASRVGDR IILSYLGDAK SEARMVALNG EPIANINLAD IGAASGFGGK SSDPETFYAF
SSFARPTTIY RFDTETGNSE IFAEPRLTFN PADFSVEQRF YKSKDGTEVP MFLVMKKGLD
RSKGSPTLLY GYGGFNVSLT PGFSPTRLAW VDKGGVLAIA NLRGGGEYGK AWHDAGRLAN
KQNVFDDFIA AGEYLIAEGI TGKGQLAIEG GSNGGLLVGA VTNQRPDLFA AALPAVGVMD
MLRFDRFTAG RYWVDDYGYP SKEADFRNLL SYSPYHNIRS GVAYPAVLVT TADTDDRVVP
GHSFKYTAAL QHAKAGSKPH LIRIETRAGH GSGKPTDKII AEAADKYAFA AKWTGLDVE