Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sala_1978 |
Symbol | |
ID | 4081022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingopyxis alaskensis RB2256 |
Kingdom | Bacteria |
Replicon accession | NC_008048 |
Strand | + |
Start bp | 2085053 |
End bp | 2087212 |
Gene Length | 2160 bp |
Protein Length | 719 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638010354 |
Product | prolyl oligopeptidase |
Protein accession | YP_617022 |
Protein GI | 103487461 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1505] Serine proteases of the peptidase family S9A |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.370537 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGCCA AACGCCTGTC CGCGCCGCTC GCGCTGGTCG CCCTTGCCTT GACCCCCACC GCAGCACAGG CGGCCGCGGC CGCGGCCGCA TCGGCGCCCG CCGCCGCGCT CGCCTATCCC GACACGGCGC GCGGCGATAC GGTCGATCCG CAGTTCGGCG TCGACGTCGC CGACCCCTAT CGCTGGCTGG AGGACGACGT CCGCGTCAAT CCGGAGGTTG CGGCGTGGGT CGAAGCGCAG AACAGGGTGA CCGACGCCTA TCTCGACACG CTGCCCGGTC GCGACGCCTT CCGGGCGCGG ATGACTGAGC TGTACGATTA TGAACGCTTC GGCCTGCCGA CCAAGGCGGG CGCGCGCTAT TTCTACACGC GCAACGACGG GCTCCAGCCG CAGTCGGTGC TCTATGTCCG CGAAGGGTTG AAAGGCGAGG GCCGCGTGCT CATCGACCCC AATCTGTGGG CCAGGGACGG TGCGACCGCG CTCGCCGAAT GGGAACCGTC GGAGGATGGC AAATATCTTC TCTATGCGGT GCAGGACGGC GGCACCGACT GGCGCATCGT GCGCGTCAAG GATGTCGCGA CGGGGCAGGA CCTGCCCGAC GAGGTGCGCT GGGTGAAGTT TTCGGCGCTC GACTGGGCAA AGGACGGCAG CGGCTTTTAC TATTCGCGCT TCCCGGAGCC AAAGGAGGGC GAAGCCTTCC AGTCGCTCAA CGAAAATCAC GCCGTCTATT TCCACCGCCT CGGCACGCCG CAAAGCGCCG ATGTGCTGAT CCACGCGACG CCCGACAAGC CCAAGCTCAA CAACAGCGCA CTCGTCACCG ACGATGGCGA CTATCTGCTT GTCGTCTCGT CCGAAGGGAC CGACGAACGC TATGGCCTGA CGCTGCATCC GCTCGGCAGG CCGGGGGCGA AGCCGATCGT CCTTGTCGAC GATTATGCGA ACAACTGGGA ATATGTGACC AACGCGGGAA CGCGCTTCAC TTTCCTCACC AACAAGGGCG CGCCGCGCGG CCGCCTCGTT TCGTTCGACA TCCGCAAGCC GGACAAACTC ACCGAACTCG TCGCCGAAAA CCCCGCCACG CTCGTCGGCG CCTCGCGCGT CGGCGACCGC ATCATCCTCT CCTATCTTGG CGACGCCAAG TCGGAAGCGC GCATGGTCGC ACTGAACGGC GAGCCGATCG CGAACATCAA CCTCGCCGAC ATCGGCGCGG CGTCGGGGTT CGGCGGCAAG TCGAGCGACC CCGAAACCTT CTATGCCTTT TCCAGCTTTG CGCGGCCGAC GACCATCTAT CGCTTCGACA CCGAAACCGG AAATAGCGAG ATTTTCGCCG AACCCAGGCT GACCTTCAAC CCTGCCGATT TCAGCGTCGA GCAACGCTTC TATAAATCAA AGGACGGCAC CGAAGTGCCG ATGTTCCTCG TGATGAAAAA GGGCCTCGAC CGCAGCAAGG GCTCGCCGAC GCTGCTTTAC GGCTATGGCG GCTTCAACGT CTCGCTGACC CCAGGCTTTT CGCCGACGCG GCTCGCGTGG GTCGACAAGG GCGGCGTGCT CGCGATCGCG AACCTGCGGG GCGGCGGCGA ATATGGCAAG GCGTGGCACG ACGCCGGCCG CCTTGCGAAC AAGCAGAATG TCTTCGACGA TTTCATCGCC GCGGGCGAAT ATCTGATCGC CGAGGGCATC ACCGGCAAGG GTCAGCTTGC GATCGAGGGC GGATCGAACG GCGGCCTGCT CGTCGGCGCC GTCACCAACC AGCGCCCCGA CCTGTTCGCC GCGGCGCTGC CTGCGGTCGG CGTGATGGAC ATGCTGCGCT TCGACCGCTT CACTGCGGGT CGTTACTGGG TCGACGATTA TGGCTATCCG TCGAAGGAGG CCGATTTCCG GAACCTGCTC AGCTATTCGC CCTACCACAA TATCCGCAGC GGCGTGGCCT ATCCGGCGGT GCTGGTGACG ACCGCCGACA CCGACGACCG CGTCGTGCCG GGGCACAGTT TCAAATATAC CGCCGCGCTC CAGCACGCGA AGGCGGGCAG CAAGCCGCAC CTCATCCGCA TCGAAACGCG CGCGGGCCAT GGCAGCGGCA AGCCGACCGA CAAGATCATC GCCGAGGCCG CCGACAAATA TGCCTTTGCG GCGAAATGGA CCGGGCTGGA CGTCGAATAG
|
Protein sequence | MPAKRLSAPL ALVALALTPT AAQAAAAAAA SAPAAALAYP DTARGDTVDP QFGVDVADPY RWLEDDVRVN PEVAAWVEAQ NRVTDAYLDT LPGRDAFRAR MTELYDYERF GLPTKAGARY FYTRNDGLQP QSVLYVREGL KGEGRVLIDP NLWARDGATA LAEWEPSEDG KYLLYAVQDG GTDWRIVRVK DVATGQDLPD EVRWVKFSAL DWAKDGSGFY YSRFPEPKEG EAFQSLNENH AVYFHRLGTP QSADVLIHAT PDKPKLNNSA LVTDDGDYLL VVSSEGTDER YGLTLHPLGR PGAKPIVLVD DYANNWEYVT NAGTRFTFLT NKGAPRGRLV SFDIRKPDKL TELVAENPAT LVGASRVGDR IILSYLGDAK SEARMVALNG EPIANINLAD IGAASGFGGK SSDPETFYAF SSFARPTTIY RFDTETGNSE IFAEPRLTFN PADFSVEQRF YKSKDGTEVP MFLVMKKGLD RSKGSPTLLY GYGGFNVSLT PGFSPTRLAW VDKGGVLAIA NLRGGGEYGK AWHDAGRLAN KQNVFDDFIA AGEYLIAEGI TGKGQLAIEG GSNGGLLVGA VTNQRPDLFA AALPAVGVMD MLRFDRFTAG RYWVDDYGYP SKEADFRNLL SYSPYHNIRS GVAYPAVLVT TADTDDRVVP GHSFKYTAAL QHAKAGSKPH LIRIETRAGH GSGKPTDKII AEAADKYAFA AKWTGLDVE
|
| |